add PyTorchModelHubMixin to TabFM by kashif · Pull Request #33 · google-research/tabfm

kashif · 2026-07-01T10:36:23Z

Makes TabFM extend PyTorchModelHubMixin so it gets from_pretrained,
save_pretrained, and push_to_hub for free.

load() now calls TabFM.from_pretrained(HF_REPO_ID, subfolder=model_type)
instead of snapshot_download + manual torch.load + load_state_dict
save_pretrained writes model.safetensors (preferred over .bin) plus a
proper config.json with all init params including is_classifier
_from_pretrained translates the legacy hub task: "classification" field to
is_classifier: bool so existing weights load without any config update
Removes the Config/ClassificationConfig/RegressionConfig dataclasses and
all the manual json/bin saving from convert_and_upload.py

Users can now also do:

from tabfm.src.pytorch.model import TabFM
model = TabFM.from_pretrained("google/tabfm-1.0.0-pytorch", subfolder="classification")
model.push_to_hub("my-org/my-tabfm-fork")

TabFM now extends PyTorchModelHubMixin giving it from_pretrained, save_pretrained, and push_to_hub. The load() helper uses TabFM.from_pretrained() instead of manual snapshot_download + torch.load. save_pretrained writes model.safetensors which is the preferred format. Remove redundant config dataclasses and manual json/bin saving.

erzel

Thanks for the contribution. Please see my comments below.

kashif · 2026-07-02T07:28:23Z

@erzel addressed all the review points: TabFM_HF now lives in tabfm_v1_0_0.py, subclasses TabFM and PyTorchModelHubMixin, and always delegates to the superclass _from_pretrained (only the config fetching differs by resolving to a local dir first). Duplicated config translation logic is merged into one helper, the pytype annotation is back, added a logging.warning when config.json is missing, and the local path check in load() raises immediately. Verified hub subfolder load, local dir load, and load() all work.

# Conflicts: # tabfm/src/pytorch/tabfm_v1_0_0.py

erzel

Thanks for making the changes.

kashif requested review from abhidas, erzel, rajatsen91, siriuz42, tamannarayan and weihaokong as code owners July 1, 2026 10:36

kashif added 2 commits July 1, 2026 12:42

fix license to other for non-commercial weights

61c5675

fix subfolder support in _from_pretrained

69f5e90

NielsRogge mentioned this pull request Jul 1, 2026

Improve Hugging Face integration #36

Closed

wrap long lines in _from_pretrained to fit 80 cols

5fb1e88

erzel requested changes Jul 1, 2026

View reviewed changes

move HF hub code from TabFM into TabFM_HF

81c2669

Merge remote-tracking branch 'origin/main' into add-pytorch-hub-mixin

ce1612d

# Conflicts: # tabfm/src/pytorch/tabfm_v1_0_0.py

erzel approved these changes Jul 2, 2026

View reviewed changes

erzel merged commit 5df7def into google-research:main Jul 2, 2026
7 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

add PyTorchModelHubMixin to TabFM#33

add PyTorchModelHubMixin to TabFM#33
erzel merged 6 commits into
google-research:mainfrom
kashif:add-pytorch-hub-mixin

kashif commented Jul 1, 2026

Uh oh!

erzel left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

kashif commented Jul 2, 2026

Uh oh!

erzel left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

kashif commented Jul 1, 2026

Uh oh!

erzel left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

kashif commented Jul 2, 2026

Uh oh!

erzel left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants