Conversation

@xmfan (Member) commented Nov 21, 2025

xmfan added a commit that referenced this pull request Nov 21, 2025
stack-info: PR: #260, branch: xmfan/stack/24
@meta-cla bot added the CLA Signed label Nov 21, 2025
xmfan added a commit that referenced this pull request Nov 21, 2025
stack-info: PR: #260, branch: xmfan/stack/24
@xmfan xmfan requested a review from fmassa November 21, 2025 01:57
xmfan added a commit that referenced this pull request Nov 21, 2025
stack-info: PR: #260, branch: xmfan/stack/24
@xmfan xmfan marked this pull request as draft November 21, 2025 02:17
xmfan added a commit that referenced this pull request Nov 21, 2025
stack-info: PR: #260, branch: xmfan/stack/24
xmfan added a commit to pytorch/torchtitan that referenced this pull request Nov 21, 2025
@xmfan xmfan requested review from bdhirsh and wconstab November 21, 2025 17:56
xmfan added a commit that referenced this pull request Nov 21, 2025
stack-info: PR: #260, branch: xmfan/stack/24
@xmfan xmfan marked this pull request as ready for review November 21, 2025 18:00
@fmassa (Contributor) left a comment

This generally LGTM and I was also thinking about doing something like that!

I wonder if we could (or should?) simplify/generalize the implementation to keep the original subclass information around as well?

Comment on lines 91 to 99
            ref_submod = getattr(ref_curr_mod, attr_name)
            if isinstance(ref_submod, torch.nn.ModuleDict):
                setattr(curr_mod, attr_name, torch.nn.ModuleDict())
            else:
                setattr(curr_mod, attr_name, torch.nn.Module())
        else:
            setattr(curr_mod, attr_name, torch.nn.Module())
    else:
        setattr(curr_mod, attr_name, torch.nn.Module())
@fmassa (Contributor) Dec 1, 2025

I wonder if we would want to keep the whole original class structure around (maybe with an nn.Module subclass indicating that the class has been AutoParallelized).
Something like:

cls = type(ref_submod)
new_inst = ref_submod.__new__(cls)
new_inst.__dict__ = ref_submod.__dict__.copy()
setattr(curr_mod, attr_name, new_inst)

or if we want a subclass

cls = type(ref_submod)
new_cls = type(f"AutoP[{cls.__name__}]", (cls,), ref_submod.__dict__.copy())
new_inst = new_cls.__new__(new_cls)
new_inst.__dict__ = ref_submod.__dict__.copy()
setattr(curr_mod, attr_name, new_inst)

(but we need to cache those new classes to avoid creating too many redundant classes maybe?)
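
A minimal sketch of such a cache (the _autop_cls_cache dict and make_autop_class helper are hypothetical names, not from this PR) could look like:

_autop_cls_cache: dict[type, type] = {}

def make_autop_class(cls: type) -> type:
    # Reuse the dynamically created "AutoP[...]" subclass so repeated
    # submodules of the same type share a single class object.
    if cls not in _autop_cls_cache:
        _autop_cls_cache[cls] = type(f"AutoP[{cls.__name__}]", (cls,), {})
    return _autop_cls_cache[cls]

# usage, following the snippet above:
# new_cls = make_autop_class(type(ref_submod))
# new_inst = new_cls.__new__(new_cls)
# new_inst.__dict__ = ref_submod.__dict__.copy()

Note that the cached class uses an empty namespace here; the per-instance state still comes from copying ref_submod.__dict__ onto the new instance.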

@xmfan (Member, Author)

I can give it a try

xmfan added a commit that referenced this pull request
stack-info: PR: #260, branch: xmfan/stack/24
try:
    cls = type(ref_submod)
    new_inst = ref_submod.__new__(cls)
    new_inst.__dict__ = ref_submod.__dict__.copy()
@xmfan (Member, Author) Dec 9, 2025

This works for nn.Module subclasses without duplicating memory: all params/buffers live in the containers stored under __dict__ (_parameters, _buffers, _modules), and the dict is shallow-copied, so the new instance shares them.
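
A standalone check of that claim (using torch.nn.Linear as a stand-in module; not code from this PR):

import torch

ref_submod = torch.nn.Linear(4, 4)
cls = type(ref_submod)
new_inst = ref_submod.__new__(cls)
new_inst.__dict__ = ref_submod.__dict__.copy()

# The shallow copy shares the _parameters/_buffers/_modules containers,
# so the parameter tensors are the same objects and no storage is duplicated.
assert new_inst._parameters is ref_submod._parameters
assert new_inst.weight is ref_submod.weight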

@fmassa (Contributor) left a comment

LGTM, thanks!

@xmfan xmfan merged commit 4f9d4a4 into main Dec 10, 2025
4 of 6 checks passed
@fmassa fmassa deleted the xmfan/stack/24 branch December 11, 2025 10:27