refactor: AutoModel entry point #730

akoumpa · 2025-10-28T02:19:10Z

HF

from transformers import AutoModelForCausalLM
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-3.1-8B",
)

NeMo AutoModel

from nemo_automodel import NeMoAutoModelForCausalLM
from torch.distributed.device_mesh import init_device_mesh

mesh = init_device_mesh(
    "cuda",
    mesh_shape=(1, 1, 1, 1, 2),
    mesh_dim_names=("pp", "dp_replicate", "dp_shard", "cp", "tp"),
)

model = NeMoAutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-3.1-8B",
    device_mesh=mesh,
    distributed={"tp_size": 2, "cp_size": 1, "pp_size": 1, "dp_size": 2, "backend": "nccl"},
)

This PR extends the Auto API so that from_pretrained accepts two additional options, device_mesh and distributed. The goal is to provide a drop-in replacement for the HF AutoModelForCausalLM class that also supports distributed execution.
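
For readers unfamiliar with device meshes, here is a minimal sketch of how the mesh shape in the example maps to named parallel dimensions and a total device count. describe_mesh is a hypothetical helper written for illustration only; it is not part of nemo_automodel or PyTorch, and it only does the arithmetic without creating any process groups.

```python
from math import prod


# Hypothetical helper for illustration; not part of nemo_automodel or PyTorch.
def describe_mesh(mesh_shape, mesh_dim_names):
    """Map each mesh dimension name to its size and compute the device count."""
    if len(mesh_shape) != len(mesh_dim_names):
        raise ValueError("mesh_shape and mesh_dim_names must have the same length")
    sizes = dict(zip(mesh_dim_names, mesh_shape))
    # The number of devices a mesh spans is the product of its dimension sizes.
    return sizes, prod(mesh_shape)


# Same shape and names as the init_device_mesh call in the description.
sizes, world_size = describe_mesh(
    (1, 1, 1, 1, 2),
    ("pp", "dp_replicate", "dp_shard", "cp", "tp"),
)
print(sizes["tp"])  # 2
print(world_size)   # 2
```

Under this reading, the example mesh spans two devices, both along the tp dimension, so the job would presumably be launched with two processes.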

copy-pr-bot · 2025-10-28T02:19:14Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

Signed-off-by: Alexandros Koumparoulis <[email protected]>

akoumpa changed the title ~~Refactor: AutoModel entry point~~ refactor: AutoModel entry point Oct 28, 2025

akoumpa force-pushed the akoumparouli/refactor_auto_entrypoint branch from 56564c8 to 31f29b8 Compare October 30, 2025 05:51

akoumpa added 2 commits November 5, 2025 22:10

wip

de26acd

Signed-off-by: Alexandros Koumparoulis <[email protected]>

step

c375b42

Signed-off-by: Alexandros Koumparoulis <[email protected]>

akoumpa force-pushed the akoumparouli/refactor_auto_entrypoint branch from 32c3357 to c375b42 Compare November 6, 2025 06:10

baby steps

ff845a8

Signed-off-by: Alexandros Koumparoulis <[email protected]>
