
Conversation

felipemello1 (Contributor) commented Dec 9, 2025:

We currently have 3 places where compile is configured.

This PR creates a single flag at the top of the config and sets it to true by default, which helps with memory and tok/s.

At the current batch size / sequence length the difference in speed and memory is not huge, but as models and sequence lengths grow, it becomes more relevant.

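A minimal sketch of how a single top-level flag might gate compilation of the trainer and reference model (hypothetical config schema and wiring; the actual torchforge code may differ):

```python
import torch

def maybe_compile(model: torch.nn.Module, cfg: dict) -> torch.nn.Module:
    # One top-level flag (default true) gates torch.compile everywhere.
    if cfg.get("compile", True):
        return torch.compile(model)
    return model

# e.g. trainer_model = maybe_compile(trainer_model, cfg)
#      ref_model = maybe_compile(ref_model, cfg)
```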

meta-cla bot added the CLA Signed label (managed by the Meta Open Source bot) on Dec 9, 2025
felipemello1 changed the title from "add compile flag to configs" to "easy - add compile flag to configs" on Dec 9, 2025
```yaml
model: "Qwen/Qwen3-4B"
off_by_n: 1 # Off by one by default
launcher: mast
compile: true # Enable torch.compile for trainer/ref_model, and CUDA graphs for vLLM
```
Contributor:

This file should have been removed?

felipemello1 (Author):

It's there: https://github.com/meta-pytorch/torchforge/blob/main/.meta/mast/qwen3_4b_mast.yaml

But I can delete it in this PR if you want.

Contributor:

Oof, why are there so many configs?
Yes, I missed it in https://github.com/meta-pytorch/torchforge/pull/632/files. Please just remove it.

```yaml
max_res_tokens: 2048
model: "Qwen/Qwen3-8B"
off_by_n: 1 # Off by one by default
compile: true # Enable torch.compile for trainer/ref_model, and CUDA graphs for vLLM
```
Contributor:

Why not enable it by default if you're updating all of the configs?

felipemello1 (Author):

What do you mean by "enabling it by default"? We still need to expose the flag because compile can be tricky in some setups. It also adds a bit of warmup time, so if someone is just quickly testing something, they may want to set it to false.

JenniferWang (Contributor) commented Dec 10, 2025:

I see. I was suggesting that to reduce the number of hyperparameters in the yaml config, because:

  1. We seem to want it enabled for production runs.
  2. This is a niche config (many things can slow down warmup time) that I don't expect people to remember to toggle in practice. We could default it to false when launching the job in local mode, or ONLY set it to true for large models.

Not a big deal.
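For context, the "CUDA graphs for vLLM" part of the flag's comment corresponds to vLLM's `enforce_eager` option, which disables CUDA graph capture when true. A hedged sketch of how the same config flag could be plumbed through (assuming vLLM's `LLM` entrypoint; torchforge's actual policy setup may differ):

```python
from vllm import LLM

def build_policy(cfg: dict) -> LLM:
    # compile=False -> enforce_eager=True, i.e. skip CUDA graph capture.
    return LLM(model=cfg["model"], enforce_eager=not cfg.get("compile", True))
```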

JenniferWang (Contributor):

Will enabling compile add to the job startup time? Is there usually instrumentation around that?

felipemello1 (Author) commented Dec 10, 2025:

> Will enabling compile add to the job startup time? Is there usually instrumentation around that?

Yes, the larger the model, the longer it takes: anywhere from a couple of seconds to about 60s. But in my experience it decreases peak memory by ~40% and increases tok/s by more than 20%.
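No instrumentation is shown in this PR, but the warmup cost is easy to measure yourself, since the first call through a `torch.compile`d module pays the tracing/compilation time. A minimal sketch (toy model and shapes for illustration):

```python
import time
import torch

model = torch.nn.Linear(1024, 1024)  # stand-in for the real trainer model
x = torch.randn(8, 1024)

compiled = torch.compile(model)
t0 = time.perf_counter()
compiled(x)  # first call triggers tracing + compilation
print(f"compile warmup took {time.perf_counter() - t0:.2f}s")
```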

felipemello1 merged commit 7b8580a into meta-pytorch:main on Dec 10, 2025
10 checks passed