Skip to content

Pull requests: NVIDIA-NeMo/RL

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat: add llama3.3 nemotron super 49b recipes CI:L1 Run doctests, unit tests, and functional tests documentation Improvements or additions to documentation
#1506 opened Nov 11, 2025 by yuki-97 Draft
build: Use dynamic engine for generate.
#1502 opened Nov 11, 2025 by shanmugamr1992 Loading…
4 tasks
feat: pipeline-rl style # of inflight prompt regulation documentation Improvements or additions to documentation
#1499 opened Nov 10, 2025 by youngeunkwon0405 Loading…
4 tasks
fix: patch python path to include transformers_modules in __init__ CI:L0 Run doctests and unit tests
#1492 opened Nov 10, 2025 by hemildesai Loading…
1 of 4 tasks
feat: allow uv-less execution and fingerprint the environment CI:L1 Run doctests, unit tests, and functional tests CI Relating to CI documentation Improvements or additions to documentation
#1491 opened Nov 9, 2025 by terrykong Draft
fix: Megatron static inference and adapt to mcore engine API changes CI:L1 Run doctests, unit tests, and functional tests r0.4.0
#1488 opened Nov 7, 2025 by shanmugamr1992 Loading…
4 tasks
fix: fixing the sequence parallel related issue in mcore path bug Something isn't working CI:L1 Run doctests, unit tests, and functional tests
#1487 opened Nov 7, 2025 by youngeunkwon0405 Loading…
4 tasks
feat: Add AceMathRL recipe
#1484 opened Nov 6, 2025 by ffrujeri Draft
4 tasks
feat: fp16 for DTensor policies
#1474 opened Nov 5, 2025 by adil-a Loading…
Mmanohara/merge grpo helpsteer cp tp community-request
#1472 opened Nov 4, 2025 by nv-mmanohara Loading…
4 tasks
feat: DTensorPolicyV2 GPT-OSS support CI:L0 Run doctests and unit tests
#1470 opened Nov 4, 2025 by adil-a Loading…
build: Ensure automodel has deepep and TE
#1456 opened Oct 31, 2025 by chtruong814 Loading…
4 tasks
feat: Random dataset with specified input and output sequence length CI:L0 Run doctests and unit tests
#1453 opened Oct 31, 2025 by guyueh1 Loading…
4 tasks
feat: Add GPT-OSS support via mcore
#1452 opened Oct 31, 2025 by ashors1 Draft
4 tasks
feat: Fp8 moe rollout CI:L0 Run doctests and unit tests documentation Improvements or additions to documentation
#1446 opened Oct 29, 2025 by guyueh1 Loading…
4 tasks
feat: enhance advantages tracking and normalization stability in GRPO CI:L1 Run doctests, unit tests, and functional tests r0.4.0
#1423 opened Oct 24, 2025 by ffrujeri Loading…
fix: add theoretical TFlops for H200 GPU CI:L0 Run doctests and unit tests
#1422 opened Oct 24, 2025 by roclark Loading…
4 tasks done
DRAFT: feat: Enable simulated user for multi-turn GRPO
#1412 opened Oct 22, 2025 by ahmadki Loading…
4 tasks
feat: Add support for IPO and RPO algorithm community-request documentation Improvements or additions to documentation
#1388 opened Oct 17, 2025 by sanjana-inflection Loading…
1 of 4 tasks
feat: additional validation losses for preference data documentation Improvements or additions to documentation
#1367 opened Oct 15, 2025 by jveronvialard Draft
4 tasks
feat: GSPO-token
#1357 opened Oct 14, 2025 by pjin-nvidia Draft
4 tasks
ProTip! Add no:assignee to see everything that’s not assigned.