-
Notifications
You must be signed in to change notification settings - Fork 166
NVIDIA-NeMo RL Discussions
Sort by:
Latest activity
Categories, most helpful, and community links
Categories
Community links
Discussions
-
You must be logged in to vote 📣 On‑Policy Distillation for LLMs in NVIDIA NeMo-RL
publishedPublished discussions -
You must be logged in to vote 📣 -
You must be logged in to vote 📣 NeMo-RL V0.3: Scalable and Performant Post-training with Nemo-RL via Megatron-Core
publishedPublished discussions -
You must be logged in to vote 📣 NeMo-RL: Journey of Optimizing Weight Transfer in Large MoE Models by 10x
publishedPublished discussions -
You must be logged in to vote 📣 -
You must be logged in to vote 📣 -
You must be logged in to vote 📣 -
You must be logged in to vote 💡 -
You must be logged in to vote 💬 motivation for HF datasets?
questionFurther information is requested -
You must be logged in to vote 🔬 -
You must be logged in to vote 🗳️