-
Notifications
You must be signed in to change notification settings - Fork 293
Pull requests: NovaSky-AI/SkyRL
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[WIP] Add OpenReward environment integration example
#1458
opened Apr 4, 2026 by
tyfeng1997
•
Draft
2 of 4 tasks
feat: LLM-synthesized hints for failed trajectories
#1456
opened Apr 4, 2026 by
dzorlu
Loading…
4 tasks
[skyrl-train] feat: add native GMPO policy loss with validation and tests
#1449
opened Apr 2, 2026 by
taivu1998
Loading…
Fix event-loop blocking in one-step-off async save/export paths
#1446
opened Apr 2, 2026 by
taivu1998
Loading…
Change default KL estimator from k3 to k2 for loss-based KL
#1445
opened Apr 2, 2026 by
taivu1998
Loading…
[skyrl-train] Add trainer-side max_response_length for Dr. GRPO normalization and DAPO overlong handling
#1440
opened Apr 2, 2026 by
taivu1998
Loading…
[megatron] support qwen3.5 models for megatron, bump mbridge + megatron-core to latest
#1425
opened Apr 1, 2026 by
erictang000
Loading…
[WIP][tx] Add initial implementation of RayJaxBackend
#1418
opened Mar 31, 2026 by
andrewsykim
•
Draft
[SkyRL][train] Support prompt_logprobs in /sample in the new inference stack
#1417
opened Mar 31, 2026 by
nithinvc
Loading…
6 tasks done
[train] Add Virtual Pipeline Parallelism support to Megatron
#1400
opened Mar 27, 2026 by
tamoghnokandar
Loading…
[tinker] Support PPO loss with Tinker and add critic model in SkyRLTrainBackend
#1389
opened Mar 25, 2026 by
tamoghnokandar
Loading…
4 tasks done
log all trajectory repetitions per training step (not just first)
#1388
opened Mar 25, 2026 by
arteemg
Loading…
[train] Prefix-aware merge for step-wise trajectories (#1277)
#1377
opened Mar 24, 2026 by
deepsheth3
Loading…
Ulysses position_ids pre-gather, NUMA rewrite, and operational improvements
#1371
opened Mar 23, 2026 by
ashutoshuiuc
Loading…
Add Anthropic Messages API (/v1/messages) endpoint
#1369
opened Mar 23, 2026 by
ashutoshuiuc
Loading…
Improved LoRA weight swap and robust transitions_to_training_data
#1368
opened Mar 23, 2026 by
ashutoshuiuc
Loading…
Previous Next
ProTip!
Exclude everything labeled
bug with -label:bug.