-
Notifications
You must be signed in to change notification settings - Fork 2.8k
Pull requests: huggingface/trl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Align KTO doc with DPO and fix Logged metrics wording
#6258
opened Jul 2, 2026 by
qgallouedec
Member
Loading…
Align KTO with DPO: Remove stray misplaced comment in KTO
_get_train_sampler
#6255
opened Jul 2, 2026 by
qgallouedec
Member
Loading…
Align KTO with DPO: Fix KTO
max_length docstring (truncation direction)
#6254
opened Jul 2, 2026 by
qgallouedec
Member
Loading…
Align KTO with DPO: Fix KTO
compute_metrics type hint (EvalLoopOutput → EvalPrediction)
#6253
opened Jul 2, 2026 by
qgallouedec
Member
Loading…
Raise a clear error when GKD student and teacher vocab sizes differ
#6252
opened Jul 2, 2026 by
sergiopaniego
Member
Loading…
2 tasks done
Fix teacher quantization kwargs and guard eval callback in GKD example
#6251
opened Jul 2, 2026 by
sergiopaniego
Member
Loading…
2 tasks done
implement message level rollout with linear trajectories
#6250
opened Jul 2, 2026 by
AmineDiro
Member
Loading…
Standardize
TrainerCallback import to the public top-level path
#6249
opened Jul 2, 2026 by
qgallouedec
Member
Loading…
Use trl's guarded
is_liger_kernel_available in DPOTrainer
#6247
opened Jul 2, 2026 by
qgallouedec
Member
Loading…
Align KTO with DPO: Align
F import with the rest of the repo
#6246
opened Jul 2, 2026 by
qgallouedec
Member
Loading…
Align KTO with DPO: Align Liger loss naming
#6244
opened Jul 2, 2026 by
albertvillanova
Member
Loading…
Fix activation offload storage dedupe reuse
#6241
opened Jul 2, 2026 by
winglian
Contributor
Loading…
8 tasks
Fix missing mm_token_type_ids when training new Qwen VLMs with liger kernel
#6234
opened Jul 1, 2026 by
apardyl
Contributor
Loading…
4 of 8 tasks
Align ORPO with DPO: support iterable and dict eval datasets
#6230
opened Jul 1, 2026 by
DaoyuanLi2816
Contributor
Loading…
Add ORPOTrainer tests to align coverage with DPO
#6229
opened Jul 1, 2026 by
DaoyuanLi2816
Contributor
Loading…
Fix vLLM server-mode generation in
OnlineDPOTrainer
#6228
opened Jun 30, 2026 by
qgallouedec
Member
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2026-06-29.