fix: correct role of the beta hyperparameter on the DPO loss #818

Merged
rasbt merged 1 commit into rasbt:main from andreas-yin:patch-1 on Sep 13, 2025

Commits

Commits on Sep 13, 2025