fix: correct role of the beta hyperparameter on the DPO loss #818
Merged
rasbt merged 1 commit into rasbt:main on Sep 13, 2025