[trainer,hparams,docs] feat: add CRD (Centered Reward Distillation) algorithm#121
Open
yuanzhi-zhu wants to merge 1 commit intoX-GenGroup:mainfrom
Open
[trainer,hparams,docs] feat: add CRD (Centered Reward Distillation) algorithm#121yuanzhi-zhu wants to merge 1 commit intoX-GenGroup:mainfrom
yuanzhi-zhu wants to merge 1 commit intoX-GenGroup:mainfrom