In this repo I shared about how to implement simple RLHF using GPT. This repo is a code from a medium post here: https://medium.com/p/5fc5ae16da40 (feel free to read and a clap is appreciated, lol)
ardyadipta/exploring_rlhf
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|