There’s now a Python library for RLHF (reinforcement learning from human feedback) called trlX!
(RLHF is the reinforcement learning strategy used in training ChatGPT.)
It works well with Hugging Face models, supports multiple RL strategies, and requires very little code!
Check out the repo here: https://t.co/qFUw5bf82r
Thanks to the wonderful folks at CarperAI!
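
To give a rough sense of the "very little code" claim, here is a minimal sketch. It assumes trlX is installed and that its `trlx.train` entry point takes a Hugging Face model name plus a `reward_fn` callback, as shown in the project's README; the `gpt2` model, the `count_cats_reward` function, and the `run_training` wrapper are illustrative names chosen here, not part of the announcement.

```python
from typing import List


def count_cats_reward(samples: List[str], **kwargs) -> List[float]:
    """Toy reward (illustrative): score each generated text by how often it says 'cats'."""
    return [float(sample.count("cats")) for sample in samples]


def run_training():
    """Hypothetical wrapper: start RL fine-tuning with the toy reward above.

    Assumes `pip install trlx` has been done and that enough compute is
    available; the import lives inside the function so the reward function
    can be tried on its own without trlX installed.
    """
    import trlx  # assumption: the library's import name is `trlx`

    # Assumption: the model is named first and the reward callback is
    # passed as `reward_fn`, following the README one-liner.
    return trlx.train("gpt2", reward_fn=count_cats_reward)
```

The reward function can be checked on plain strings first, for example `count_cats_reward(["cats and cats", "dogs"])`, before calling `run_training()` on real hardware.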