trlX: A Framework for Large Scale Reinforcement Learning from Human Feedback

Alexander Havrilla | Maksym Zhuravinskyi | Duy Phung | Aman Tiwari | Jonathan Tow | Stella Biderman | Quentin Anthony | Louis Castricato

Paper Details:

Month: December
Year: 2023
Location: Singapore
Venue: EMNLP