ReFT: Reasoning with Reinforced Fine-Tuning

Luong Trung | Xinbo Zhang | Zhanming Jie | Peng Sun | Xiaoran Jin | Hang Li |

Paper Details:

Month: August
Year: 2024
Location: Bangkok, Thailand
Venue: ACL |