Fine-Tuning Language Models with Reward Learning on Policy

Hao Lang | Fei Huang | Yongbin Li

Paper Details:

Month: June
Year: 2024
Location: Mexico City, Mexico
Venue: NAACL
