Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model

Zhiwei He | Xing Wang | Wenxiang Jiao | Zhuosheng Zhang | Rui Wang | Shuming Shi | Zhaopeng Tu |

Paper Details:

Month: June
Year: 2024
Location: Mexico City, Mexico
Venue: NAACL |

Citations

URL