Reinforcement Learning for Bandit Neural Machine Translation with Simulated Human Feedback

Khanh Nguyen | Hal Daumé III | Jordan Boyd-Graber |

Paper Details:

Month: September
Year: 2017
Location: Copenhagen, Denmark
Venue: EMNLP |

Citations

URL