Offline Reinforcement Learning from Human Feedback in Real-World Sequence-to-Sequence Tasks

Julia Kreutzer | Stefan Riezler | Carolin Lawrence |

Paper Details:

Month: August
Year: 2021
Location: Online
Venue: ACL | IJCNLP | spnlp |

Citations

URL

No Citations Yet

Field Of Study