NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Nano: Nested Human-in-the-Loop Reward Learning for Few-shot Language Model Control
Xiang Fan
|
Yiwei Lyu
|
Paul Pu Liang
|
Ruslan Salakhutdinov
|
Louis-Philippe Morency
|
Paper Details:
Month: July
Year: 2023
Location: Toronto, Canada
Venue:
F |
i |
n |
d |
i |
n |
g |
s |
- |
A |
C |
L |
Citations
URL
No Citations Yet
https://github.com/huggingface/
Field Of Study