NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint
Zhipeng Chen
|
Kun Zhou
|
Xin Zhao
|
Junchen Wan
|
Fuzheng Zhang
|
Di Zhang
|
Ji-Rong Wen
|
Paper Details:
Month: August
Year: 2024
Location: Bangkok, Thailand and virtual meeting
Venue:
F |
i |
n |
d |
i |
n |
g |
s |
- |
A |
C |
L |
Citations
URL
No Citations Yet
No URLs Found
Field Of Study