NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Disentangling Length from Quality in Direct Preference Optimization
Ryan Park
|
Rafael Rafailov
|
Stefano Ermon
|
Chelsea Finn
|
Paper Details:
Month: August
Year: 2024
Location: Bangkok, Thailand and virtual meeting
Venue:
F |
i |
n |
d |
i |
n |
g |
s |
- |
A |
C |
L |
Citations
URL
No Citations Yet
https://github.com/eric-mitchell/direct-preference-
https://twitter.com/i/web/
https://github.com/lm-sys/FastChat/
https://github.com/eric-mitchell/direct-
https://huggingface.co/datasets/
Field Of Study