NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
ALaRM: Align Language Models via Hierarchical Rewards Modeling
Yuhang Lai
|
Siyuan Wang
|
Shujun Liu
|
Xuanjing Huang
|
Zhongyu Wei
|
Paper Details:
Month: August
Year: 2024
Location: Bangkok, Thailand and virtual meeting
Venue:
F |
i |
n |
d |
i |
n |
g |
s |
- |
A |
C |
L |
Citations
URL
No Citations Yet
https://ALaRM-fdu.github.io
https://huggingface.co/openbmb/UltraRM-13b
https://github.com/allenai/FineGrainedRLHF
https://huggingface.co/datasets/Helsinki-NLP/europarl
https://github.com/languagetool-org/languagetool
https://github.com/pemistahl/lingua-py
https://github.com/textstat/textstat
https://github.com/huggingface/trl
Field Of Study