Adversarial Preference Optimization: Enhancing Your Alignment via RM-LLM Game

Pengyu Cheng | Yifan Yang | Jian Li | Yong Dai | Tianhao Hu | Peixin Cao | Nan Du | Xiaolong Li |

Paper Details:

Month: August
Year: 2024
Location: Bangkok, Thailand and virtual meeting
Venue: F | i | n | d | i | n | g | s | - | A | C | L |

Citations

URL

No Citations Yet