Aligning as Debiasing: Causality-Aware Alignment via Reinforcement Learning with Interventional Feedback

Yu Xia | Tong Yu | Zhankui He | Handong Zhao | Julian McAuley | Shuai Li

Paper Details:

Month: June
Year: 2024
Location: Mexico City, Mexico
Venue: NAACL

