SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF

Yi Dong | Zhilin Wang | Makesh Sreedhar | Xianchao Wu | Oleksii Kuchaiev |

Paper Details:

Month: December
Year: 2023
Location: Singapore
Venue: F | i | n | d | i | n | g | s | - | E | M | N | L | P |