SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding

Zhangchen Xu | Fengqing Jiang | Luyao Niu | Jinyuan Jia | Bill Yuchen Lin | Radha Poovendran |

Paper Details:

Month: August
Year: 2024
Location: Bangkok, Thailand
Venue: ACL |