Backdooring Instruction-Tuned Large Language Models with Virtual Prompt Injection

Jun Yan | Vikas Yadav | Shiyang Li | Lichang Chen | Zheng Tang | Hai Wang | Vijay Srinivasan | Xiang Ren | Hongxia Jin |

Paper Details:

Month: June
Year: 2024
Location: Mexico City, Mexico
Venue: NAACL |