NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Making Harmful Behaviors Unlearnable for Large Language Models
Xin Zhou
|
Yi Lu
|
Ruotian Ma
|
Yujian Wei
|
Tao Gui
|
Qi Zhang
|
Xuanjing Huang
|
Paper Details:
Month: August
Year: 2024
Location: Bangkok, Thailand and virtual meeting
Venue:
F |
i |
n |
d |
i |
n |
g |
s |
- |
A |
C |
L |
Citations
URL
No Citations Yet
https://github.com/xzhou20/
https://github.com/anthropics/hh-rlhf/tree/
https://github.com/LLM-Tuning-Safety/
Field Of Study