“That Is a Suspicious Reaction!”: Interpreting Logits Variation to Detect NLP Adversarial Attacks

Edoardo Mosca | Shreyash Agarwal | Javier Rando Ramírez | Georg Groh |

Paper Details:

Month: May
Year: 2022
Location: Dublin, Ireland
Venue: ACL |

Citations

URL