Improving Sharpness-Aware Minimization with Fisher Mask for Better Generalization on Language Models

Qihuang Zhong | Liang Ding | Li Shen | Peng Mi | Juhua Liu | Bo Du | Dacheng Tao |

Paper Details:

Month: December
Year: 2022
Location: Abu Dhabi, United Arab Emirates
Venue: F | i | n | d | i | n | g | s | - | E | M | N | L | P |