Optimizing Deeper Transformers on Small Datasets

Peng Xu | Dhruv Kumar | Wei Yang | Wenjie Zi | Keyi Tang | Chenyang Huang | Jackie Chi Kit Cheung | Simon J.D. Prince | Yanshuai Cao

Paper Details:

Month: August
Year: 2021
Location: Online
Venue: ACL | IJCNLP