GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model

Shicheng Tan | Weng Lam Tam | Yuanchun Wang | Wenwen Gong | Shu Zhao | Peng Zhang | Jie Tang

Paper Details:

Month: July
Year: 2023
Location: Toronto, Canada
Venue: ACL
