MergeDistill: Merging Language Models using Pre-trained Distillation

Simran Khanuja | Melvin Johnson | Partha Talukdar

Paper Details:

Month: August
Year: 2021
Location: Online
Venue: Findings