Self-Distilled Quantization: Achieving High Compression Rates in Transformer-Based Language Models

James O’Neill | Sourav Dutta |

Paper Details:

Month: July
Year: 2023
Location: Toronto, Canada
Venue: ACL |