An Architecture for Accelerated Large-Scale Inference of Transformer-Based Language Models

Amir Ganiev | Colton Chapin | Anderson De Andrade | Chen Liu |

Paper Details:

Month: June
Year: 2021
Location: Online
Venue: NAACL |