When Attention Meets Fast Recurrence: Training Language Models with Reduced Compute

Tao Lei

Paper Details:

Month: November
Year: 2021
Location: Online and Punta Cana, Dominican Republic
Venue: EMNLP