Hierarchical Transformers Are More Efficient Language Models

Piotr Nawrot | Szymon Tworkowski | MichaƂ Tyrolski | Lukasz Kaiser | Yuhuai Wu | Christian Szegedy | Henryk Michalewski |

Paper Details:

Month: July
Year: 2022
Location: Seattle, United States
Venue: Findings | NAACL |