What Language Model to Train if You Have One Million GPU Hours?

Teven Le Scao | Thomas Wang | Daniel Hesslow | Stas Bekman | M Saiful Bari | Stella Biderman | Hady Elsahar | Niklas Muennighoff | Jason Phang | Ofir Press | Colin Raffel | Victor Sanh | Sheng Shen | Lintang Sutawika | Jaesung Tae | Zheng Xin Yong | Julien Launay | Iz Beltagy |

Paper Details:

Month: December
Year: 2022
Location: Abu Dhabi, United Arab Emirates
Venue: F | i | n | d | i | n | g | s | - | E | M | N | L | P |