NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Niklas Muennighoff
Number of Papers:- 10
Number of Citations:- 0
First ACL Paper:- 2022
Latest ACL Paper:- 2024
Venues:-
s
EMNLP
d
i
-
L
P
ACL
E
M
EACL
N
F
n
g
Co-Authors:-
Aakanksha Naik
Abhilasha Ravichander
Abinaya Mahendiran
Adam Roberts
Ahmed Baruwa
Similar Authors:-
2024
2023
2022
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning
ACL
Shivalika Singh |
Freddie Vargus |
Daniel D’souza |
Börje Karlsson |
Abinaya Mahendiran |
Wei-Yin Ko |
Herumb Shandilya |
Jay Patel |
Deividas Mataciunas |
Laura O’Mahony |
Mike Zhang |
Ramith Hettiarachchi |
Joseph Wilson |
Marina Machado |
Luisa Moura |
Dominik Krzemiński |
Hakimeh Fadaei |
Irem Ergun |
Ifeoma Okoh |
Aisha Alaagib |
Oshan Mudannayake |
Zaid Alyafeai |
Vu Chien |
Sebastian Ruder |
Surya Guthikonda |
Emad Alghamdi |
Sebastian Gehrmann |
Niklas Muennighoff |
Max Bartolo |
Julia Kreutzer |
Ahmet Üstün |
Marzieh Fadaee |
Sara Hooker |
Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model
ACL
Ahmet Üstün |
Viraat Aryabumi |
Zheng Yong |
Wei-Yin Ko |
Daniel D’souza |
Gbemileke Onilude |
Neel Bhandari |
Shivalika Singh |
Hui-Lee Ooi |
Amr Kayid |
Freddie Vargus |
Phil Blunsom |
Shayne Longpre |
Niklas Muennighoff |
Marzieh Fadaee |
Julia Kreutzer |
Sara Hooker |
OLMo: Accelerating the Science of Language Models
ACL
Dirk Groeneveld |
Iz Beltagy |
Evan Walsh |
Akshita Bhagia |
Rodney Kinney |
Oyvind Tafjord |
Ananya Jha |
Hamish Ivison |
Ian Magnusson |
Yizhong Wang |
Shane Arora |
David Atkinson |
Russell Authur |
Khyathi Chandu |
Arman Cohan |
Jennifer Dumas |
Yanai Elazar |
Yuling Gu |
Jack Hessel |
Tushar Khot |
William Merrill |
Jacob Morrison |
Niklas Muennighoff |
Aakanksha Naik |
Crystal Nam |
Matthew Peters |
Valentina Pyatkin |
Abhilasha Ravichander |
Dustin Schwenk |
Saurabh Shah |
William Smith |
Emma Strubell |
Nishant Subramani |
Mitchell Wortsman |
Pradeep Dasigi |
Nathan Lambert |
Kyle Richardson |
Luke Zettlemoyer |
Jesse Dodge |
Kyle Lo |
Luca Soldaini |
Noah Smith |
Hannaneh Hajishirzi |
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
ACL
Luca Soldaini |
Rodney Kinney |
Akshita Bhagia |
Dustin Schwenk |
David Atkinson |
Russell Authur |
Ben Bogin |
Khyathi Chandu |
Jennifer Dumas |
Yanai Elazar |
Valentin Hofmann |
Ananya Jha |
Sachin Kumar |
Li Lucy |
Xinxi Lyu |
Nathan Lambert |
Ian Magnusson |
Jacob Morrison |
Niklas Muennighoff |
Aakanksha Naik |
Crystal Nam |
Matthew Peters |
Abhilasha Ravichander |
Kyle Richardson |
Zejiang Shen |
Emma Strubell |
Nishant Subramani |
Oyvind Tafjord |
Evan Walsh |
Luke Zettlemoyer |
Noah Smith |
Hannaneh Hajishirzi |
Iz Beltagy |
Dirk Groeneveld |
Jesse Dodge |
Kyle Lo |
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
EMNLP
Holy Lovenia |
Rahmad Mahendra |
Salsabil Maulana Akbar |
Lester James Validad Miranda |
Jennifer Santoso |
Elyanah Aco |
Akhdan Fadhilah |
Jonibek Mansurov |
Joseph Marvin Imperial |
Onno P. Kampman |
Joel Ruben Antony Moniz |
Muhammad Ravi Shulthan Habibi |
Frederikus Hudi |
Jann Railey Montalan |
Ryan Ignatius Hadiwijaya |
Joanito Agili Lopo |
William Nixon |
Börje F. Karlsson |
James Jaya |
Ryandito Diandaru |
Yuze Gao |
Patrick Amadeus Irawan |
Bin Wang |
Jan Christian Blaise Cruz |
Chenxi Whitehouse |
Ivan Halim Parmonangan |
Maria Khelli |
Wenyu Zhang |
Lucky Susanto |
Reynard Adha Ryanda |
Sonny Lazuardi Hermawan |
Dan John Velasco |
Muhammad Dehan Al Kautsar |
Willy Fitra Hendria |
Yasmin Moslem |
Noah Flynn |
Muhammad Farid Adilazuarda |
Haochen Li |
Johanes Lee |
R. Damanhuri |
Shuo Sun |
Muhammad Reza Qorib |
Amirbek Djanibekov |
Wei Qi Leong |
Quyet V. Do |
Niklas Muennighoff |
Tanrada Pansuwan |
Ilham Firdausi Putra |
Yan Xu |
Tai Ngee Chia |
Ayu Purwarianti |
Sebastian Ruder |
William Chandra Tjhi |
Peerat Limkonchotiwat |
Alham Fikri Aji |
Sedrick Keh |
Genta Indra Winata |
Ruochen Zhang |
Fajri Koto |
Zheng Xin Yong |
Samuel Cahyawijaya |
MTEB: Massive Text Embedding Benchmark
EACL
Niklas Muennighoff |
Nouamane Tazi |
Loic Magne |
Nils Reimers |
FinGPT: Large Generative Models for a Small Language
EMNLP
Risto Luukkonen |
Ville Komulainen |
Jouni Luoma |
Anni Eskelinen |
Jenna Kanerva |
Hanna-Mari Kupari |
Filip Ginter |
Veronika Laippala |
Niklas Muennighoff |
Aleksandra Piktus |
Thomas Wang |
Nouamane Tazi |
Teven Scao |
Thomas Wolf |
Osma Suominen |
Samuli Sairanen |
Mikko Merioksa |
Jyrki Heinonen |
Aija Vahtola |
Samuel Antao |
Sampo Pyysalo |
Crosslingual Generalization through Multitask Finetuning
ACL
Niklas Muennighoff |
Thomas Wang |
Lintang Sutawika |
Adam Roberts |
Stella Biderman |
Teven Le Scao |
M Saiful Bari |
Sheng Shen |
Zheng Xin Yong |
Hailey Schoelkopf |
Xiangru Tang |
Dragomir Radev |
Alham Fikri Aji |
Khalid Almubarak |
Samuel Albanie |
Zaid Alyafeai |
Albert Webson |
Edward Raff |
Colin Raffel |
BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting
ACL
Zheng Xin Yong |
Hailey Schoelkopf |
Niklas Muennighoff |
Alham Fikri Aji |
David Ifeoluwa Adelani |
Khalid Almubarak |
M Saiful Bari |
Lintang Sutawika |
Jungo Kasai |
Ahmed Baruwa |
Genta Winata |
Stella Biderman |
Edward Raff |
Dragomir Radev |
Vassilina Nikoulina |
What Language Model to Train if You Have One Million GPU Hours?
F
i
n
d
i
n
g
s
-
E
M
N
L
P
Teven Le Scao |
Thomas Wang |
Daniel Hesslow |
Stas Bekman |
M Saiful Bari |
Stella Biderman |
Hady Elsahar |
Niklas Muennighoff |
Jason Phang |
Ofir Press |
Colin Raffel |
Victor Sanh |
Sheng Shen |
Lintang Sutawika |
Jaesung Tae |
Zheng Xin Yong |
Julien Launay |
Iz Beltagy |
.