NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Pedro Ortiz Suarez
Number of Papers:- 8
Number of Citations:- 0
First ACL Paper:- 2022
Latest ACL Paper:- 2024
Venues:-
TACL
s
A
i
d
-
L
LREC
JEP/TALN/RECITAL
C
COLING
WMT
N
F
n
g
Co-Authors:-
Ahmed Baruwa
Ahsan Wahab
Alessia Battisti
Alexander Weber
Alexandre Bartz
Similar Authors:-
2024
2022
Tokenizer Choice For LLM Training: Negligible or Crucial?
F
i
n
d
i
n
g
s
-
N
A
A
C
L
Mehdi Ali |
Michael Fromm |
Klaudia Thellmann |
Richard Rutmann |
Max Lübbering |
Johannes Leveling |
Katrin Klug |
Jan Ebert |
Niclas Doll |
Jasper Buschhoff |
Charvi Jain |
Alexander Weber |
Lena Jurkschat |
Hammam Abdelwahab |
Chelsea John |
Pedro Ortiz Suarez |
Malte Ostendorff |
Samuel Weinbach |
Rafet Sifa |
Stefan Kesselheim |
Nicolas Flores-Herr |
Occiglot at WMT24: European Open-source Large Language Models Evaluated on Translation
WMT
Eleftherios Avramidis |
Annika Grützner-Zahn |
Manuel Brack |
Patrick Schramowski |
Pedro Ortiz Suarez |
Malte Ostendorff |
Fabio Barth |
Shushen Manakhimova |
Vivien Macketanz |
Georg Rehm |
Kristian Kersting |
Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets
TACL
Julia Kreutzer |
Isaac Caswell |
Lisa Wang |
Ahsan Wahab |
Daan van Esch |
Nasanbayar Ulzii-Orshikh |
Allahsera Tapo |
Nishant Subramani |
Artem Sokolov |
Claytone Sikasote |
Monang Setyawan |
Supheakmungkol Sarin |
Sokhar Samb |
Benoît Sagot |
Clara Rivera |
Annette Rios |
Isabel Papadimitriou |
Salomey Osei |
Pedro Ortiz Suarez |
Iroro Orife |
Kelechi Ogueji |
Andre Niyongabo Rubungo |
Toan Q. Nguyen |
Mathias Müller |
André Müller |
Shamsuddeen Hassan Muhammad |
Nanda Muhammad |
Ayanda Mnyakeni |
Jamshidbek Mirzakhalov |
Tapiwanashe Matangira |
Colin Leong |
Nze Lawson |
Sneha Kudugunta |
Yacine Jernite |
Mathias Jenny |
Orhan Firat |
Bonaventure F. P. Dossou |
Sakhile Dlamini |
Nisansa de Silva |
Sakine Çabuk Ballı |
Stella Biderman |
Alessia Battisti |
Ahmed Baruwa |
Ankur Bapna |
Pallavi Baljekar |
Israel Abebe Azime |
Ayodele Awokoya |
Duygu Ataman |
Orevaoghene Ahia |
Oghenefego Ahia |
Sweta Agrawal |
Mofetoluwa Adeyemi |
Le projet FREEM : ressources, outils et enjeux pour l’étude du français d’Ancien Régime (The F RE EM project: Resources, tools and challenges for the study of Ancien Régime French)
JEP/TALN/RECITAL
Simon Gabay |
Pedro Ortiz Suarez |
Rachel Bawden |
Alexandre Bartz |
Philippe Gambette |
Benoît Sagot |
BERTrade: Using Contextual Embeddings to Parse Old French
LREC
Loïc Grobol |
Mathilde Regnault |
Pedro Ortiz Suarez |
Benoît Sagot |
Laurent Romary |
Benoit Crabbé |
Towards a Cleaner Document-Oriented Multilingual Crawled Corpus
LREC
Julien Abadji |
Pedro Ortiz Suarez |
Laurent Romary |
Benoît Sagot |
A Data-driven Approach to Named Entity Recognition for Early Modern French
COLING
Pedro Ortiz Suarez |
Simon Gabay |
From FreEM to D’AlemBERT: a Large Corpus and a Language Model for Early Modern French
LREC
Simon Gabay |
Pedro Ortiz Suarez |
Alexandre Bartz |
Alix Chagué |
Rachel Bawden |
Philippe Gambette |
Benoît Sagot |
.