Pre-training Data Quality and Quantity for a Low-Resource Language: New Corpus and BERT Models for Maltese

Kurt Micallef | Albert Gatt | Marc Tanti | Lonneke van der Plas | Claudia Borg

Paper Details:

Month: July
Year: 2022
Location: Hybrid
Venue: DeepLo | NAACL