Investigating Web Corpus Filtering Methods for Language Model Development in Japanese

Rintaro Enomoto | Arseny Tolmachev | Takuro Niitsuma | Shuhei Kurita | Daisuke Kawahara |

Paper Details:

Month: June
Year: 2024
Location: Mexico City, Mexico
Venue: NAACL |