NLPExplorer
  • Papers
  • Venues
  • Authors
  • Field of Study
  • URLs
  • #ACL2020
  • ACL Timeline
  • ACL Moments
  • API
  • Team

ParaCrawl: Web-Scale Acquisition of Parallel Corpora

Marta Bañón | Pinzhen Chen | Barry Haddow | Kenneth Heafield | Hieu Hoang | Miquel Esplà-Gomis | Mikel L. Forcada | Amir Kamran | Faheem Kirefu | Philipp Koehn | Sergio Ortiz Rojas | Leopoldo Pla Sempere | Gema Ramírez-Sánchez | Elsa Sarrías | Marek Strelec | Brian Thompson | William Waites | Dion Wiggins | Jaume Zaragoza |

Paper Details:

Month: July
Year: 2020
Location: Online
Venue: ACL |

Citations

URL

No Citations Yet

  • https://github.com/bitextor/bitextor
  • https://archive.org/
  • https://www.isi.edu/natural-language/
  • http://opus.lingfil.uu.se/
  • https://www.httrack.com/
  • https://github.com/internetarchive/
  • https://github.com/aitjcize/creepy
  • https://www.oasis-open.org/standards#
  • http://www.ecma-international.org/
  • https://engineering.fb.com/ai-research/
  • http://opus.nlpl.eu/EUbookshop.php
  • http://www.statmt.org/
  • https://www.ofgem.gov.uk/electricity/
  • http://www.csd3.cam.ac.uk/

Field Of Study

Linguistic Trends
Embeddings
Task
Language Identification Machine Translation
Language
Multilingual Chinese English Japanese Spanish French
Dataset
News Web Crawl