NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Out-of-the-Box and into the Ditch? Multilingual Evaluation of Generic Text Extraction Tools
Adrien Barbaresi
|
Gaƫl Lejeune
|
Paper Details:
Month: May
Year: 2020
Location: Marseille, France
Venue:
LREC |
WAC |
WS |
Citations
URL
No Citations Yet
https://commoncrawl.org
https://chromium.googlesource.com/chromium/dom-distiller
https://www.w3.org/TR/2017/REC-html52-20171214/
https://spectrum.ieee.org/computing/software/the-top-
https://github.com/Alir3z4/html2text/
https://github.com/weblyzard/inscriptis
https://github.com/jmriebold/BoilerPy3
https://github.com/dragnet-org/dragnet
https://github.com/goose3/goose3
https://github.com/miso-belica/jusText
https://github.com/codelucas/newspaper
https://github.com/fhamborg/news-please
https://github.com/buriy/python-readability
https://github.com/rundimeco/waddle
https://github.com/miso-belica/jusText/issues/12
https://github.com/rundimeco/waddle
Field Of Study