NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Scaling to Very Very Large Corpora for Natural Language Disambiguation
Michele Banko
|
Eric Brill
|
Paper Details:
Month: July
Year: 2001
Location: Toulouse, France
Venue:
ACL |
Citations
URL
Using Web-scale N-grams to Improve Base NP Parsing Performance
Emily Pitler
|
Shane Bergsma
|
Dekang Lin
|
Kenneth Church
|
Heterogeneous Parsing via Collaborative Decoding
Muhua Zhu
|
Jingbo Zhu
|
Tong Xiao
|
Automatic Treebank Conversion via Informed Decoding
Muhua Zhu
|
Jingbo Zhu
|
Scalable Language Processing Algorithms for the Masses: A Case Study in Computing Word Co-occurrence Matrices with MapReduce
Jimmy Lin
|
Graph-based Analysis of Semantic Drift in Espresso-like Bootstrapping Algorithms
Mamoru Komachi
|
Taku Kudo
|
Masashi Shimbo
|
Yuji Matsumoto
|
Generating Confusion Sets for Context-Sensitive Error Correction
Alla Rozovskaya
|
Dan Roth
|
A Unified Approach to Transliteration-based Text Input with Online Spelling Correction
Hisami Suzuki
|
Jianfeng Gao
|
Scaling to Large³ Data: An Efficient and Effective Method to Compute Distributional Thesauri
Martin Riedl
|
Chris Biemann
|
Web Text Corpus for Natural Language Processing
Vinci Liu
|
James R. Curran
|
Correcting Grammatical Verb Errors
Alla Rozovskaya
|
Dan Roth
|
Vivek Srikumar
|
Unsupervised Relation Extraction of In-Domain Data from Focused Crawls
Steffen Remus
|
Tell Me What You Do and I’ll Tell You What You Are: Learning Occupation-Related Activities for Biographies
Elena Filatova
|
John Prager
|
Using the Web as an Implicit Training Set: Application to Structural Ambiguity Resolution
Preslav Nakov
|
Marti Hearst
|
Introduction to the Special Issue on the Web as Corpus
Adam Kilgarriff
|
Gregory Grefenstette
|
Using the Web to Obtain Frequencies for Unseen Bigrams
Frank Keller
|
Mirella Lapata
|
Word Translation Disambiguation Using Bilingual Bootstrapping
Hang Li
|
Cong Li
|
Sample Selection for Statistical Parsing
Rebecca Hwa
|
The Noisy Channel Model for Unsupervised Word Sense Disambiguation
Deniz Yuret
|
Mehmet Ali Yatbaz
|
A Large-Scale Pseudoword-Based Evaluation Framework for State-of-the-Art Word Sense Disambiguation
Mohammad Taher Pilehvar
|
Roberto Navigli
|
Adapting to Learner Errors with Minimal Supervision
Alla Rozovskaya
|
Dan Roth
|
Mark Sammons
|
A Lightweight and Efficient Tool for Cleaning Web Pages
Stefan Evert
|
Identification of Multiword Expressions in the brWaC
Rodrigo Boos
|
Kassius Prestes
|
Aline Villavicencio
|
Complementarity, F-score, and NLP Evaluation
Leon Derczynski
|
Building a Web-Scale Dependency-Parsed Corpus from CommonCrawl
Alexander Panchenko
|
Eugen Ruppert
|
Stefano Faralli
|
Simone P. Ponzetto
|
Chris Biemann
|
Weakly Supervised Natural Language Learning Without Redundant Views
Vincent Ng
|
Claire Cardie
|
The Web as a Baseline: Evaluating the Performance of Unsupervised Web-based Models for a Range of NLP Tasks
Mirella Lapata
|
Frank Keller
|
Semi-Automatic Entity Set Refinement
Vishnu Vyas
|
Patrick Pantel
|
The Effect of Corpus Size on Case Frame Acquisition for Discourse Analysis
Ryohei Sasano
|
Daisuke Kawahara
|
Sadao Kurohashi
|
Qme! : A Speech-based Question-Answering system on Mobile Devices
Taniya Mishra
|
Srinivas Bangalore
|
Automatic Parallel Fragment Extraction from Noisy Data
Jason Riesa
|
Daniel Marcu
|
Paving the Way to a Large-scale Pseudosense-annotated Dataset
Mohammad Taher Pilehvar
|
Roberto Navigli
|
Scaling Context Space
James Curran
|
Marc Moens
|
An Unsupervised Approach to Recognizing Discourse Relations
Daniel Marcu
|
Abdessamad Echihabi
|
Shallow Parsing on the Basis of Words Only: A Case Study
Antal van den Bosch
|
Sabine Buchholz
|
An Empirical Study of Active Learning with Support Vector Machines forJapanese Word Segmentation
Manabu Sassano
|
Offline Strategies for Online Question Answering: Answering Questions Before They Are Asked
Michael Fleischman
|
Eduard Hovy
|
Abdessamad Echihabi
|
Analysis of Selective Strategies to Build a Dependency-Analyzed Corpus
Kiyonori Ohtake
|
Weakly Supervised Learning for Hedge Classification in Scientific Literature
Ben Medlock
|
Ted Briscoe
|
Bucking the Trend: Large-Scale Cost-Focused Active Learning for Statistical Machine Translation
Michael Bloodgood
|
Chris Callison-Burch
|
Creating Robust Supervised Classifiers via Web-Scale N-Gram Data
Shane Bergsma
|
Emily Pitler
|
Dekang Lin
|
Speech-Driven Access to the Deep Web on Mobile Devices
Taniya Mishra
|
Srinivas Bangalore
|
Unsupervised Morphology-Based Vocabulary Expansion
Mohammad Sadegh Rasooli
|
Thomas Lippincott
|
Nizar Habash
|
Owen Rambow
|
Generalized Character-Level Spelling Error Correction
Noura Farra
|
Nadi Tomeh
|
Alla Rozovskaya
|
Nizar Habash
|
Learning Word Representations from Scarce and Noisy Data with Embedding Subspaces
Ramon Astudillo
|
Silvio Amir
|
Wang Ling
|
Mário Silva
|
Isabel Trancoso
|
Grammatical Error Correction: Machine Translation and Classifiers
Alla Rozovskaya
|
Dan Roth
|
iLab-Edinburgh at SemEval-2016 Task 7: A Hybrid Approach for Determining Sentiment Intensity of Arabic Twitter Phrases
Eshrag Refaee
|
Verena Rieser
|
Evaluating the results of a memory-based word-expert approach to unrestricted word sense disambiguation
Veronique Hoste
|
Walter Daelemans
|
Iris Hendrickx
|
Antal van den Bosch
|
An Incremental Decision List Learner
Joshua Goodman
|
Ensemble Methods for Automatic Thesaurus Extraction
James Curran
|
Using the Web to Overcome Data Sparseness
Frank Keller
|
Maria Lapata
|
Olga Ourioupina
|
Statistical Named Entity Recognizer Adaptation
John D. Burger
|
John C. Henderson
|
William T. Morgan
|
A Very Very Large Corpus Doesn’t Always Yield Reliable Estimates
James R. Curran
|
Miles Osborne
|
Letter Level Learning for Language Independent Diacritics Restoration
Rada Mihalcea
|
Vivi Nastase
|
An Evaluation Exercise for Word Alignment
Rada Mihalcea
|
Ted Pedersen
|
Training a Naive Bayes Classifier via the EM Algorithm with a Class Distribution Constraint
Yoshimasa Tsuruoka
|
Jun’ichi Tsujii
|
Blueprint for a High Performance NLP Infrastructure
James R. Curran
|
Bootstrapping Coreference Classifiers with Multiple Machine Learning Algorithms
Vincent Ng
|
Claire Cardie
|
Weakly Supervised Learning Methods for Improving the Quality of Gene Name Normalization Data
Ben Wellner
|
Data Selection in Semi-supervised Learning for Name Tagging
Heng Ji
|
Ralph Grishman
|
CUCWeb: A Catalan corpus built from the Web
Gemma Boleda
|
Stefan Bott
|
Rodrigo Meza
|
Carlos Castillo
|
Toni Badia
|
Vicente López
|
All-word Prediction as the Ultimate Confusible Disambiguation
Antal van den Bosch
|
Exploring Large-Data Issues in the Curriculum: A Case Study with MapReduce
Jimmy Lin
|
Language Models for Contextual Error Detection and Correction
Herman Stehouwer
|
Menno van Zaanen
|
Mining of Parsed Data to Derive Deverbal Argument Structure
Olga Gurevich
|
Scott Waterman
|
The Design of a Proofreading Software Service
Raphael Mudge
|
Annotating Large Email Datasets for Named Entity Recognition with Mechanical Turk
Nolan Lawson
|
Kevin Eustice
|
Mike Perkowitz
|
Meliha Yetisgen-Yildiz
|
Search right and thou shalt find ... Using Web Queries for Learner Error Detection
Michael Gamon
|
Claudia Leacock
|
The UI System in the HOO 2012 Shared Task on Error Correction
Alla Rozovskaya
|
Mark Sammons
|
Dan Roth
|
Fast and Robust Arabic Error Correction System
Michael Nawar
|
Moheb Ragheb
|
CUFE@QALB-2015 Shared Task: Arabic Error Correction System
Michael Nawar
|
There’s no ‘Count or Predict’ but task-based selection for distributional models
Martin Riedl
|
Chris Biemann
|
No URLs Found
Field Of Study
Task
Tagging
Word Sense Disambiguation
Approach
Unsupervised Learning
Language
English
Dataset
News
Similar Papers
Bootstrap Domain-Specific Sentiment Classifiers from Unlabeled Corpora
Andrius Mudinas
|
Dell Zhang
|
Mark Levene
|
Expectation-Regulated Neural Model for Event Mention Extraction
Ching-Yun Chang
|
Zhiyang Teng
|
Yue Zhang
|
A Corpus of Corporate Annual and Social Responsibility Reports: 280 Million Tokens of Balanced Organizational Writing
Sebastian G.M. Händschke
|
Sven Buechel
|
Jan Goldenstein
|
Philipp Poschmann
|
Tinghui Duan
|
Peter Walgenbach
|
Udo Hahn
|
Argumentation Mining in User-Generated Web Discourse
Ivan Habernal
|
Iryna Gurevych
|
A Joint Model of Conversational Discourse Latent Topics on Microblogs
Jing Li
|
Yan Song
|
Zhongyu Wei
|
Kam-Fai Wong
|