NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Daniel Deutsch
Number of Papers:- 30
Number of Citations:- 1
First ACL Paper:- 2018
Latest ACL Paper:- 2024
Venues:-
NAACL
CoNLL
L
COLING
s
EMNLP
-
Findings
C
Eval4NLP
g
TACL
A
d
WS
ACL
EACL
WMT
i
NLPOSS
N
F
n
Co-Authors:-
Aditya Siddhant
Agnieszka Nowak
Ali Dabirmoghaddam
Alison Lui
Alon Lavie
Similar Authors:-
Omar Asbayou
Siwar Benayed
Rohan Chitnis
Rabib Islam
Toms Miks
2024
2023
2022
2021
2020
2019
2018
On the Role of Summary Content Units in Text Summarization Evaluation
NAACL
Marcel Nawrath |
Agnieszka Nowak |
Tristan Ratz |
Danilo Walenta |
Juri Opitz |
Leonardo Ribeiro |
João Sedoc |
Daniel Deutsch |
Simon Mille |
Yixin Liu |
Sebastian Gehrmann |
Lining Zhang |
Saad Mahamood |
Miruna Clinciu |
Khyathi Chandu |
Yufang Hou |
LLMRefine: Pinpointing and Refining Large Language Models via Fine-Grained Actionable Feedback
F
i
n
d
i
n
g
s
-
N
A
A
C
L
Wenda Xu |
Daniel Deutsch |
Mara Finkelstein |
Juraj Juraska |
Biao Zhang |
Zhongtao Liu |
William Yang Wang |
Lei Li |
Markus Freitag |
Finding Replicable Human Evaluations via Stable Ranking Probability
NAACL
Parker Riley |
Daniel Deutsch |
George Foster |
Viresh Ratnakar |
Ali Dabirmoghaddam |
Markus Freitag |
Mitigating Metric Bias in Minimum Bayes Risk Decoding
WMT
Geza Kovacs |
Daniel Deutsch |
Markus Freitag |
Improving Statistical Significance in Human Evaluation of Automatic Metrics via Soft Pairwise Accuracy
WMT
Brian Thompson |
Nitika Mathur |
Daniel Deutsch |
Huda Khayrallah |
MetricX-24: The Google Submission to the WMT 2024 Metrics Shared Task
WMT
Juraj Juraska |
Daniel Deutsch |
Mara Finkelstein |
Markus Freitag |
Are LLMs Breaking MT Metrics? Results of the WMT24 Metrics Shared Task
WMT
Markus Freitag |
Nitika Mathur |
Daniel Deutsch |
Chi-Kiu Lo |
Eleftherios Avramidis |
Ricardo Rei |
Brian Thompson |
Frederic Blain |
Tom Kocmi |
Jiayi Wang |
David Ifeoluwa Adelani |
Marianna Buchicchio |
Chrysoula Zerva |
Alon Lavie |
Beyond Human-Only: Evaluating Human-Machine Collaboration for Collecting High-Quality Translation Data
WMT
Zhongtao Liu |
Parker Riley |
Daniel Deutsch |
Alison Lui |
Mengmeng Niu |
Apurva Shah |
Markus Freitag |
Incorporating Question Answering-Based Signals into Abstractive Summarization via Salient Span Selection
EACL
Daniel Deutsch |
Dan Roth |
Training and Meta-Evaluating Machine Translation Evaluation Metrics at the Paragraph Level
WMT
Daniel Deutsch |
Juraj Juraska |
Mara Finkelstein |
Markus Freitag |
MetricX-23: The Google Submission to the WMT 2023 Metrics Shared Task
WMT
Juraj Juraska |
Mara Finkelstein |
Daniel Deutsch |
Aditya Siddhant |
Mehdi Mirzazadeh |
Markus Freitag |
Quality Estimation Using Minimum Bayes Risk
WMT
Subhajit Naskar |
Daniel Deutsch |
Markus Freitag |
There’s No Data like Better Data: Using QE Metrics for MT Data Filtering
WMT
Jan-Thorsten Peter |
David Vilar |
Daniel Deutsch |
Mara Finkelstein |
Juraj Juraska |
Markus Freitag |
Results of WMT23 Metrics Shared Task: Metrics Might Be Guilty but References Are Not Innocent
WMT
Markus Freitag |
Nitika Mathur |
Chi-kiu Lo |
Eleftherios Avramidis |
Ricardo Rei |
Brian Thompson |
Tom Kocmi |
Frederic Blain |
Daniel Deutsch |
Craig Stewart |
Chrysoula Zerva |
Sheila Castilho |
Alon Lavie |
George Foster |
The Devil Is in the Errors: Leveraging Large Language Models for Fine-grained Machine Translation Evaluation
WMT
Patrick Fernandes |
Daniel Deutsch |
Mara Finkelstein |
Parker Riley |
André Martins |
Graham Neubig |
Ankush Garg |
Jonathan Clark |
Markus Freitag |
Orhan Firat |
Ties Matter: Meta-Evaluating Modern Metrics with Pairwise Accuracy and Tie Calibration
EMNLP
Daniel Deutsch |
George Foster |
Markus Freitag |
Proceedings of the 4th Workshop on Evaluation and Comparison of NLP Systems
Eval4NLP
WS
Daniel Deutsch |
Rotem Dror |
Steffen Eger |
Yang Gao |
Christoph Leiter |
Juri Opitz |
Andreas Rücklé |
The Eval4NLP 2023 Shared Task on Prompting Large Language Models as Explainable Metrics
Eval4NLP
WS
Christoph Leiter |
Juri Opitz |
Daniel Deutsch |
Yang Gao |
Rotem Dror |
Steffen Eger |
A Needle in a Haystack: An Analysis of High-Agreement Workers on MTurk for Summarization
ACL
Lining Zhang |
Simon Mille |
Yufang Hou |
Daniel Deutsch |
Elizabeth Clark |
Yixin Liu |
Saad Mahamood |
Sebastian Gehrmann |
Miruna Clinciu |
Khyathi Raghavi Chandu |
João Sedoc |
Benchmarking Answer Verification Methods for Question Answering-Based Summarization Evaluation Metrics
ACL
Findings
Daniel Deutsch |
Dan Roth |
Re-Examining System-Level Correlations of Automatic Summarization Evaluation Metrics
NAACL
Daniel Deutsch |
Rotem Dror |
Dan Roth |
On the Limitations of Reference-Free Evaluations of Generated Text
EMNLP
Daniel Deutsch |
Rotem Dror |
Dan Roth |
A Statistical Analysis of Summarization Evaluation Metrics Using Resampling Methods
TACL
Daniel Deutsch |
Rotem Dror |
Dan Roth |
Understanding the Extent to which Content Quality Metrics Measure the Information Quality of Summaries
CoNLL
EMNLP
Daniel Deutsch |
Dan Roth |
Towards Question-Answering as an Automatic Metric for Evaluating the Content Quality of a Summary
TACL
Daniel Deutsch |
Tania Bedrax-Weiss |
Dan Roth |
Is Killed More Significant than Fled? A Contextual Model for Salient Event Detection
COLING
Disha Jindal |
Daniel Deutsch |
Dan Roth |
SacreROUGE: An Open-Source Library for Using and Developing Summarization Evaluation Metrics
EMNLP
NLPOSS
Daniel Deutsch |
Dan Roth |
Summary Cloze: A New Task for Content Selection in Topic-Focused Summarization
EMNLP
Daniel Deutsch |
Dan Roth |
A General-Purpose Algorithm for Constrained Sequential Inference
CoNLL
Daniel Deutsch |
Shyam Upadhyay |
Dan Roth |
A Distributional and Orthographic Aggregation Model for English Derivational Morphology
ACL
Daniel Deutsch |
John Hewitt |
Dan Roth |
Linguistic
Task
Approach
Language
Dataset Type
.