Evaluation Metrics in the Era of GPT-4: Reliably Evaluating Large Language Models on Sequence to Sequence Tasks

Andrea Sottana | Bin Liang | Kai Zou | Zheng Yuan |

Paper Details:

Month: December
Year: 2023
Location: Singapore
Venue: EMNLP |