Examining the robustness of LLM evaluation to the distributional assumptions of benchmarks

Charlotte Siska | Katerina Marazopoulou | Melissa Ailem | James Bono

Paper Details:

Month: August
Year: 2024
Location: Bangkok, Thailand
Venue: ACL