NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
VarBench: Robust Language Model Benchmarking Through Dynamic Variable Perturbation
Kun Qian
|
Shunji Wan
|
Claudia Tang
|
Youzhi Wang
|
Xuanming Zhang
|
Maximillian Chen
|
Zhou Yu
|
Paper Details:
Month: November
Year: 2024
Location: Miami, Florida, USA
Venue:
F |
i |
n |
d |
i |
n |
g |
s |
- |
E |
M |
N |
L |
P |
Citations
URL
No Citations Yet
https://github.com/
https://huggingface.co/spaces/
https://ai.google.dev/gemma/
https://huggingface.co/mistralai/
https://huggingface.co/HuggingFaceH4/zephyr-7b-beta
https://huggingface.co/01-ai/Yi-1.5-6B
https://huggingface.co/01-ai/Yi-1.5-9B
https://huggingface.co/meta-llama/Meta-Llama-3-8B
https://huggingface.co/meta-llama/
https://huggingface.co/SeaLLMs/
https://huggingface
https://platform.openai.com/docs/models/gpt-3-5-turbo
https://platform.openai.com/docs/models/gpt-4o
https://huggingface.co/datasets/
https://huggingface.co/
https://huggingface.co/datasets/truthfulqa/
https://huggingface.co/datasets/allenai/ai2_
Field Of Study