Muffin or Chihuahua? Challenging Multimodal Large Language Models with Multipanel VQA

Yue Fan | Jing Gu | Kaiwen Zhou | Qianqi Yan | Shan Jiang | Ching-Chen Kuo | Yang Zhao | Xinze Guan | Xin Wang

Paper Details:

Month: August
Year: 2024
Location: Bangkok, Thailand
Venue: ACL
