Cross-Modal Projection in Multimodal LLMs Doesn’t Really Project Visual Attributes to Textual Space

Gaurav Verma | Minje Choi | Kartik Sharma | Jamelle Watson-Daniels | Sejoon Oh | Srijan Kumar

Paper Details:

Month: August
Year: 2024
Location: Bangkok, Thailand
Venue: ACL