A Challenging Multimodal Video Summary: Simultaneously Extracting and Generating Keyframe-Caption Pairs from Video

Keito Kudo | Haruki Nagasawa | Jun Suzuki | Nobuyuki Shimizu |

Paper Details:

Month: December
Year: 2023
Location: Singapore
Venue: EMNLP |