Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning

Xin Wang | Yuan-Fang Wang | William Yang Wang

Paper Details:

Month: June
Year: 2018
Location: New Orleans, Louisiana
Venue: NAACL