Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-Language BERTs

Emanuele Bugliarello | Ryan Cotterell | Naoaki Okazaki | Desmond Elliott |

Paper Details:


Year: 2021
Location: Cambridge, MA
Venue: TACL |