Mask More and Mask Later: Efficient Pre-training of Masked Language Models by Disentangling the [MASK] Token

Baohao Liao | David Thulke | Sanjika Hewavitharana | Hermann Ney | Christof Monz |

Paper Details:

Month: December
Year: 2022
Location: Abu Dhabi, United Arab Emirates
Venue: F | i | n | d | i | n | g | s | - | E | M | N | L | P |