Jamo Pair Encoding: Subcharacter Representation-based Extreme Korean Vocabulary Compression for Efficient Subword Tokenization

Sangwhan Moon | Naoaki Okazaki |

Paper Details:

Month: May
Year: 2020
Location: Marseille, France
Venue: LREC |