Sub-Character Tokenization for Chinese Pretrained Language Models

Chenglei Si | Zhengyan Zhang | Yingfa Chen | Fanchao Qi | Xiaozhi Wang | Zhiyuan Liu | Yasheng Wang | Qun Liu | Maosong Sun

Paper Details:


Year: 2023
Location: Cambridge, MA
Venue: TACL