DynaMo: Accelerating Language Model Inference with Dynamic Multi-Token Sampling

Shikhar Tuli | Chi-Heng Lin | Yen-Chang Hsu | Niraj Jha | Yilin Shen | Hongxia Jin |

Paper Details:

Month: June
Year: 2024
Location: Mexico City, Mexico
Venue: NAACL |