SwapMoE: Serving Off-the-shelf MoE-based Large Language Models with Tunable Memory Budget

Rui Kong | Yuanchun Li | Qingtian Feng | Weijun Wang | Xiaozhou Ye | Ye Ouyang | Linghe Kong | Yunxin Liu |

Paper Details:

Month: August
Year: 2024
Location: Bangkok, Thailand
Venue: ACL |

Citations

URL

No Citations Yet

No URLs Found

Field Of Study