MoE-I2: Compressing Mixture of Experts Models through Inter-Expert Pruning and Intra-Expert Low-Rank Decomposition

Cheng Yang | Yang Sui | Jinqi Xiao | Lingyi Huang | Yu Gong | Yuanlin Duan | Wenqi Jia | Miao Yin | Yu Cheng | Bo Yuan

Paper Details:

Month: November
Year: 2024
Location: Miami, Florida, USA
Venue: Findings-EMNLP

Field Of Study