Adaptive Feature-based Low-Rank Compression of Large Language Models via Bayesian Optimization

Yixin Ji | Yang Xiang | Juntao Li | Qingrong Xia | Zi Ye | Xinyu Duan | Zhefeng Wang | Kehai Chen | Min Zhang |

Paper Details:

Month: November
Year: 2024
Location: Miami, Florida, USA
Venue: F | i | n | d | i | n | g | s | - | E | M | N | L | P |