NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches
Jiayi Yuan
|
Hongyi Liu
|
Shaochen Zhong
|
Yu-Neng Chuang
|
Songchen Li
|
Guanchu Wang
|
Duy Le
|
Hongye Jin
|
Vipin Chaudhary
|
Zhaozhuo Xu
|
Zirui Liu
|
Xia Hu
|
Paper Details:
Month: November
Year: 2024
Location: Miami, Florida, USA
Venue:
F |
i |
n |
d |
i |
n |
g |
s |
- |
E |
M |
N |
L |
P |
Citations
URL
No Citations Yet
https://github
http://arxiv
https://github
https://paulgraham.com/articles.html
https://github.com/jy-yuan/KIVI
https://github.com/FMInference/FlexGen
https://github.com/FMInference/H2O/blob/main/
https://github.com/FMInference/H2O
https://github.com/thunlp/InfLLM/blob/
https://github.com/thunlp/InfLLM/blob/
https://github.com/thunlp/InfLLM
https://github.com/microsoft/LLMLingua
Field Of Study