NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact
Ruikang Liu
|
Haoli Bai
|
Haokun Lin
|
Yuening Li
|
Han Gao
|
Zhengzhuo Xu
|
Lu Hou
|
Jun Yao
|
Chun Yuan
|
Paper Details:
Month: August
Year: 2024
Location: Bangkok, Thailand and virtual meeting
Venue:
F |
i |
n |
d |
i |
n |
g |
s |
- |
A |
C |
L |
Citations
URL
No Citations Yet
https://github.com/ruikangliu/IntactKV
https://huggingface.co/datasets/Aeala/
https://vicuna
https://github.com/AutoGPTQ/AutoGPTQ
https://github.com/mit-han-lab/llm-awq
https://github.com/OpenGVLab/OmniQuant
https://github.com/spcl/QuaRot
https://github.com/ist-daslab/gptq
https://github.com/hendrycks/test/pull/13
https://github.com/EleutherAI/
https://huggingface.co/spaces/lmsys/
https://huggingface.co/meta-llama/Llama-2-7b
https://huggingface.co/meta-llama/Llama-2-13b
https://huggingface.co/meta-llama/Llama-2-70b
https://huggingface.co/meta-llama/Meta-Llama-3-8B
https://huggingface.co/meta-llama/Meta-Llama-3-70B
https://huggingface.co/lmsys/vicuna-7b-v1.3
https://huggingface.co/lmsys/vicuna-13b-v1.3
https://huggingface.co/lmsys/vicuna-33b-v1.3
https://huggingface.co/lmsys/vicuna-7b-v1.5
https://huggingface.co/lmsys/vicuna-13b-v1.5
Field Of Study