NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Speculative Decoding via Early-exiting for Faster LLM Inference with Thompson Sampling Control Mechanism
Jiahao Liu
|
Qifan Wang
|
Jingang Wang
|
Xunliang Cai
|
Paper Details:
Month: August
Year: 2024
Location: Bangkok, Thailand and virtual meeting
Venue:
F |
i |
n |
d |
i |
n |
g |
s |
- |
A |
C |
L |
Citations
URL
No Citations Yet
https://huggingface.co/datasets/Aeala/ShareGPT_Vicuna
https://huggingface.co/TinyLlama/TinyLlama-1.1B-
https://huggingface.co/double7/vicuna-68m
https://github.com/FasterDecoding/
Field Of Study