Vector-Vector-Matrix Architecture: A Novel Hardware-Aware Framework for Low-Latency Inference in NLP Applications

Matthew Khoury | Rumen Dangovski | Longwu Ou | Preslav Nakov | Yichen Shen | Li Jing

Paper Details:

Month: November
Year: 2020
Location: Online
Venue: EMNLP
