Towards A Unified View of Sparse Feed-Forward Network in Pretraining Large Language Model

Zeyu Liu | Tim Dettmers | Xi Lin | Veselin Stoyanov | Xian Li

Paper Details:

Month: December
Year: 2023
Location: Singapore
Venue: EMNLP
