Making Asynchronous Stochastic Gradient Descent Work for Transformers

Alham Fikri Aji | Kenneth Heafield

Paper Details:

Month: November
Year: 2019
Location: Hong Kong
Venue: EMNLP | WS

Field Of Study

Task: Machine Translation
Language: English
Dataset: News