NLPExplorer
Papers
Venues
Authors
Authors Timeline
Field of Study
URLs
ACL N-gram Stats
TweeNLP
API
Team
Effects of Parameter Norm Growth During Transformer Training: Inductive Bias from Gradient Descent
William Merrill
|
Vivek Ramanujan
|
Yoav Goldberg
|
Roy Schwartz
|
Noah A. Smith
|
Paper Details:
Month: November
Year: 2021
Location: Online and Punta Cana, Dominican Republic
Venue:
EMNLP |
Citations
URL
No Citations Yet
https://github.com/
https://huggingface.co/transformers/
Field Of Study