arxiv.org
arxiv-sanity.com
scholar.google.com
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
William Fedus and Barret Zoph and Noam Shazeer
arXiv e-Print archive - 2021 via Local arXiv
Keywords: cs.LG, cs.AI

more

[link]
Summary by CodyWild 1 week ago
Loading...
Your comment:


ShortScience.org allows researchers to publish paper summaries that are voted on and ranked!
About

Sponsored by: