arxiv.org
arxiv-vanity.com
scholar.google.com
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
William Fedus and Barret Zoph and Noam Shazeer
arXiv e-Print archive - 2021 via Local arXiv
Keywords: cs.LG, cs.AI

more



ShortScience.org allows researchers to publish paper summaries that are voted on and ranked!
About

Sponsored by: