Batch Renormalization: Towards Reducing Minibatch Dependence in Batch-Normalized Models
Sergey Ioffe
arXiv e-Print archive - 2017 via Local arXiv
Keywords: cs.LG


Summary by 7 years ago
"The problem with using moving averages [[in inference]]" -> I believe this is supposed to be [[in training]]?

Modified, thanks for pointing out.

Your comment: allows researchers to publish paper summaries that are voted on and ranked!

Sponsored by: