Batch Renormalization: Towards Reducing Minibatch Dependence in Batch-Normalized Models
Sergey Ioffe
arXiv e-Print archive - 2017 via Local arXiv
Keywords: cs.LG


"The problem with using moving averages [[in inference]]" -> I believe this is supposed to be [[in training]]?

Modified, thanks for pointing that out.

