Regularizing Neural Networks by Penalizing Confident Output Distributions on ShortScience.org

arxiv.org
arxiv-vanity.com
scholar.google.com

Regularizing Neural Networks by Penalizing Confident Output Distributions
Gabriel Pereyra and George Tucker and Jan Chorowski and Łukasz Kaiser and Geoffrey Hinton
arXiv e-Print archive - 2017 via Local arXiv
Keywords: cs.NE, cs.LG
more

Summaries/Notes 1

[link] Summary by David Stutz 5 years ago

Pereyra et al. propose an entropy regularizer for penalizing over-confident predictions of deep neural networks. Specifically, given the predicted distribution $p_\theta(y_i|x)$ for labels $y_i$ and network parameters $\theta$, a regularizer

$-\beta \max(0, \Gamma – H(p_\theta(y|x))$

is added to the learning objective. Here, $H$ denotes the entropy and $\beta$, $\Gamma$ are hyper-parameters allowing to weight and limit the regularizers influence. In experiments, this regularizer showed slightly improved performance on MNIST and Cifar-10.

Also find this summary at [davidstutz.de](https://davidstutz.de/category/reading/).

Your comment: