REVE: Regularizing Deep Learning with Variational Entropy Bound

Antoine Saporta,Yifu Chen,Michael Blot,Matthieu Cord
DOI: https://doi.org/10.1109/ICIP.2019.8804396
2019-10-15
Abstract:Studies on generalization performance of machine learning algorithms under the scope of information theory suggest that compressed representations can guarantee good generalization, inspiring many compression-based regularization methods. In this paper, we introduce REVE, a new regularization scheme. Noting that compressing the representation can be sub-optimal, our first contribution is to identify a variable that is directly responsible for the final prediction. Our method aims at compressing the class conditioned entropy of this latter variable. Second, we introduce a variational upper bound on this conditional entropy term. Finally, we propose a scheme to instantiate a tractable loss that is integrated within the training procedure of the neural network and demonstrate its efficiency on different neural networks and datasets.
Machine Learning
What problem does this paper attempt to address?