A Bayesian encourages dropout

Shin-ichi Maeda
DOI: https://doi.org/10.48550/arXiv.1412.7003
2014-12-30
Abstract:Dropout is one of the key techniques to prevent the learning from overfitting. It is explained that dropout works as a kind of modified L2 regularization. Here, we shed light on the dropout from Bayesian standpoint. Bayesian interpretation enables us to optimize the dropout rate, which is beneficial for learning of weight parameters and prediction after learning. The experiment result also encourages the optimization of the dropout.
Machine Learning,Neural and Evolutionary Computing
What problem does this paper attempt to address?