Improving the Generalization Ability of Restricted Boltzmann Machines Via Theta Pure Dependency.

Qi Xu,Yuexian Hou
DOI: https://doi.org/10.1145/3094243.3094246
2017-01-01
Abstract:The Restricted Boltzmann Machine (RBM) is an important probabilistic graphical model which often serves as a building block for deep belief networks (DBNs). A critical issue for DBNs is overfitting. The generalization ability of models is affected if an inappropriate model complexity is selected. In order to mitigate this problem, we introduce the Pure Dependency-Restricted Boltzmann Machine (PD-RBM). Compared to the regular RBM, the PD-RBM can adaptively change its structure in the face of different data in order to make its model complexity fit data. The establishment of the structure of PD-RBMs is based on Theta Pure Dependency (TPD), which is a statistical measure of variable dependency, defined under the Information Geometry framework. When specific data is given, we show an algorithm that is used to establish the structure of PD-RBMs. We evaluate the PD-RBM on different datasets. Our experiments show that it significantly improves test performance.
What problem does this paper attempt to address?