Subspace Restricted Boltzmann Machine

Jakub M. Tomczak, Adam Gonczarek
DOI: https://doi.org/10.48550/arXiv.1407.4422
2014-07-17
Abstract: The subspace Restricted Boltzmann Machine (subspaceRBM) is a third-order Boltzmann machine where multiplicative interactions are between one visible and two hidden units. There are two kinds of hidden units, namely, gate units and subspace units. The subspace units reflect variations of a pattern in data and the gate unit is responsible for activating the subspace units. Additionally, the gate unit can be seen as a pooling feature. We evaluate the behavior of subspaceRBM through experiments with MNIST digit recognition task, measuring reconstruction error and classification error.
Machine Learning
What problem does this paper attempt to address?
The paper addresses a limitation of the standard Restricted Boltzmann Machine (RBM): it has difficulty modeling complex data patterns and, in particular, reflecting the statistical dependencies among them. To address this, the authors propose the subspace Restricted Boltzmann Machine (subspaceRBM), a third-order Boltzmann machine in which multiplicative interactions occur between one visible unit and two hidden units. The model uses two kinds of hidden units: subspace units and gate units. Subspace units reflect variations of a pattern in the data, while a gate unit is responsible for activating its subspace units and can be regarded as a pooling feature. The stated aim is to capture complex patterns and statistical dependencies more effectively, particularly when training data are scarce, and thereby to improve generalization and robustness. The paper evaluates subspaceRBM on the MNIST handwritten digit recognition task, measuring both reconstruction error and classification error.
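
To make the "one visible and two hidden units" interaction concrete, the following is a minimal sketch of an energy function with that third-order structure. The exact form, the bias terms, and all names (`energy`, `W`, `b`, `c`, `D`, the sizes N, M, K) are assumptions for illustration and are not given in the text above; the paper should be consulted for the authors' actual definition.

```python
def energy(x, h, S, W, b, c, D):
    """Energy of a joint configuration under an ASSUMED third-order form:

        E = - sum_ijk W[i][j][k] * x[i] * h[j] * S[j][k]
            - sum_i b[i] * x[i]
            - sum_j c[j] * h[j]
            - sum_jk D[j][k] * h[j] * S[j][k]

    x: N binary visible units
    h: M binary gate units
    S: M x K binary subspace units (K per gate unit)
    W: N x M x K weights; b, c, D: biases (hypothetical names)
    """
    e = -sum(bi * xi for bi, xi in zip(b, x))
    for j, hj in enumerate(h):
        if hj == 0:
            continue  # gate off: its subspace units contribute nothing
        e -= c[j]
        for k, sjk in enumerate(S[j]):
            if sjk == 0:
                continue
            e -= D[j][k]
            # each term couples one visible unit with a gate and a subspace unit
            e -= sum(W[i][j][k] * x[i] for i in range(len(x)))
    return e


# Tiny example: N=2 visible units, M=1 gate unit, K=2 subspace units.
W = [[[0.5, -1.0]], [[2.0, 0.25]]]
b = [0.1, 0.2]
c = [0.3]
D = [[0.4, 0.6]]
x = [1, 0]

e_on = energy(x, [1], [[1, 0]], W, b, c, D)
e_off_a = energy(x, [0], [[1, 0]], W, b, c, D)
e_off_b = energy(x, [0], [[0, 1]], W, b, c, D)
```

In this sketch, switching a gate unit off removes every term involving its subspace units, so `e_off_a` and `e_off_b` are equal regardless of the subspace configuration. This is one way to read the statement that the gate unit is responsible for activating the subspace units.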