Abstract:Building a good generative model for image has long been an important topic in computer vision and machine learning. Restricted Boltzmann machine (RBM) is one of such models that is simple but powerful. However, its restricted form also has placed heavy constraints on the models representation power and scalability. Many extensions have been invented based on RBM in order to produce deeper architectures with greater power. The most famous ones among them are deep belief network, which stacks multiple layer-wise pretrained RBMs to form a hybrid model, and deep Boltzmann machine, which allows connections between hidden units to form a multi-layer structure. In this paper, we present a new method to compose RBMs to form a multi-layer network style architecture and a training method that trains all layers jointly. We call the resulted structure deep restricted Boltzmann network. We further explore the combination of convolutional RBM with the normal fully connected RBM, which is made trivial under our composition framework. Experiments show that our model can generate descent images and outperform the normal RBM significantly in terms of image quality and feature quality, without losing much efficiency for training.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is to improve the performance of the Restricted Boltzmann Machine (RBM) in image generation and feature extraction. Specifically, the traditional single - layer RBM has limitations in modeling complex probability distributions, especially when dealing with high - dimensional data such as images. To solve this problem, the author proposes a new method to combine RBMs to form a multi - layer network structure and develops a training method that can jointly train all layers. This new structure is called the Deep Restricted Boltzmann Network (DRBN). In addition, the author also explores the possibility of combining the convolutional RBM with the ordinary fully - connected RBM to improve the scalability and performance of the model. The main contributions of the paper include: 1. **Proposing the DRBN architecture**: By stacking multiple RBMs to form a multi - layer network, each RBM is trained as an independent layer, but the entire network can be jointly optimized. 2. **Joint training method**: A method that can train all layers simultaneously has been developed, using Persistent Contrastive Divergence (PCD) to approximate model - dependent terms, thereby achieving effective parameter updates. 3. **Integration of convolutional RBM**: Combining the convolutional RBM with the ordinary RBM to form a more powerful model that can handle high - dimensional data such as images. 4. **Experimental verification**: Through experiments on the MNIST and Weizmann Horses datasets, the superior performance of DRBN in image generation and feature extraction tasks has been proven. These improvements make DRBN significantly superior to the traditional single - layer RBM in image generation quality and also show better performance in feature extraction tasks, especially when labeled data is scarce.

Deep Restricted Boltzmann Networks

Restricted Boltzmann Machine with Adaptive Local Hidden Units.

Structural Restricted Boltzmann Machine for image denoising and classification

Hyperparameters Adaptation for Restricted Boltzmann Machines Based on Free Energy

An Overview of Restricted Boltzmann Machines

Training Restricted Boltzmann Machines with Binary Synapses Using the Bayesian Learning Rule

Restricted Boltzmann Machines: Introduction and Review

Monotone deep Boltzmann machines

A Note on Restricted Boltzmann Machines and Variational Autoencoders

Training Deep Belief Network with Sparse Hidden Units.

An Adaptive Deep Belief Network With Sparse Restricted Boltzmann Machines

Real-Time Keypoint Recognition Using Restricted Boltzmann Machine

Restricted Boltzmann machine based algorithm for multi-objective optimization

Tensor Ring Restricted Boltzmann Machines

Temperature based Restricted Boltzmann Machines

Properties and Bayesian fitting of restricted Boltzmann machines

Fault Diagnosis Based on Sparse Class Gaussian Restrict Boltzmann Machine Model

Gaussian Cardinality Restricted Boltzmann Machines.

Temporally Adaptive Restricted Boltzmann Machine for Background Modeling.

Lip Movement Generation Using Restricted Boltzmann Machines For Visual Speech Synthesis

Nonnegative Restricted Boltzmann Machines for Parts-based Representations Discovery and Predictive Model Stabilization