Abstract: Restricted Boltzmann machines (RBMs) are commonly used to pre-train deep learning models. Contrastive divergence (CD) and parallel tempering (PT) are the traditional training algorithms for RBMs. However, both algorithms have shortcomings when processing high-dimensional and complex data. In particular, the number of temperature chains in PT has a significant impact on the training result, and because the algorithm can diverge, PT cannot fully exploit parallel sampling from multiple temperature chains. With fewer temperature chains, training converges quickly, but accuracy suffers. In theory, more temperature chains help PT achieve higher accuracy, but severe divergence at the beginning of training may ruin the result. To fully exploit the advantages of PT and improve the ability of RBMs to model high-dimensional and complex data, this article proposes dynamic tempering chains (DTC). DTC changes the number of temperature chains dynamically during training: it starts with few temperature chains, gradually increases their number as training proceeds, and finally obtains an accurate RBM. A one-step reconstruction error is also proposed to measure convergence, which reduces the influence of the dynamic training strategy on the reconstruction error. Experiments on MNIST, MNORB, CIFAR-10, and CIFAR-100 indicate that, compared with PT, DTC improves classification accuracy by up to 8%. DTC converges quickly in the early stage of training because there are few exchanges among temperature chains, and it achieves higher accuracy at the end because the model learned with more temperature chains is closer to the global optimum, especially on high-dimensional and complex data. These results show that the DTC algorithm effectively utilizes parallel sampling from multiple temperature chains, overcomes the divergence problem, and further improves the training of the RBM.
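The idea of growing the number of temperature chains during training can be illustrated with a minimal sketch. The abstract does not specify the growth schedule or the temperature spacing, so the linear schedule in `num_chains`, the evenly spaced inverse temperatures, and all function names and default values below are assumptions made for illustration only. The swap rule is the standard Metropolis acceptance used in parallel tempering, not necessarily the exact variant used in the article.

```python
import numpy as np

def num_chains(epoch, total_epochs, min_chains=2, max_chains=10):
    """Hypothetical linear schedule: few chains early, more chains later."""
    frac = epoch / max(total_epochs - 1, 1)
    return int(round(min_chains + frac * (max_chains - min_chains)))

def inverse_temperatures(k):
    """Assumed spacing: k inverse temperatures evenly spread over [0, 1].

    beta = 1 corresponds to the target RBM distribution; a single chain
    is placed at beta = 1 so that it samples the model itself.
    """
    return np.linspace(0.0, 1.0, k) if k > 1 else np.array([1.0])

def swap_probability(energy_a, energy_b, beta_a, beta_b):
    """Standard PT acceptance probability for exchanging two chain states.

    energy_a is the energy of the state currently held by the chain at
    inverse temperature beta_a, and likewise for energy_b and beta_b.
    """
    return min(1.0, float(np.exp((beta_a - beta_b) * (energy_a - energy_b))))

if __name__ == "__main__":
    total_epochs = 10
    for epoch in range(total_epochs):
        k = num_chains(epoch, total_epochs)
        print(epoch, k, inverse_temperatures(k))
```

In this sketch, a training loop would call `num_chains` once per epoch, rebuild the temperature ladder with `inverse_temperatures`, and use `swap_probability` when proposing exchanges between neighboring chains.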

Training Deep Belief Network with Sparse Hidden Units.

Contrastive Divergence Learning of Restricted Boltzmann Machine

Restricted Boltzmann Machine with Adaptive Local Hidden Units.

An Adaptive Deep Belief Network With Sparse Restricted Boltzmann Machines

Enhancing performance of restricted Boltzmann machines via log-sum regularization.

Training Restricted Boltzmann Machines with Binary Synapses Using the Bayesian Learning Rule

Cardinality restricted boltzmann machines

Sparse Deep Belief Net For Handwritten Digits Classification

Sparse Restricted Boltzmann Machine Based on Multiobjective Optimization.

Deep Restricted Boltzmann Networks

Sparse Group Restricted Boltzmann Machines

Generative and Discriminative Infinite Restricted Boltzmann Machine Training

Fuzzy Restricted Boltzmann Machine and Deep Belief Network: A Comparison on Image Reconstruction.

A sparse-response deep belief network based on rate distortion theory.

Analysis of Different Sparsity Methods in Constrained RBM for Sparse Representation in Cognitive Robotic Perception

A New Sparse Restricted Boltzmann Machine

A Novel Restricted Boltzmann Machine Training Algorithm with Dynamic Tempering Chains.

A New Variant of Restricted Boltzmann Machine with Horizontal Connections

Sparse Restricted Boltzmann Machine Based on Data Class Entropy

A Sparse Deep Belief Network with Efficient Fuzzy Learning Framework

An Overview on Restricted Boltzmann Machines.