Neural Boltzmann Machines

Alex H. Lang,Anton D. Loukianov,Charles K. Fisher
2023-05-15
Abstract:Conditional generative models are capable of using contextual information as input to create new imaginative outputs. Conditional Restricted Boltzmann Machines (CRBMs) are one class of conditional generative models that have proven to be especially adept at modeling noisy discrete or continuous data, but the lack of expressivity in CRBMs have limited their widespread adoption. Here we introduce Neural Boltzmann Machines (NBMs) which generalize CRBMs by converting each of the CRBM parameters to their own neural networks that are allowed to be functions of the conditional inputs. NBMs are highly flexible conditional generative models that can be trained via stochastic gradient descent to approximately maximize the log-likelihood of the data. We demonstrate the utility of NBMs especially with normally distributed data which has historically caused problems for Gaussian-Bernoulli CRBMs. Code to reproduce our results can be found at <a class="link-external link-https" href="https://github.com/unlearnai/neural-boltzmann-machines" rel="external noopener nofollow">this https URL</a>.
Machine Learning,Disordered Systems and Neural Networks
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the limitations of Conditional Restricted Boltzmann Machines (CRBMs) in terms of expressiveness and flexibility under existing conditions, especially the difficulties encountered when dealing with continuous data. Specifically, although traditional CRBMs perform well in modeling noisy discrete or continuous data, their wide application is limited due to the fixity of their parameters. The paper introduces a new model - Neural Boltzmann Machines (NBMs). By converting each parameter in CRBMs into its own neural network, which can be a function of conditional inputs, the flexibility of the model is greatly enhanced. NBMs can be trained by stochastic gradient descent to approximately maximize the log - likelihood of data. The paper shows the advantages of NBMs in dealing with normally distributed data, which has historically been a challenge for Gaussian - Bernoulli CRBMs. **Core Problem Summary**: - **Enhancing the Expressiveness of CRBMs**: By converting the parameters of CRBMs into neural networks, the flexibility and expressiveness of the model are increased. - **Solving the Problem of Continuous Data Modeling**: Especially for normally distributed data, NBMs can model more effectively and overcome the limitations of traditional CRBMs. Through these improvements, NBMs can show stronger capabilities in generating high - quality conditional outputs, especially in fields such as image and video generation.