Abstract: Maximum entropy principle (MEP) offers an effective and unbiased approach to inferring unknown probability distributions when faced with incomplete information, while neural networks provide the flexibility to learn complex distributions from data. This paper proposes a novel neural network architecture, the MEP-Net, which combines the MEP with neural networks to generate probability distributions from moment constraints. We also provide a comprehensive overview of the fundamentals of the maximum entropy principle, its mathematical formulations, and a rigorous justification for its applicability for non-equilibrium systems based on the large deviations principle. Through fruitful numerical experiments, we demonstrate that the MEP-Net can be particularly useful in modeling the evolution of probability distributions in biochemical reaction networks and in generating complex distributions from data.

What problem does this paper attempt to address?

The core problem this paper addresses is: **How can complex, accurate probability distributions be generated from limited information?** Specifically, both traditional statistical methods and modern machine learning techniques often struggle to capture the true probability distribution when only partial information is available, such as statistical moments or other constraints. To address this, the paper proposes MEP-Net, a framework that combines the Maximum Entropy Principle (MEP) with neural networks.

### Problem Background

In many scientific fields, from statistical physics and molecular dynamics to biological systems and climate models, only partial information about a system is available. Traditional statistical methods may fail to capture the true distribution, and modern machine learning techniques may produce physically unreasonable results when applied to scientific problems with limited data.

### Solution

The paper introduces MEP-Net, which combines the Maximum Entropy Principle with neural networks. The MEP provides an effective and unbiased method for inferring unknown probability distributions from incomplete information, while neural networks provide the flexibility to learn complex distributions from data. Together, they allow MEP-Net to generate complex probability distributions given only limited information.

### Main Contributions

1. **Theoretical Basis**: The paper reviews the basic theory of the Maximum Entropy Principle and its mathematical formulation, and rigorously justifies its applicability to non-equilibrium systems based on the Large Deviations Principle.
2. **Model Architecture**: The paper proposes the specific architecture of MEP-Net, including feature library screening and loss function design. In particular, using the binomial function as an observed value (constraint) significantly improves the reconstruction ability of the model.
3. **Numerical Experiments**: A series of numerical experiments shows the effectiveness of MEP-Net in various scenarios, including high-dimensional distributions and time-dependent systems, highlighting its potential when data is limited.

### Formula Presentation

The core of the Maximum Entropy Principle is to maximize the Shannon entropy:

\[ S[p]=-\int_{\mathbb{R}^d}p(\vec{x})\ln p(\vec{x})\,d\vec{x} \]

Under the given constraints, the maximization problem can be expressed as:

\[ \max_{p(\vec{x})}S[p]=-\int_{\mathbb{R}^d}p(\vec{x})\ln p(\vec{x})\,d\vec{x} \]

\[ \text{s.t.}\quad\int_{\mathbb{R}^d}p(\vec{x})\,d\vec{x} = 1 \]

\[ \int_{\mathbb{R}^d}p(\vec{x})f_i(\vec{x})\,d\vec{x}=\tilde{f}_i,\quad i = 1,\dots,M \]

Here the \(f_i\) are the observables and the \(\tilde{f}_i\) are their prescribed expectation values. Using the method of Lagrange multipliers, with one multiplier \(\lambda_i\) per constraint, the constrained problem is transformed into an unconstrained one, whose solution is the optimal probability distribution:

\[ p^*(\vec{x})=\frac{\exp\left(\sum_{i = 1}^M\lambda_i f_i(\vec{x})\right)}{\int_{\mathbb{R}^d}\exp\left(\sum_{i = 1}^M\lambda_i f_i(\vec{x})\right)\,d\vec{x}} \]

### Conclusion

By combining the Maximum Entropy Principle with neural networks, MEP-Net bridges the gap between classical statistical methods and modern machine learning, and opens up new ways to solve inverse problems and to integrate with other learning paradigms.
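To make the Lagrange-multiplier solution from the formula section concrete, here is a minimal sketch of the classical maximum entropy fit on a one-dimensional grid. It is not the MEP-Net architecture, which the summary does not specify in detail. Under the exponential form \(p^* \propto \exp(\sum_i \lambda_i f_i)\), the multipliers minimize the convex dual \(\Gamma(\lambda)=\ln Z(\lambda)-\sum_i\lambda_i\tilde{f}_i\), whose gradient is the constraint mismatch \(\mathbb{E}_p[f_i]-\tilde{f}_i\). The function names `maxent_on_grid` and `solve_linear`, the grid, the choice of observables \(x\) and \(x^2\), the target moments, and the use of a damped Newton solver are all assumptions made for illustration.

```python
import math

def solve_linear(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting (small systems)."""
    n = len(b)
    aug = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (aug[r][n] - s) / aug[r][r]
    return x

def maxent_on_grid(xs, features, targets, max_iter=100, tol=1e-9):
    """Fit p_j proportional to exp(sum_i lam_i f_i(x_j)) so that E_p[f_i] = targets[i].

    Hypothetical helper: a discretised stand-in for the integrals in the text.
    Returns the multipliers lam and the probability masses p on the grid.
    """
    F = [[f(x) for f in features] for x in xs]  # feature values per grid point
    m = len(features)

    def probs_and_logz(lam):
        logits = [sum(l * v for l, v in zip(lam, row)) for row in F]
        top = max(logits)  # shift by the maximum to avoid overflow in exp
        w = [math.exp(z - top) for z in logits]
        total = sum(w)
        return [wi / total for wi in w], top + math.log(total)

    def dual(lam):
        # Gamma(lam) = log Z(lam) - sum_i lam_i * target_i (convex in lam)
        return probs_and_logz(lam)[1] - sum(l * t for l, t in zip(lam, targets))

    lam = [0.0] * m  # lam = 0 corresponds to the uniform distribution
    for _ in range(max_iter):
        p, _ = probs_and_logz(lam)
        mean = [sum(pj * row[i] for pj, row in zip(p, F)) for i in range(m)]
        grad = [mean[i] - targets[i] for i in range(m)]  # constraint mismatch
        if max(abs(g) for g in grad) < tol:
            break
        # Hessian of the dual is the covariance matrix of the features under p.
        hess = [[sum(pj * (row[i] - mean[i]) * (row[k] - mean[k])
                     for pj, row in zip(p, F)) for k in range(m)] for i in range(m)]
        step = solve_linear(hess, [-g for g in grad])  # Newton direction
        slope = sum(g * s for g, s in zip(grad, step))
        base, t = dual(lam), 1.0
        # Backtracking (Armijo) line search keeps the Newton iteration stable.
        while t > 1e-8 and dual([l + t * s for l, s in zip(lam, step)]) > base + 1e-4 * t * slope:
            t *= 0.5
        if t <= 1e-8:  # line search stalled at round-off level; stop
            break
        lam = [l + t * s for l, s in zip(lam, step)]
    p, _ = probs_and_logz(lam)
    return lam, p

# Usage: constrain the mean and second moment (illustrative target values).
xs = [-6.0 + 0.05 * j for j in range(241)]
lam, p = maxent_on_grid(xs, [lambda x: x, lambda x: x * x], [0.5, 1.25])
print("multipliers:", lam)
print("fitted mean:", sum(pj * x for pj, x in zip(p, xs)))
```

With only the first two moments constrained, the maximum entropy density is Gaussian, so for these targets (mean 0.5, variance 1) the multipliers should come out close to \(\lambda_1 = 0.5\) and \(\lambda_2 = -0.5\), up to small truncation effects from the finite grid. Newton's method is used here because the dual is smooth and low-dimensional; for many constraints or high-dimensional \(\vec{x}\), where the normalizing integral cannot be evaluated on a grid, a different approach is needed, which is the setting the paper targets.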

MEP-Net: Generating Solutions to Scientific Problems with Limited Knowledge by Maximum Entropy Principle

Maximum Entropy Principle Analysis in Network Systems with Short-Time Recordings

A Neural Network MCMC Sampler That Maximizes Proposal Entropy

Modeling Extremes with d-max-decreasing Neural Networks

Maximum Entropy Generators for Energy-Based Models

Dynamical and Coupling Structure of Pulse-Coupled Networks in Maximum Entropy Analysis

Partially Observed Maximum Entropy Discrimination Markov Networks

Maximum Probability and Relative Entropy Maximization. Bayesian Maximum Probability and Empirical Likelihood

Neural networks taking probability distributions as input: A framework for analyzing exchangeable networks

A New Maximum Entropy Method for Estimation of Multimodal Probability Density Function

Maximum entropy in dynamic complex networks

A Dynamical State Underlying the Second Order Maximum Entropy Principle in Neuronal Networks

Scalable Bayesian Uncertainty Quantification for Neural Network Potentials: Promise and Pitfalls

The Principle of Uncertain Maximum Entropy

Neural Joint Entropy Estimation

Bayesian Entropy Neural Networks for Physics-Aware Prediction

Constrained or Unconstrained? Neural-Network-Based Equation Discovery from Data

Maximum Entropy Principle, Equal Probability a Priori and Gibbs Paradox

Maximum Multiscale Entropy and Neural Network Regularization

Exact computation of the Maximum Entropy Potential of spiking neural networks models

MPCE: A Maximum Probability Based Cross Entropy Loss Function for Neural Network Classification