Convergence analysis of OT-Flow for sample generation

Yang Jing, Lei Li
2024-03-25
Abstract: Deep generative models aim to learn the underlying distribution of data and generate new samples. Despite the diversity of generative models and their high-quality generation performance in practice, most of them lack rigorous theoretical convergence proofs. In this work, we aim to establish some convergence results for OT-Flow, one of the deep generative models. First, by reformulating the framework of the OT-Flow model, we establish the $\Gamma$-convergence of the OT-Flow formulation to the corresponding optimal transport (OT) problem as the regularization parameter $\alpha$ goes to infinity. Second, since the loss function is approximated by the Monte Carlo method in training, we also establish the convergence of the discrete loss function to the continuous one as the sample number $N$ goes to infinity. Meanwhile, the approximation capability of the neural network provides an upper bound for the discrete loss function of the minimizers. The proofs in both aspects provide theoretical assurances for OT-Flow.
Numerical Analysis, Machine Learning
What problem does this paper attempt to address?
The paper analyzes the convergence of the Optimal Transport Flow (OT-Flow) model, a continuous normalizing flow (CNF) model for sample generation. Although deep generative models perform well across many tasks, rigorous convergence proofs for them are largely lacking. The paper establishes two results for OT-Flow. First, it proves the Γ-convergence of the continuous OT-Flow optimization problem as the regularization parameter tends to infinity, meaning its solution approaches the solution of the classical optimal transport problem. Second, it studies the convergence of the minimizers when the training loss function is approximated by Monte Carlo methods with increasing sample size. These analyses provide theoretical support for understanding the stability and intrinsic mechanisms of the OT-Flow model.
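The Monte Carlo step mentioned above can be illustrated with a toy example. The sketch below is not the OT-Flow loss from the paper: it assumes a hypothetical integrand `toy_loss(x) = 0.5 * x**2` and standard normal samples, for which the exact expectation is 0.5, and only shows how a sample-average loss approaches its continuous counterpart as the sample number $N$ grows.

```python
import random

# Hypothetical stand-in for a per-sample loss; NOT the OT-Flow objective.
def toy_loss(x: float) -> float:
    return 0.5 * x * x

# Exact value of E[toy_loss(X)] for X ~ N(0, 1), since E[X^2] = 1.
EXACT_LOSS = 0.5

def monte_carlo_loss(n_samples: int, seed: int = 0) -> float:
    """Sample-average (discrete) approximation of E[toy_loss(X)]."""
    rng = random.Random(seed)  # fixed seed so runs are reproducible
    total = 0.0
    for _ in range(n_samples):
        total += toy_loss(rng.gauss(0.0, 1.0))
    return total / n_samples

if __name__ == "__main__":
    # The gap to the exact value shrinks roughly like 1 / sqrt(N).
    for n in (100, 10_000, 200_000):
        estimate = monte_carlo_loss(n)
        print(n, estimate, abs(estimate - EXACT_LOSS))
```

In this toy setting the error decays at the usual Monte Carlo rate of order $N^{-1/2}$; the paper's result concerns the analogous statement for the actual OT-Flow loss and its minimizers, which this sketch does not attempt to reproduce.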