Flows for Flows: Training Normalizing Flows Between Arbitrary Distributions with Maximum Likelihood Estimation

Samuel Klein, John Andrew Raine, Tobias Golling
DOI: https://doi.org/10.48550/arXiv.2211.02487
IF: 5.414
2022-11-04
Machine Learning
Abstract: Normalizing flows are constructed from a base distribution with a known density and a diffeomorphism with a tractable Jacobian. The base density of a normalizing flow can be parameterised by a different normalizing flow, thus allowing maps to be found between arbitrary distributions. We demonstrate and explore the utility of this approach and show it is particularly interesting in the case of conditional normalizing flows and for introducing optimal transport constraints on maps that are constructed using normalizing flows.
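
The construction in the abstract can be written out with the change-of-variables formula. The block below is a sketch in notation chosen here for illustration (the symbols $f_\theta$, $g_\phi$, $p_X$, $p_Y$, and $p_Z$ are assumptions and may not match the paper's): $f_\theta$ is the flow that maps data $x$ to a second data space $y$, and $g_\phi$ is the flow that supplies the base density $p_Y$ by mapping $y$ to a simple distribution $p_Z$ such as a standard normal.

```latex
\begin{align}
  \log p_X(x) &= \log p_Y\bigl(f_\theta(x)\bigr)
                 + \log\left|\det J_{f_\theta}(x)\right|, \\
  \log p_Y(y) &= \log p_Z\bigl(g_\phi(y)\bigr)
                 + \log\left|\det J_{g_\phi}(y)\right|.
\end{align}
```

Substituting the second line into the first gives a likelihood for $x$ that can be evaluated exactly, which is why the map $f_\theta$ between two otherwise unknown densities can be trained by maximum likelihood.
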
What problem does this paper attempt to address?
This paper addresses how to learn a mapping between two arbitrary distributions with normalizing flows. Specifically, the authors explore how to construct a map from one unknown probability density to another, which is useful in applications such as data calibration, interpolation between distributions, and interpolation within a conditional density. The technique is particularly suited to fields such as high-energy physics, where distributions must be interpolated or transformed between different conditions.

### Core problems of the paper

1. **Learning the mapping between two arbitrary distributions**:
   - The conventional approach fits two independent normalizing flows, one per distribution, and obtains the mapping by composing them. This has drawbacks, such as a large number of parameters and a mapping that is not very intuitive.
   - This paper proposes a new method, "Flows for Flows", in which one normalizing flow uses another normalizing flow as its base distribution, so that the mapping between two arbitrary distributions is learned directly.
2. **Application to conditional normalizing flows**:
   - In a conditional normalizing flow, the parameterised function depends on a conditioning variable. The paper discusses how "Flows for Flows" can learn mappings between conditional distributions in this setting, in particular how to transform a distribution smoothly as the condition changes.
3. **Introducing a distance penalty**:
   - To make the learned mapping more reasonable and meaningful, the authors add a distance penalty term, such as an L1-norm penalty, that reduces how far data points are displaced by the mapping. This pushes the learned function closer to the optimal transport map.
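
The objective described in the three points above can be sketched in a few lines. This is a toy illustration, not the authors' implementation: one-dimensional affine maps stand in for the neural flows, the base flow is assumed to be already fitted and frozen, and all function names, parameter names, and the penalty weight `lam` are hypothetical.

```python
import math


def standard_normal_logpdf(z):
    """Log density of N(0, 1) at z."""
    return -0.5 * (z * z + math.log(2.0 * math.pi))


def base_flow_logprob(y, mu, sigma):
    """log p_Y(y) for a base flow z = (y - mu) / sigma onto N(0, 1).

    The -log|sigma| term is the log-Jacobian of the affine map.
    """
    z = (y - mu) / sigma
    return standard_normal_logpdf(z) - math.log(abs(sigma))


def top_flow_forward(x, a, b):
    """Top flow y = a * x + b and its log |dy/dx| (toy stand-in for a neural flow)."""
    return a * x + b, math.log(abs(a))


def flows_for_flows_loss(xs, a, b, mu, sigma, lam=0.0):
    """Mean negative log-likelihood of xs plus an L1 displacement penalty.

    log p_X(x) = log p_Y(f(x)) + log|det J_f(x)|, where p_Y is itself a flow.
    lam weights the penalty |f(x) - x| (hypothetical hyperparameter).
    """
    total = 0.0
    for x in xs:
        y, log_det = top_flow_forward(x, a, b)
        log_px = base_flow_logprob(y, mu, sigma) + log_det
        total += -log_px + lam * abs(y - x)
    return total / len(xs)


if __name__ == "__main__":
    data = [1.0, 2.0, 3.0]  # samples centred on 2; base flow models N(0, 1)
    print("identity map:", flows_for_flows_loss(data, 1.0, 0.0, 0.0, 1.0))
    print("shift by -2 :", flows_for_flows_loss(data, 1.0, -2.0, 0.0, 1.0))
    print("with penalty:", flows_for_flows_loss(data, 1.0, -2.0, 0.0, 1.0, lam=0.1))
```

In this toy setting the shifted map attains a lower loss than the identity because it moves the samples onto the base density, while the penalty term charges for the distance each point is moved. In practice the parameters of the top flow would be optimised by gradient descent on this loss with the base flow held fixed.
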
### Specific research contents

- **Unconditional distributions**:
  - The authors first validate "Flows for Flows" on two-dimensional datasets, showing that it can learn mappings both from a distribution to itself and between different distributions.
  - The experimental results show that in some cases "Flows for Flows" learns the mapping more effectively than the conventional two-step method (fitting the two distributions first and then composing the flows).
- **Conditional distributions**:
  - The authors further explore conditional normalizing flows, in particular how a distribution is transformed as the condition changes. The experimental results show that "Flows for Flows" performs better on conditional distributions and transforms them more smoothly, with fewer outlying sample points.
- **Effect of the distance penalty**:
  - With the L1-norm distance penalty, the authors show how to further improve "Flows for Flows" so that the learned map is closer to the optimal transport map. The experimental results show that the penalty significantly improves the model's performance when learning the identity mapping and other, more complex mappings.

### Conclusions

- The paper demonstrates the effectiveness and flexibility of the "Flows for Flows" method for learning the mapping between two arbitrary distributions.
- For conditional distributions in particular, the method performs better and transforms distributions more smoothly.
- Adding a distance penalty further improves the model and brings the learned map closer to the optimal transport map.

In short, the paper offers a new perspective on using normalizing flows to learn mappings between arbitrary distributions and shows the approach's potential value in multiple fields.