Abstract: A relatively new set of transport-based transforms (CDT, R-CDT, LOT) has shown its strength and great potential in various image and data processing tasks such as parametric signal estimation, classification, and cancer detection, among many others. It is hence worthwhile to elucidate some of the mathematical properties that explain the successes of these transforms when they are used as tools in data analysis, signal processing or data classification. In particular, we give conditions under which classes of signals that are created by algebraic generative models are transformed into convex sets by the transport transforms. Such convexification of the classes simplifies classification and other data analysis and processing problems when viewed in the transform domain. More specifically, we study the extent and limitations of the convexification ability of these transforms under an algebraic generative modeling framework. We hope that this paper will serve as an introduction to these transforms and will encourage mathematicians and other researchers to further explore the theoretical underpinnings and algorithmic tools that will help explain the successes of these transforms and lay the groundwork for further successful applications.

What problem does this paper attempt to address?

The core problem this paper addresses is to explain, in mathematical terms, why transport-based transforms (such as CDT, R-CDT and LOT) succeed in data processing, signal processing and machine learning. Specifically, the authors aim to clarify under what conditions these transforms map signal classes generated by algebraic generative models to convex sets. This convexification simplifies classification and other data analysis and processing problems when they are posed in the transform domain.

### Main Problem Decomposition

1. **Convexification Conditions**: The paper explores which conditions allow signal classes to be mapped to convex sets by transport-based transforms (such as CDT, R-CDT and LOT). For one-dimensional generative models, the authors give a specific condition: if the set of inverses of the generating transformations is convex, then the transformed signal class is also convex (Theorem 3.2 and its Corollary 3.3).
2. **Limitations of Multidimensional Generative Models**: In the multidimensional case (\(d\geq2\)) the situation is more complex. The authors provide a sufficient condition: if the set of inverse transformations is a convex set satisfying specific additional properties, then the generated signal class is convex in the transform domain (Theorem 4.3). They also discuss how these conditions can be relaxed on certain subsets of signals to widen the scope of application (Theorem 4.10).
3. **Practical Applications**:
   - **Simplifying Classification Problems**: In the transform domain, disjoint convex signal classes can be separated by a linear classifier.
   - **Simplifying Estimation Problems**: Convexity allows the use of linear least-squares techniques, avoiding the complexity of nonlinear, non-convex optimization.
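The one-dimensional condition can be motivated by a short derivation. The sketch below rests on assumptions that are not stated in the summary above: the generative model is taken to have the form \(p_g = g'\,(p\circ g)\) for strictly increasing, differentiable bijections \(g\) of \(\mathbb{R}\) drawn from a set \(G\); \(F_p\) denotes the cumulative distribution of \(p\) and is assumed strictly increasing on the support of \(p\); and the CDT \(\hat{p}\) with respect to a reference density \(r\) is taken to satisfy \(F_p(\hat{p}(x)) = F_r(x)\). The symbols \(G\), \(g\) and \(F_p\) are illustrative notation.

\[
F_{p_g}(t)=\int_{-\infty}^{t} g'(\xi)\,p\bigl(g(\xi)\bigr)\,d\xi = F_p\bigl(g(t)\bigr)
\;\Longrightarrow\;
F_p\bigl(g(\hat{p}_g(x))\bigr)=F_r(x)=F_p\bigl(\hat{p}(x)\bigr)
\;\Longrightarrow\;
\hat{p}_g = g^{-1}\circ\hat{p}
\]

\[
\lambda\,\hat{p}_{g_1} + (1-\lambda)\,\hat{p}_{g_2}
= \bigl(\lambda\,g_1^{-1} + (1-\lambda)\,g_2^{-1}\bigr)\circ\hat{p},
\qquad 0\leq\lambda\leq 1
\]

Under these assumptions, a convex combination of two transformed signals stays in the transformed class whenever \(\lambda\,g_1^{-1} + (1-\lambda)\,g_2^{-1}\) is again the inverse of some element of \(G\), that is, whenever \(G^{-1}=\{g^{-1}\mid g\in G\}\) is convex. For example, translations \(g(x)=x-\mu\) have \(g^{-1}(x)=x+\mu\), so \(\hat{p}_g=\hat{p}+\mu\), and the transformed class is convex whenever the admissible shifts \(\mu\) form an interval.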
### Mathematical Expressions

- **Signal Space \(P_d\)**:
  \[ P_d=\left\{p:\mathbb{R}^d\rightarrow\mathbb{R}_+ \;\middle|\; \operatorname{supp}(p)=\Omega_p\text{ is compact},\ \int_{\mathbb{R}^d}p(x)\,dx = 1\right\} \]
- **Optimal Transport Map \(T\)**:
  \[ T\in F_d=\left\{f:\mathbb{R}^d\rightarrow\mathbb{R}^d \;\middle|\; f = \nabla\phi\text{ for some convex }\phi:\mathbb{R}^d\rightarrow\mathbb{R}\right\} \]
- **CDT Transformation** (of \(p\in P_1\) with respect to a reference density \(r\)):
  \[ \hat{p}(x)=\sup\left\{t \;\middle|\; \int_{-\infty}^t p(\xi)\,d\xi\leq\int_{-\infty}^x r(\xi)\,d\xi\right\} \]
- **Wasserstein Distance**:
  \[ W_2(\mu,\nu)=\left(\inf_{\pi\in\Pi(\mu,\nu)}\int_{\mathbb{R}^d\times\mathbb{R}^d}|x - y|^2\,d\pi(x,y)\right)^{1/2} \]

### Conclusion

By studying the convexification ability of transport-based transforms, this paper provides a theoretical basis for understanding their success and lays the foundation for further applications and development. In particular, the convexification property makes classification and estimation in the transform domain simpler and more efficient.
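As a numerical illustration of the CDT and Wasserstein definitions above, the following minimal sketch compares a class of translated densities in the signal domain and in the transform domain. It makes several assumptions that do not come from the paper: the reference density is uniform on \([0,1]\) (so \(F_r(x)=x\) and the CDT reduces to the quantile function of \(p\)), the template is a Gaussian truncated to a finite grid, the class contains translations only, and the helper names `cdt_uniform_ref` and `gaussian` are hypothetical. The \(W_2\) value uses the standard one-dimensional identity \(W_2^2=\int_0^1|F_\mu^{-1}(u)-F_\nu^{-1}(u)|^2\,du\).

```python
import numpy as np


def cdt_uniform_ref(p, grid, levels):
    """Approximate CDT of density `p` w.r.t. a uniform reference on [0, 1].

    For this reference F_r(x) = x, so p_hat(x) is the quantile function
    F_p^{-1}(x), evaluated here at the points in `levels`.
    """
    cdf = np.cumsum(p)
    cdf = cdf / cdf[-1]                  # normalize so the discrete CDF ends at 1
    return np.interp(levels, cdf, grid)  # invert the CDF by linear interpolation


def gaussian(grid, mu, sigma=1.0):
    """Gaussian density sampled on `grid` (illustrative template signal)."""
    return np.exp(-0.5 * ((grid - mu) / sigma) ** 2) / (sigma * np.sqrt(2 * np.pi))


grid = np.linspace(-8.0, 8.0, 4001)
n = 200
# Midpoint levels in (0, 1); they stay away from the flat tails of the CDF.
levels = (np.arange(n) + 0.5) / n

mu1, mu2 = -1.0, 2.0
p1, p2 = gaussian(grid, mu1), gaussian(grid, mu2)
p_mid = gaussian(grid, 0.5 * (mu1 + mu2))  # translate by the average shift

h1, h2, h_mid = (cdt_uniform_ref(p, grid, levels) for p in (p1, p2, p_mid))

# Transform domain: the midpoint of two transforms is compared with the
# transform of the density translated by the average shift.
gap_transform = np.max(np.abs(0.5 * (h1 + h2) - h_mid))

# Signal domain: the pointwise average of the two densities is compared
# with the same translated density.
gap_signal = np.max(np.abs(0.5 * (p1 + p2) - p_mid))

# One-dimensional W_2 from quantile functions (midpoint rule on the levels).
w2 = np.sqrt(np.mean((h1 - h2) ** 2))

print("gap in transform domain:", gap_transform)
print("gap in signal domain:   ", gap_signal)
print("W2 estimate:            ", w2)
```

With these choices the transform-domain gap should be at the level of discretization error, while the signal-domain gap stays large because the pointwise average of two translates is a bimodal mixture rather than a translate. The \(W_2\) estimate should be close to \(|\mu_1-\mu_2|\), since each quantile is shifted by the same amount.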

Partitioning signal classes using transport transforms for data analysis and machine learning
