Learning linear operators: Infinite-dimensional regression as a well-behaved non-compact inverse problem

Mattes Mollenhauer, Nicole Mücke, T. J. Sullivan
2024-07-10
Abstract: We consider the problem of learning a linear operator $\theta$ between two Hilbert spaces from empirical observations, which we interpret as least squares regression in infinite dimensions. We show that this goal can be reformulated as an inverse problem for $\theta$ with the feature that its forward operator is generally non-compact (even if $\theta$ is assumed to be compact or of $p$-Schatten class). However, we prove that, in terms of spectral properties and regularisation theory, this inverse problem is equivalent to the known compact inverse problem associated with scalar response regression. Our framework allows for the elegant derivation of dimension-free rates for generic learning algorithms under Hölder-type source conditions. The proofs rely on the combination of techniques from kernel regression with recent results on concentration of measure for sub-exponential Hilbertian random variables. The obtained rates hold for a variety of practically-relevant scenarios in functional regression as well as nonlinear regression with operator-valued kernels and match those of classical kernel regression with scalar response.
Statistics Theory, Functional Analysis, Probability, Machine Learning
What problem does this paper attempt to address?
This paper addresses the problem of learning a linear operator \( \theta \) between two Hilbert spaces from empirical observations, which can be interpreted as least-squares regression in infinite dimensions. Its core objectives are:

1. **Define the problem**: Consider square-integrable random variables \( X \) and \( Y \) taking values in Hilbert spaces \( \mathcal{X} \) and \( \mathcal{Y} \), respectively. The goal is to minimize \( \mathbb{E}[\|Y - \theta X\|^2_{\mathcal{Y}}] \) over bounded linear operators \( \theta \in L(\mathcal{X}, \mathcal{Y}) \). If at least one of \( \mathcal{X} \) or \( \mathcal{Y} \) is infinite-dimensional, then so is \( L(\mathcal{X}, \mathcal{Y}) \), which makes this an infinite-dimensional regression problem.
2. **Reformulate as an inverse problem**: The paper recasts the regression problem as an inverse problem for \( \theta \) whose forward operator is generally non-compact, even if \( \theta \) is assumed to be compact or of \( p \)-Schatten class. The authors nevertheless prove that, in terms of spectral properties and regularization theory, this inverse problem is equivalent to the known compact inverse problem associated with scalar-response regression.
3. **Regularization and convergence rates**: The paper provides a framework for deriving dimension-free convergence rates for generic learning algorithms under Hölder-type source conditions. The proofs combine techniques from kernel regression with recent results on concentration of measure for sub-exponential Hilbertian random variables.
4. **Practical applications**: The paper discusses how the framework applies to practical scenarios such as functional linear regression, non-parametric regression with vector-valued kernels, conditional mean embeddings, and inference for Hilbert space-valued time series.

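
To make the regression objective and the role of regularization concrete, the following is a minimal finite-dimensional sketch, not the paper's method or code. It assumes \( \mathcal{X} = \mathbb{R}^d \) and \( \mathcal{Y} = \mathbb{R}^m \), uses uncentred empirical covariances, and picks Tikhonov (ridge) regularization as one example of a regularization scheme. The function name `tikhonov_operator_estimate`, the matrix convention \( C_{XY} \approx \frac{1}{n} \sum_i x_i y_i^\top \), and all parameter values are illustrative assumptions.

```python
import numpy as np


def tikhonov_operator_estimate(X, Y, lam):
    """Ridge (Tikhonov) estimate of theta in the model Y ≈ theta X.

    X has shape (n, d) and Y has shape (n, m); rows are samples.
    Returns an (m, d) matrix. Hypothetical helper for illustration only.
    """
    n, d = X.shape
    C_xx = X.T @ X / n  # empirical (uncentred) covariance of X, shape (d, d)
    C_xy = X.T @ Y / n  # empirical cross-covariance, shape (d, m)
    # Solve (C_xx + lam * I) A = C_xy, then take the adjoint (transpose).
    A = np.linalg.solve(C_xx + lam * np.eye(d), C_xy)
    return A.T


# Synthetic data: a known operator plus small Gaussian noise (assumed setup).
rng = np.random.default_rng(0)
n, d, m = 5000, 8, 5
theta_true = rng.standard_normal((m, d))
X = rng.standard_normal((n, d))
Y = X @ theta_true.T + 0.1 * rng.standard_normal((n, m))

theta_hat = tikhonov_operator_estimate(X, Y, lam=1e-3)
rel_err = np.linalg.norm(theta_hat - theta_true) / np.linalg.norm(theta_true)
print(f"relative Frobenius error: {rel_err:.4f}")
```

In finite dimensions the regularization parameter `lam` mainly stabilizes the linear solve. In the infinite-dimensional setting the summary describes, the inverse of the covariance operator is in general unbounded, which is why a regularization scheme is needed at all.
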
### Main contributions

- **Mathematical background**: Provides the mathematical background of infinite-dimensional linear regression, in particular its treatment within the general theory of inverse problems.
- **Functional analysis framework**: Provides a functional-analytic framework for the statistical analysis of regularized empirical solutions.
- **Regularization theory**: Shows that, although the forward operator is non-compact, the inverse problem can still be treated with regularization methods, which yields solutions to the infinite-dimensional regression problem.
- **Convergence rates**: Derives dimension-free convergence rates for generic learning algorithms under Hölder-type source conditions; these rates match those of classical kernel regression with scalar response.

### Solution of specific problems

- **Existence**: Discusses necessary and sufficient conditions for the existence of the regression solution \( \theta^* \) and identifies practical scenarios in which these conditions hold.
- **Uniqueness**: Proves that the regression solution \( \theta^* \) is unique when the distribution \( \mathcal{L}(X) \) of \( X \) is non-degenerate.
- **Closed-form solution**: Under certain conditions, gives the closed-form solution \( \theta^* = (C_{XX}^\dagger C_{XY})^* \) of the regression problem.

### Summary

The paper treats the problem of learning a linear operator between infinite-dimensional spaces by reformulating infinite-dimensional linear regression as an inverse problem and applying regularization techniques. Beyond its theoretical contributions, it provides tools and methods for practical applications.
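
The closed-form expression \( \theta^* = (C_{XX}^\dagger C_{XY})^* \) listed above can be checked numerically in a finite-dimensional surrogate. The sketch below is an illustration under stated assumptions rather than the paper's construction: it takes \( \mathcal{X} = \mathbb{R}^d \), \( \mathcal{Y} = \mathbb{R}^m \), the matrix convention \( C_{XY} \approx \frac{1}{n} \sum_i x_i y_i^\top \), noise-free responses, and a deliberately degenerate \( X \) supported on an \( r \)-dimensional subspace. In that degenerate case the minimizer is not unique, and the pseudoinverse selects the one that vanishes on the orthogonal complement of the support of \( X \). All variable names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(1)
n, d, m, r = 200, 6, 4, 3

# Orthogonal projection P onto a random r-dimensional subspace of R^d,
# so that X is degenerate (its covariance has rank r < d).
Q, _ = np.linalg.qr(rng.standard_normal((d, r)))
P = Q @ Q.T

theta_true = rng.standard_normal((m, d))
X = rng.standard_normal((n, d)) @ P  # samples lie in range(P)
Y = X @ theta_true.T                 # noise-free responses

C_xx = X.T @ X / n  # empirical covariance, shape (d, d), rank r
C_xy = X.T @ Y / n  # empirical cross-covariance, shape (d, m)

# Finite-dimensional analogue of theta* = (C_XX^dagger C_XY)^*.
# An explicit cutoff keeps numerically zero singular values out of the inverse.
theta_star = (np.linalg.pinv(C_xx, rcond=1e-10) @ C_xy).T

# theta_star reproduces the data exactly ...
fits_data = np.allclose(theta_star @ X.T, Y.T)
# ... and equals theta_true restricted to the support of X.
is_min_norm = np.allclose(theta_star, theta_true @ P)
print(fits_data, is_min_norm)
```

When \( X \) is non-degenerate, \( P \) is the identity in this sketch and `theta_star` coincides with `theta_true`, in line with the uniqueness statement above.
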