Optimal sampling for least squares approximation with general dictionaries

Philipp Trunschke,Anthony Nouy
2024-10-08
Abstract:We consider the problem of approximating an unknown function in a nonlinear model class from point evaluations. When obtaining these point evaluations is costly, minimising the required sample size becomes crucial. Recently, an increasing focus has been on employing adaptive sampling strategies to achieve this. These strategies are based on linear spaces related to the nonlinear model class, for which the optimal sampling measures are known. However, the resulting optimal sampling measures depend on an orthonormal basis of the linear space, which is known rarely. Consequently, sampling from these measures is challenging in practice. This manuscript presents a sampling strategy that iteratively refines an estimate of the optimal sampling measure by updating it based on previously drawn samples. This strategy can be performed offline and does not require evaluations of the sought function. We establish convergence and illustrate the practical performance through numerical experiments. Comparing the presented approach with standard Monte Carlo sampling demonstrates a significant reduction in the number of samples required to achieve a good estimation of an orthonormal basis.
Numerical Analysis
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to approximate an unknown function from point evaluations in a given nonlinear model class, especially when obtaining these point evaluations is costly, minimizing the required sample size becomes crucial. Recently, more and more attention has been focused on adopting adaptive sampling strategies to achieve this goal. However, these strategies are based on the linear spaces related to the nonlinear model class, and the optimal sampling measure depends on the orthogonal bases of these linear spaces, which are rarely known in practice. Therefore, sampling from these measures is challenging in practical applications. This paper proposes a sampling strategy that gradually refines the estimate of the optimal sampling measure by iteratively updating it according to the previously drawn samples. This strategy can be carried out offline and does not require the evaluation of the function to be sought. The authors establish the convergence of this method and demonstrate its practical performance through numerical experiments. Compared with standard Monte Carlo sampling, this method significantly reduces the number of samples required to achieve a good estimate. Specifically, the paper considers the general problem of an over - complete dictionary of an arbitrary function on a general domain, proposes a simple algorithm, and verifies its effectiveness through numerical experiments. The main contribution lies in using the sampling density \( w^{-1}_{\hat{G}^{(0)}} \rho \) induced by the initial estimate \( \hat{G}^{(0)} \) to improve \( \hat{G}^{(0)} \) itself, thereby gradually improving the accuracy of the estimate through an iterative process.