A class of kernel-based scalable algorithms for data science

Philippe G. LeFloch, Jean-Marc Mercier, Shohruh Miryusupov
2024-10-18
Abstract: We present several generative and predictive algorithms based on the RKHS (reproducing kernel Hilbert space) methodology which, most importantly, are scalable in the following sense. It is well recognized that the RKHS methodology leads to efficient and robust algorithms for numerous tasks in data science, statistics, and scientific computations. However, the standard implementations remain difficult to scale to large data sets. In this paper, we introduce a simple and robust divide-and-conquer approach which applies to large-scale data sets and relies on suitably formulated kernel-based algorithms: we distinguish between extrapolation, interpolation, and optimal transport steps. We explain how to select the best algorithms in specific applications thanks to some feedback, mainly consisting of performance criteria. Our main focus is on challenging problems arising in industrial applications, such as the generation of meshes for efficient numerical simulations, the design of generators of conditional distributions, the transition probability matrix for statistical or stochastic applications, as well as various tasks of interest to the artificial intelligence community. Indeed, the proposed algorithms are relevant for supervised or unsupervised learning, generative methods, and reinforcement learning.
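To make the divide-and-conquer idea concrete, the following is a minimal sketch and not the authors' algorithm. It assumes a Gaussian kernel, a partition obtained by sorting along the first coordinate, and a small ridge term for numerical stability. The function names `gaussian_kernel`, `fit_divide_and_conquer`, and `predict`, as well as the parameter values, are hypothetical choices made for illustration only.

```python
import numpy as np

def gaussian_kernel(a, b, bandwidth):
    # Pairwise Gaussian kernel matrix between the rows of a and the rows of b.
    d2 = (a**2).sum(1)[:, None] + (b**2).sum(1)[None, :] - 2.0 * a @ b.T
    return np.exp(-np.maximum(d2, 0.0) / (2.0 * bandwidth**2))

def fit_divide_and_conquer(x, y, n_parts=4, bandwidth=0.1, ridge=1e-6):
    # Assumption: partition the data by sorting along the first coordinate.
    # Any clustering could be substituted here.
    order = np.argsort(x[:, 0])
    models = []
    for idx in np.array_split(order, n_parts):
        xk, yk = x[idx], y[idx]
        # Each part solves its own small regularized Gram system, so the cost
        # is n_parts * O((n / n_parts)^3) instead of O(n^3).
        gram = gaussian_kernel(xk, xk, bandwidth) + ridge * np.eye(len(xk))
        models.append((xk.mean(axis=0), xk, np.linalg.solve(gram, yk)))
    return models

def predict(models, x_new, bandwidth=0.1):
    # Route each query point to the local model with the nearest center.
    centers = np.stack([m[0] for m in models])
    d2 = ((x_new[:, None, :] - centers[None, :, :]) ** 2).sum(-1)
    nearest = np.argmin(d2, axis=1)
    out = np.empty(len(x_new))
    for k, (_, xk, coeffs) in enumerate(models):
        mask = nearest == k
        if mask.any():
            out[mask] = gaussian_kernel(x_new[mask], xk, bandwidth) @ coeffs
    return out

# Toy usage on a smooth one-dimensional target.
rng = np.random.default_rng(0)
x_train = rng.uniform(0.0, 1.0, size=(400, 1))
y_train = np.sin(2.0 * np.pi * x_train[:, 0])
models = fit_divide_and_conquer(x_train, y_train)
x_test = np.linspace(0.05, 0.95, 50)[:, None]
y_pred = predict(models, x_test)
max_error = np.max(np.abs(y_pred - np.sin(2.0 * np.pi * x_test[:, 0])))
```

Splitting the data keeps every linear solve small, which is the main reason such a scheme scales. The price is that each prediction only sees the data of one part, so the quality of the partition matters.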
Numerical Analysis
What problem does this paper attempt to address?