Abstract: The random Fourier features (RFFs) method is a powerful and popular kernel approximation technique for scaling kernel methods. The theoretical foundation of RFFs is the Bochner theorem (Bochner in Harmonic Analysis and the Theory of Probability, University of California Press, 1955), which relates symmetric, positive definite (PD) functions to probability measures. This condition naturally excludes asymmetric functions, which have a wide range of applications in practice, e.g., directed graphs, conditional probability, and asymmetric kernels. Nevertheless, asymmetric functions (kernels) and their scalability via RFFs remain poorly understood, both theoretically and empirically. In this paper, we introduce a complex measure whose real and imaginary parts correspond to four finite positive measures, which expands the application scope of the Bochner theorem. This framework handles classical symmetric, PD kernels via one positive measure; symmetric, non-positive definite kernels via signed measures; and asymmetric kernels via complex measures, thereby unifying them into a general RFF framework, named AsK-RFFs. This approximation scheme via complex measures enjoys theoretical guarantees from the perspective of uniform convergence. In the algorithmic implementation, the kernel approximation process is expensive because of the calculation of total masses, so we propose a subset-based fast estimation method to speed it up. This method optimizes the total masses on a sub-training set, replacing the numerical integration for the total masses with a three-dimensional quadratic program of low time complexity. AsK-RFFs provides two explicit feature mappings to approximate an asymmetric kernel. These mappings can be used in classifiers within the framework of AsK-LS (He et al. in IEEE Trans Pattern Anal Machine Intell 45(8):10044–10054), which makes asymmetric kernels usable in machine learning applications. Our AsK-RFFs method is empirically validated on several typical large-scale datasets and achieves promising kernel approximation performance, which demonstrates its effectiveness.
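
To make the starting point concrete, the following is a minimal sketch of classical RFFs for the symmetric, PD case only, where a single positive (here Gaussian) spectral measure is sampled. It is not the AsK-RFFs construction itself. The Gaussian kernel, the function names `rff_features` and `gaussian_kernel`, and all parameter values are assumptions chosen for illustration.

```python
import numpy as np

def rff_features(X, W, b):
    """Map X (n, d) to random Fourier features of shape (n, D)."""
    D = W.shape[1]
    return np.sqrt(2.0 / D) * np.cos(X @ W + b)

def gaussian_kernel(X, Y, sigma):
    """Exact Gaussian kernel exp(-||x - y||^2 / (2 sigma^2))."""
    sq = (np.sum(X**2, axis=1)[:, None]
          + np.sum(Y**2, axis=1)[None, :]
          - 2.0 * X @ Y.T)
    return np.exp(-sq / (2.0 * sigma**2))

def demo(n=50, d=5, D=5000, sigma=1.0, seed=0):
    rng = np.random.default_rng(seed)
    X = rng.normal(size=(n, d))
    # By the Bochner theorem, the Gaussian kernel corresponds to the
    # spectral measure N(0, sigma^{-2} I); frequencies are drawn from it.
    W = rng.normal(scale=1.0 / sigma, size=(d, D))
    # Random phases turn the complex exponential into a real cosine feature.
    b = rng.uniform(0.0, 2.0 * np.pi, size=D)
    Z = rff_features(X, W, b)
    K_exact = gaussian_kernel(X, X, sigma)
    K_approx = Z @ Z.T  # inner products of features approximate the kernel
    return Z, np.max(np.abs(K_exact - K_approx))

Z, max_err = demo()
```

The maximum entrywise error shrinks roughly like one over the square root of the number of features `D`. Handling signed or complex measures, as the abstract describes, would additionally require estimating the total masses of the component measures, which this sketch does not attempt.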
