Learning with Selected Features

Shao-Bo Lin, Jian Fang, Xiangyu Chang
DOI: https://doi.org/10.1109/tcyb.2020.2987810
IF: 11.8
2022-01-01
IEEE Transactions on Cybernetics
Abstract: The coming big data era brings data of unprecedented size and drives an innovation of learning algorithms in the statistical and machine-learning communities. The classical kernel-based regularized least-squares (RLS) algorithm has been left out of this innovation because of its computational and storage bottlenecks. This article presents a scalable algorithm based on subsampling, called learning with selected features (LSF), to reduce the computational burden of RLS. An almost optimal learning rate is derived, together with a sufficient condition on the selection of kernels and centers that guarantees this optimality. Our theoretical assertions are verified by numerical experiments, including toy simulations, UCI standard data experiments, and a real-world massive data application. The studies in this article show that LSF can reduce the computational burden of RLS without substantially sacrificing its generalization ability.
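To make the subsampling idea in the abstract concrete, a minimal Python sketch follows. It is not the authors' LSF implementation: the Gaussian kernel, the uniform random selection of centers, the ridge penalty on the coefficients, and every function and parameter name (`gaussian_kernel`, `fit_subsampled_rls`, `m`, `lam`, `width`) are assumptions chosen for illustration. It only shows how fitting on m selected centers replaces the n-by-n linear system of full kernel RLS with an m-by-m one.

```python
import numpy as np

def gaussian_kernel(A, B, width=0.5):
    # Pairwise Gaussian kernel values between rows of A and rows of B.
    sq = (A**2).sum(1)[:, None] + (B**2).sum(1)[None, :] - 2.0 * A @ B.T
    return np.exp(-np.maximum(sq, 0.0) / (2.0 * width**2))

def fit_subsampled_rls(X, y, m, lam=1e-3, width=0.5, seed=0):
    # Assumption: centers are drawn uniformly at random from the inputs.
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    idx = rng.choice(n, size=m, replace=False)
    centers = X[idx]
    # Each column of Phi is one "selected feature" K(., center_j).
    Phi = gaussian_kernel(X, centers, width)  # shape (n, m)
    # Ridge-regularized least squares on the m features:
    # an m x m system instead of the n x n system of full kernel RLS.
    A = Phi.T @ Phi + lam * n * np.eye(m)
    coef = np.linalg.solve(A, Phi.T @ y)
    return centers, coef

def predict(X_new, centers, coef, width=0.5):
    return gaussian_kernel(X_new, centers, width) @ coef

# Toy usage: noisy samples of a smooth one-dimensional function.
rng = np.random.default_rng(1)
X = rng.uniform(-1, 1, size=(2000, 1))
y = np.sin(3 * X[:, 0]) + 0.1 * rng.standard_normal(2000)
centers, coef = fit_subsampled_rls(X, y, m=50)
X_test = np.linspace(-1, 1, 200)[:, None]
truth = np.sin(3 * X_test[:, 0])
rmse = np.sqrt(np.mean((predict(X_test, centers, coef) - truth) ** 2))
```

In this sketch the kernel matrix costs O(nm) to store and the solve costs O(m^3), so the savings depend on choosing m much smaller than n.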