A subspace aggregating algorithm for accurate classification

Saeid Amiri, Reza Modarres
DOI: https://doi.org/10.1007/s00180-024-01476-3
IF: 1.4049
2024-03-10
Computational Statistics
Abstract: We present a technique for learning via aggregation in supervised classification. The new method improves classification performance, regardless of which classifier is at its core. This approach exploits the information hidden in subspaces by combinations of aggregating variables and is applicable to high-dimensional data sets. We provide algorithms that randomly divide the variables into smaller subsets and permute them before applying a classification method to each subset. We combine the resulting classes to predict the class membership. Theoretical and simulation analyses consistently demonstrate the high accuracy of our classification methods. In comparison to aggregating observations through sampling, our approach proves to be significantly more effective. Through extensive simulations, we evaluate the accuracy of various classification methods. To further illustrate the effectiveness of our techniques, we apply them to five real-world data sets.
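The abstract describes the procedure only at a high level: split the variables at random into smaller subsets, classify on each subset, and combine the predicted classes. The following is a minimal sketch of that general idea, not the authors' algorithm. It assumes a nearest-centroid base classifier and a plain majority vote, neither of which is specified in the abstract, and the function names and the parameters `n_subsets`, `n_rounds`, and `seed` are hypothetical.

```python
import numpy as np


def nearest_centroid_fit(X, y):
    """Fit a nearest-centroid classifier (assumed base learner, not from the paper)."""
    classes = np.unique(y)
    centroids = np.stack([X[y == c].mean(axis=0) for c in classes])
    return classes, centroids


def nearest_centroid_predict(model, X):
    """Assign each row of X to the class with the closest centroid."""
    classes, centroids = model
    sq_dist = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return classes[sq_dist.argmin(axis=1)]


def subspace_vote_predict(X_train, y_train, X_test,
                          n_subsets=5, n_rounds=10, seed=0):
    """Classify on random variable subsets and combine by majority vote.

    Hypothetical illustration: each round shuffles the column indices,
    splits them into `n_subsets` disjoint groups, fits the base classifier
    on each group, and adds one vote per test row.
    """
    rng = np.random.default_rng(seed)
    classes = np.unique(y_train)
    n_test = X_test.shape[0]
    votes = np.zeros((n_test, classes.size), dtype=int)
    rows = np.arange(n_test)

    for _ in range(n_rounds):
        # Random permutation of the variables, then a disjoint partition.
        cols = rng.permutation(X_train.shape[1])
        for subset in np.array_split(cols, n_subsets):
            if subset.size == 0:
                continue  # more subsets requested than variables available
            model = nearest_centroid_fit(X_train[:, subset], y_train)
            pred = nearest_centroid_predict(model, X_test[:, subset])
            # Map predicted labels to column positions in the vote table.
            votes[rows, np.searchsorted(classes, pred)] += 1

    # Ties are broken in favor of the smallest class label.
    return classes[votes.argmax(axis=1)]
```

Any classifier with fit and predict steps could replace the nearest-centroid functions here, which matches the abstract's claim that the approach does not depend on the classifier at its core.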
statistics & probability