Greedy feature selection: Classifier-dependent feature selection via greedy methods

Fabiana Camattari,Sabrina Guastavino,Francesco Marchetti,Michele Piana,Emma Perracchione
2024-03-08
Abstract:The purpose of this study is to introduce a new approach to feature ranking for classification tasks, called in what follows greedy feature selection. In statistical learning, feature selection is usually realized by means of methods that are independent of the classifier applied to perform the prediction using that reduced number of features. Instead, greedy feature selection identifies the most important feature at each step and according to the selected classifier. In the paper, the benefits of such scheme are investigated theoretically in terms of model capacity indicators, such as the Vapnik-Chervonenkis (VC) dimension or the kernel alignment, and tested numerically by considering its application to the problem of predicting geo-effective manifestations of the active Sun.
Machine Learning,Numerical Analysis
What problem does this paper attempt to address?
The main goal of this paper is to introduce a new feature selection method, called greedy feature selection, for feature ranking in classification tasks. Traditionally, feature selection methods are usually independent of the classifier used, whereas the greedy feature selection method proposed in this paper identifies the most important features based on the selected classifier. The main contributions of the paper can be summarized as follows: 1. **Proposed a new feature selection method**: This method selects the most important features at each step based on the currently selected classifier, making the feature selection process entirely dependent on the final model to be used. 2. **Theoretical analysis**: The authors theoretically analyzed the effectiveness of this new method, evaluating the impact on model complexity through metrics such as Vapnik-Chervonenkis (VC) dimension and kernel alignment. The study found that adding features during the greedy selection process does not reduce the expressiveness of the classifier (as measured by the VC dimension). 3. **Application example**: The paper demonstrates the practical effectiveness of the method through an application case of predicting geophysical phenomena triggered by solar activity (such as geomagnetic storms). Specifically, the researchers used machine learning algorithms to predict intense geomagnetic events caused by solar flares. 4. **Experimental validation**: The paper provides results from two experimental scenarios. First, an experiment on a synthetic dataset verified that the algorithm could correctly identify the most meaningful features for the classification task. Second, an application on an actual solar physics dataset showed that the method could effectively identify key features for predicting geomagnetic events. 5. **Stopping criterion**: To prevent overfitting, the paper also proposes a stopping criterion based on the True Skill Statistic (TSS) to decide when to stop the feature selection process. In summary, this paper aims to improve the performance of classification tasks through a new greedy feature selection method that is closely related to the classifier, and demonstrates its effectiveness and practicality through theoretical analysis and experimental results.