Hybrid Particle Swarm Optimization with Spiral-Shaped Mechanism for Feature Selection

Ke Chen,Feng-Yu Zhou,Xian-Feng Yuan
DOI: https://doi.org/10.1016/j.eswa.2019.03.039
IF: 8.5
2019-01-01
Expert Systems with Applications
Abstract:The "curse of dimensionality" is one of the largest problems that influences the quality of the optimization process in most data mining, pattern recognition, and machine learning tasks. Using high dimensional datasets to train a classification model may reduce the generalization performance of the learned model. In addition, high dimensionality of the dataset results in high computational and memory costs. Feature selection is an important data preprocessing approach in many practical application domains that are relevant to expert and intelligent systems. Feature selection aims at selecting a subset of informative and relevant features from an original feature dataset. Therefore, using a feature selection approach to process the original data prior to the learning process is essential for enhancing the performance on the classification task. In this paper, hybrid particle swarm optimization with a spiral shaped mechanism (HPSO-SSM) is proposed for selecting the optimal feature subset for classification via a wrapper-based approach. In HPSO-SSM, we make three improvements: First, a logistic map sequence is used to enhance the diversity in the search process. Second, two new parameters are introduced into the original position update formula, which can effectively improve the position quality of the next generation. Finally, a spiral-shaped mechanism is adopted as a local search operator around the known optimal solution region. For a complete evaluation, the proposed HPSO-SSM method is compared with six state-of-the-art meta-heuristic optimization algorithms, ten well-known wrapper-based feature selection techniques, and six classic filter-based feature selection methods. Various assessment indicators are used to properly evaluate and compare the performances of these approaches on twenty classic benchmark classification datasets from the UCI machine learning repository. According to the experimental results and statistical tests, the developed methods effectively and efficiently improve the classification accuracy compared with other wrapper-based approaches and filter-based approaches. The results demonstrate the high performance of the HPSO-SSM method in searching the feasible feature space and selecting the most informative attributes for solving classification problems. Therefore, the HPSO-SSM method has broad application prospects as a new feature selection approach. (C) 2019 Elsevier Ltd. All rights reserved.
What problem does this paper attempt to address?