Quantum computer based Feature Selection in Machine Learning

Gerhard Hellstern, Vanessa Dehn, Martin Zaefferer
2023-06-18
Abstract: The problem of selecting an appropriate number of features in supervised learning problems is investigated in this paper. Starting with common methods in machine learning, we treat the feature selection task as a quadratic unconstrained binary optimization (QUBO) problem, which can be tackled with classical numerical methods as well as within a quantum computing framework. We compare the different results in small-sized problem setups. According to the results of our study, whether the QUBO method outperforms other feature selection methods depends on the data set. In an extension to a larger data set with 27 features, we compare the convergence behavior of the QUBO methods via quantum computing with classical stochastic optimization methods. Due to persisting error rates, the classical stochastic optimization methods are still superior.
Quantum Physics, Computational Finance
What problem does this paper attempt to address?
This paper addresses the problem of selecting an appropriate number of features in supervised learning. Specifically, it explores how to select, from a given dataset, the subset of features that is most helpful for predicting the target, in order to improve the performance of machine-learning models. The problem matters because selecting too many or too few features degrades both model training and final predictive ability. To address it, the authors model the feature selection task as a Quadratic Unconstrained Binary Optimization (QUBO) problem and study how to solve it with classical numerical methods and within a quantum computing framework. They compare the performance of the different methods on small-scale datasets, then extend the study to a larger dataset with 27 features to compare the convergence behavior of the quantum computing approach with that of classical stochastic optimization methods. Through these studies, the authors aim to explore the potential of quantum computing for practical machine-learning problems, in particular for the crucial step of feature selection.
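To make the QUBO idea concrete, here is a minimal sketch in Python. It is not the authors' formulation, which this summary does not spell out. It assumes one common choice: each binary variable x_i marks whether feature i is selected, the diagonal of Q rewards a feature's absolute correlation with the target, the off-diagonal entries penalize absolute correlation between two selected features, and a hypothetical weight alpha trades the two off. The names build_qubo and solve_qubo_brute_force are illustrative, and exhaustive search stands in for the classical and quantum solvers, which is only feasible for a handful of features.

```python
import itertools

import numpy as np


def build_qubo(X, y, alpha=0.5):
    """Build an upper-triangular QUBO matrix Q for feature selection.

    Assumed formulation (not taken from the paper): minimizing x^T Q x
    rewards relevance to the target and penalizes pairwise redundancy.
    """
    n_features = X.shape[1]
    # Correlation matrix of all features plus the target (last column).
    corr = np.corrcoef(np.column_stack([X, y]), rowvar=False)
    relevance = np.abs(corr[:n_features, -1])
    redundancy = np.abs(corr[:n_features, :n_features])
    # Off-diagonal: redundancy penalty, counted once per feature pair.
    Q = (1.0 - alpha) * np.triu(redundancy, k=1)
    # Diagonal: negative relevance, so selecting a relevant feature lowers the cost.
    Q[np.diag_indices(n_features)] = -alpha * relevance
    return Q


def solve_qubo_brute_force(Q):
    """Enumerate all 2^n bit strings and return the minimizer of x^T Q x."""
    n = Q.shape[0]
    best_x, best_val = None, np.inf
    for bits in itertools.product([0, 1], repeat=n):
        x = np.array(bits)
        val = float(x @ Q @ x)
        if val < best_val:
            best_x, best_val = x, val
    return best_x, best_val


# Toy data: feature 1 is a near-copy of feature 0, feature 3 is pure noise.
rng = np.random.default_rng(0)
f0 = rng.normal(size=200)
f1 = f0 + 0.05 * rng.normal(size=200)
f2 = rng.normal(size=200)
f3 = rng.normal(size=200)
X = np.column_stack([f0, f1, f2, f3])
y = f0 + f2 + 0.1 * rng.normal(size=200)

Q = build_qubo(X, y, alpha=0.5)
selected, cost = solve_qubo_brute_force(Q)
print("selected feature mask:", selected, "cost:", cost)
```

In this toy setup the redundancy penalty discourages picking both feature 0 and its near-copy. Exhaustive search scales as 2^n, which is why larger instances such as the 27-feature data set call for stochastic optimizers or quantum hardware instead.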