Team QTB on Feature Selection Via Quantum Annealing and Hybrid Models

Juan Carlos Martínez Santos,Esteban Payares,Edwin Puertas
Abstract:Quantum technologies are a reality today, and the future is bright regarding its capabilities for real-world applications. Feature selection is a crucial preprocessing step in Information Retrieval. By identifying the most informative subset of features, feature selection can improve the efficiency of learning to rank models. In this paper, we propose a novel approach to feature selection for Information Retrieval using quantum annealing. This promising optimization technique leverages the principles of quantum mechanics. We focus on the MQ2007 dataset, a widely used benchmark for learning to rank tasks. We also explore different formulations of the feature selection problem as quadratic unconstrained binary optimization (QUBO) problems, including mutual information, conditional mutual information, and correlation coefficients. Our quantum annealing-based approaches demonstrate their effectiveness in selecting informative features, outperforming simulated annealing, which achieves an nDCG@10 score of 0.4024. The best quantum annealing-based approach achieves a score of 0.443 using a hybrid solver with only ten features. We discuss the importance of the number of selected features in the performance of learning to rank models and the role of hybrid quantum-classical solvers in incorporating additional constraints and preferences into the feature selection process. Our work demonstrates the potential of using quantum annealing to tackle complex optimization problems. It paves the way for further exploration in this domain.
Computer Science,Physics
What problem does this paper attempt to address?