A New Deep Learning and XAI-Based Algorithm for Features Selection in Genomics

Carlo Adornetto, Gianluigi Greco
2023-03-30
Abstract: In the field of functional genomics, the analysis of gene expression profiles through Machine and Deep Learning is increasingly providing meaningful insight into a number of diseases. The paper proposes a novel algorithm to perform Feature Selection on genomic-scale data, which exploits the reconstruction capabilities of autoencoders and an ad-hoc defined Explainable Artificial Intelligence-based score in order to select the most informative genes for diagnosis, prognosis, and precision medicine. Results of the application on a Chronic Lymphocytic Leukemia dataset evidence the effectiveness of the algorithm, by identifying and suggesting a set of meaningful genes for further medical investigation.
Genomics, Artificial Intelligence, Machine Learning
What problem does this paper attempt to address?
The paper addresses how to select the most informative genes from large-scale gene expression data in functional genomics, with the aim of improving diagnosis, prognosis, and precision medicine. It proposes a new feature selection algorithm for genome-scale data that combines deep learning (the reconstruction capabilities of autoencoders) with a score based on explainable artificial intelligence (XAI). The algorithm pursues three objectives:

1. **Select the most meaningful genes**: identify the genes in gene expression profiles that matter most for a regression or classification problem.
2. **Provide more accurate prediction models**: improve the accuracy of prediction models through feature selection.
3. **Quantify and evaluate the impact of features on predictions**: use XAI methods to measure how much each feature influences the model's predictions.

The paper validates the algorithm on a Chronic Lymphocytic Leukemia (CLL) dataset, where it identifies a set of meaningful genes that offer clues for the prognosis of the disease and are suggested for further medical investigation.
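
The general idea of ranking features by their effect on an autoencoder's reconstruction can be illustrated with a minimal sketch. This is not the paper's algorithm: the summary does not specify the network architecture or the XAI score, so the sketch assumes a linear autoencoder (whose optimal weights are given in closed form by PCA) and a simple occlusion score as stand-ins. All function names, the synthetic data, and the parameters `n_latent` and `k` are hypothetical.

```python
import numpy as np


def fit_linear_autoencoder(X, n_latent):
    """Closed-form linear autoencoder (assumption: stands in for a deep one).

    The optimal linear encoder/decoder pair spans the top principal
    directions of the centered data, so it can be obtained from an SVD.
    """
    mean = X.mean(axis=0)
    _, _, vt = np.linalg.svd(X - mean, full_matrices=False)
    W = vt[:n_latent].T  # shape (n_features, n_latent); decoder is W.T
    return mean, W


def reconstruct(X, mean, W):
    """Encode to the latent space and decode back to feature space."""
    return (X - mean) @ W @ W.T + mean


def occlusion_scores(X, mean, W):
    """Hypothetical XAI-style score, not the paper's ad-hoc score.

    Feature j is scored by the mean squared change in the reconstruction
    when column j is replaced by its mean value.
    """
    base = reconstruct(X, mean, W)
    scores = np.empty(X.shape[1])
    for j in range(X.shape[1]):
        X_occ = X.copy()
        X_occ[:, j] = mean[j]
        scores[j] = np.mean((reconstruct(X_occ, mean, W) - base) ** 2)
    return scores


def select_features(X, n_latent, k):
    """Return the indices of the k highest-scoring features and all scores."""
    mean, W = fit_linear_autoencoder(X, n_latent)
    scores = occlusion_scores(X, mean, W)
    top = np.argsort(scores)[::-1][:k]
    return top, scores


# Synthetic stand-in for an expression matrix (samples x genes):
# columns 0-4 are driven by two shared latent factors, columns 5-19 are noise.
rng = np.random.default_rng(0)
z = rng.normal(size=(200, 2))
A = np.array([[2.0, 2.0, 2.0, 2.0, 2.0],
              [2.0, -2.0, 2.0, -2.0, 2.0]])
informative = z @ A + 0.1 * rng.normal(size=(200, 5))
noise = rng.normal(size=(200, 15))
X = np.hstack([informative, noise])

top, scores = select_features(X, n_latent=2, k=5)
print(sorted(top.tolist()))
```

In this toy setting the structured columns dominate the latent space, so occluding them changes the reconstruction far more than occluding a noise column does. A real gene expression study would need a nonlinear autoencoder, held-out evaluation, and the score defined in the paper itself.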