Comparing machine learning models for predicting mutation status in Acute Myeloid Leukemia patients using RNA-seq data

Raissa Silva, Cedric Riedel, Jerome Reboul, Benoit Guibert, Florence Ruffle, Melina Gallopin, Nicolas Gilbert, Anthony Boureux, Therese Commes
DOI: https://doi.org/10.1101/2024.11.13.623391
2024-11-14
Abstract: Acute Myeloid Leukemia (AML) is a highly heterogeneous disease. Current AML classifications are based mainly on molecular markers, including cytogenetic features, fusion genes, and the presence or absence of mutations. In this study, we investigated mutation status in AML patients using RNA-seq data, in connection with differential gene expression. We applied seven machine learning algorithms to identify the presence or absence of NPM1, IDH1/IDH2, and FLT3-ITD mutations, reaching 95%, 93%, and 87% accuracy, respectively. In each case, the best-performing models were complex models, suggesting that highly complex biological processes underlie AML.
Bioinformatics