crossNN: an explainable framework for cross-platform DNA methylation-based classification of cancer
Dongsheng Yuan, Robin Jugas, Petra Pokorna, Jaroslav Sterba, Ondrej Slaby, Simone Schmid, Christin Siewert, Brendan Osberg, David Capper, Pia Zeiner, Katharina Weber, Patrick Harter, Nabil Jabareen, Sebastian Mackowiak, Naveed Ishaque, Roland Eils, Sören Lukassen, Philipp Euskirchen
DOI: https://doi.org/10.1101/2024.01.22.24301523
2024-01-23
Abstract: DNA methylation-based classification of brain tumors has emerged as a powerful and indispensable diagnostic technique. Initial implementations used methylation microarrays for data generation, but a range of sequencing approaches is increasingly used. Most current classifiers, however, rely on a fixed methylation feature space, which makes them incompatible with other platforms, especially the different flavors of DNA sequencing. Here, we describe crossNN, a neural network-based machine learning framework that accurately classifies tumor entities using DNA methylation profiles obtained from different platforms and with different epigenome coverage and sequencing depth. It outperforms other deep and shallow machine learning models with respect to precision as well as simplicity and computational requirements, while remaining fully explainable. Validation in a large cohort of >1,900 tumors profiled using different microarray and sequencing platforms, including low-pass nanopore and targeted bisulfite sequencing, demonstrates the robustness and scalability of the model.
Neurology