Deep Learning-Based Multi-Omics Data Integration Reveals Two Prognostic Subtypes in High-Risk Neuroblastoma

Li Zhang,Chenkai Lv,Yaqiong Jin,Ganqi Cheng,Yibao Fu,Dongsheng Yuan,Yiran Tao,Yongli Guo,Xin Ni,Tieliu Shi
DOI: https://doi.org/10.3389/fgene.2018.00477
IF: 3.7
2018-10-18
Frontiers in Genetics
Abstract:High-risk neuroblastoma is a very aggressive disease, with excessive tumor growth and poor outcomes. A proper stratification of the high-risk patients by prognostic outcome is important for treatment. However, there is still a lack of survival stratification for the high-risk neuroblastoma. To fill the gap, we adopt a deep learning algorithm, Autoencoder, to integrate multi-omics data, and combine it with K-means clustering to identify two subtypes with significant survival differences. By comparing the Autoencoder with PCA, iCluster, and DGscore about the classification based on multi-omics data integration, Autoencoder-based classification outperforms the alternative approaches. Furthermore, we also validated the classification in two independent datasets by training machine-learning classification models, and confirmed its robustness. Functional analysis revealed that <i>MYCN</i> amplification was more frequently occurred in the ultra-high-risk subtype, in accordance with the overexpression of <i>MYC/MYCN</i> targets in this subtype. In summary, prognostic subtypes identified by deep learning-based multi-omics integration could not only improve our understanding of molecular mechanism, but also help the clinicians make decisions.
genetics & heredity
What problem does this paper attempt to address?