An Integrated Method Based on Wasserstein Distance and Graph for Cancer Subtype Discovery

Qingqing Cao,Jianping Zhao,Haiyun Wang,Qi Guan,Chunhou Zheng
DOI: https://doi.org/10.1109/tcbb.2023.3293472
2023-01-01
IEEE/ACM Transactions on Computational Biology and Bioinformatics
Abstract:Due to the complexity of cancer pathogenesis at different omics levels, it is necessary to find a comprehensive method to accurately distinguish and find cancer subtypes for cancer treatment. In this paper, we proposed a new cancer multi-omics subtype identification method, which is based on variational autoencoder measured by Wasserstein distance and graph autoencoder (WVGMO). This method depends on two foremost models. The first model is a variational autoencoder measured by Wasserstein distance (WVAE), which is used to extract potential spatial information of each omic data type. The second model is the graph autoencoder (GAE) with the second-order proximity. It has the capability to retain the topological structure information and feature information of the multi-omics data. And then, the identification of cancer subtypes via k-means clustering. Extensive experiments were conducted on seven different cancers based on four omics data from TCGA. The results show that WVGMO provides equivalent or even better results than the most of advanced synthesis methods.
What problem does this paper attempt to address?