Multi-view Clustering for the Integration Analysis of Gene Expression and Methylation Data

Xiaowei Gao, Xiaogang Liu, Xiaoke Ma
DOI: https://doi.org/10.1109/BIBM52615.2021.9669285
2021-01-01
Abstract: The accumulated gene expression and DNA methylation data provide a great opportunity to investigate the mechanisms of biological systems. Current algorithms for integrating gene expression and methylation data are criticized for poor performance because they fail to capture the latent relations in the heterogeneous data. To solve this problem, we propose a novel multi-view clustering method with self-representation learning and a low-rank tensor constraint (MCSL-LTC), in which the gene expression and DNA methylation data are treated as complementary views and MCSL-LTC obtains a consensus partitioning that reflects the structure and features of the various views. Specifically, self-representation learning is employed to explore the low-dimensional subspace structures embedded in the different views, and a tensor norm is adopted to smooth across views, thereby improving the quality of the features. Experimental results demonstrate that the proposed approach outperforms state-of-the-art baselines in terms of accuracy on both social and cancer data, providing an effective and efficient method for the integration of heterogeneous genomic data.
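The general idea of per-view self-representation followed by a consensus partition can be illustrated with a minimal Python sketch. This is not the authors' MCSL-LTC: the low-rank tensor constraint is replaced here by a plain average of the per-view affinities, and the ridge-regularized objective, the `lam` parameter, the features-by-samples layout, and all function names are assumptions made for illustration only.

```python
import numpy as np

def self_representation(X, lam=0.1):
    """Closed-form solution of min_Z ||X - XZ||_F^2 + lam * ||Z||_F^2.

    X has shape (features, samples); each sample is expressed as a
    linear combination of the other samples in the same view.
    """
    G = X.T @ X
    return np.linalg.solve(G + lam * np.eye(G.shape[0]), G)

def consensus_affinity(views, lam=0.1):
    """Average the symmetrized |Z| of every view (a stand-in for the
    tensor-norm coupling described in the abstract, not equivalent to it)."""
    n = views[0].shape[1]
    A = np.zeros((n, n))
    for X in views:
        Z = self_representation(X, lam)
        A += (np.abs(Z) + np.abs(Z.T)) / 2.0
    return A / len(views)

def spectral_embedding(A, k):
    """Row-normalized eigenvectors of the k smallest eigenvalues of the
    symmetric normalized graph Laplacian."""
    d = np.maximum(A.sum(axis=1), 1e-12)
    s = 1.0 / np.sqrt(d)
    L = np.eye(len(d)) - s[:, None] * A * s[None, :]
    _, vecs = np.linalg.eigh(L)
    U = vecs[:, :k]
    return U / (np.linalg.norm(U, axis=1, keepdims=True) + 1e-12)

def kmeans_farthest_init(U, k, iters=50):
    """Small deterministic k-means with farthest-point initialization."""
    centers = [U[0]]
    for _ in range(1, k):
        d = np.min([np.linalg.norm(U - c, axis=1) for c in centers], axis=0)
        centers.append(U[np.argmax(d)])
    centers = np.array(centers)
    for _ in range(iters):
        dist = np.linalg.norm(U[:, None, :] - centers[None, :, :], axis=2)
        labels = dist.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = U[labels == j].mean(axis=0)
    return labels

def multiview_cluster(views, k, lam=0.1):
    """Cluster samples shared by all views into k groups."""
    A = consensus_affinity(views, lam)
    return kmeans_farthest_init(spectral_embedding(A, k), k)
```

Calling `multiview_cluster([expression, methylation], k)` with two matrices whose columns refer to the same samples returns one cluster label per sample. Averaging affinities is the simplest way to fuse views; a tensor-based constraint of the kind the abstract describes would instead couple the per-view representations during optimization.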