Low-rank representation regularized by L2,1-norm for identifying differentially expressed genes

Ya-Xuan Wang, Jin-Xing Liu, Ying-Lian Gao, Chun-Hou Zheng, Ling-Yun Dai
DOI: https://doi.org/10.1109/BIBM.2017.8217725
2017-01-01
Abstract: Low-rank representation (LRR) via rank minimization is a highly efficient method for capturing the low-dimensional structure embedded in high-dimensional data. However, minimizing the rank of a matrix is NP-hard. In this paper, a robust truncated nuclear norm low-rank representation method regularized by the L2,1-norm (RTLRR) is proposed. The truncated nuclear norm replaces the nuclear norm as an approximation of the rank function. At the same time, the L2,1-norm is used to regularize the sparse matrix so that the algorithm yields a sparser result. The proposed method is divided into two steps. First, singular value decomposition (SVD) is applied to the original data matrix. Then the truncated nuclear norm and L2,1-norm constraints are applied to the subproblems, which are solved with the inexact augmented Lagrange multiplier method. Finally, genes with high scores in the sparse matrix are identified as differentially expressed genes. Results on The Cancer Genome Atlas (TCGA) data show that RTLRR outperforms many other methods.
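The two regularizers named in the abstract, and the final gene-ranking step, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation and it does not include the inexact augmented Lagrange multiplier solver. It assumes the usual definitions: the truncated nuclear norm is the sum of all singular values except the r largest, and the L2,1-norm is the sum of the Euclidean norms of the rows. The row-wise convention, the layout with genes as rows of the sparse matrix, the use of the row norm as the gene score, and the function names are all assumptions made for illustration.

```python
import numpy as np


def truncated_nuclear_norm(X, r):
    """Sum of the singular values of X, excluding the r largest."""
    # Singular values are returned in descending order.
    s = np.linalg.svd(X, compute_uv=False)
    return float(s[r:].sum())


def l21_norm(E):
    """Sum of the Euclidean norms of the rows of E (row-wise convention assumed)."""
    return float(np.linalg.norm(E, axis=1).sum())


def top_genes(S, k):
    """Indices of the k rows of S with the largest Euclidean norm, best first.

    Assumes each row of the sparse matrix S corresponds to one gene and
    that the row norm serves as the gene's score.
    """
    scores = np.linalg.norm(S, axis=1)
    return np.argsort(scores)[::-1][:k]
```

For example, with a sparse matrix whose rows are `[0, 0]`, `[3, 4]`, and `[1, 0]`, `top_genes(S, 2)` ranks the second row first and the third row second, because their row norms are 5 and 1.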