Laplacian Regularized Sparse Representation based Classifier for Identifying DNA N4-methylcytosine Sites via L2,1/2-matrix Norm.

Yijie Ding,Wenying He,Jijun Tang,Quan Zou,Fei Guo
DOI: https://doi.org/10.1109/TCBB.2021.3133309
2021-01-01
IEEE/ACM Transactions on Computational Biology and Bioinformatics
Abstract:N4-methylcytosine (4mC) is one of important epigenetic modifications in DNA sequences. Detecting 4mC sites is time-consuming. The computational method based on machine learning has provided effective help for identifying 4mC. To further improve the performance of prediction, we propose a Laplacian Regularized Sparse Representation based Classifier with L2,1/2-matrix norm (LapRSRC). We also utilize kernal trick to derive the kernel LapRSRC for nonlinear modeling. Matrix factorization technology is employed to solve the sparse representation coefficients of all test samples in the training set. And an efficient iterative algorithm is proposed to solve the objective function. We implement our model on six benchmark datasets of 4mC and eight UCI datasets to test evaluate performance. The results show that the performance of our method is better or comparable.
What problem does this paper attempt to address?