Regularized Sparse Modelling for Microarray Missing Value Estimation

Aiguo Wang, Jing Yang, Ning An
DOI: https://doi.org/10.1109/ACCESS.2021.3053631
IF: 3.9
2021-01-01
IEEE Access
Abstract: Missing values in microarray data hinder downstream biological analyses that expect complete data as input, so effectively exploiting the underlying structure of the data to accurately estimate missing entries remains crucial. In this study, we formalize the problem under a regularized sparse framework and propose local learning-based imputation models that capture the relationships hidden in gene expression profiles. Specifically, in view of the simultaneous variable selection and grouping effect of the elastic net penalty, we present an elastic net regularized local least squares-based imputation method that estimates the missing entries of a target gene from its neighbors. In addition, we investigate different similarity filtering metrics for selecting neighbor genes and develop another four imputation methods under the framework. Furthermore, the proposed methods process the target genes in ascending order of their associated missing rates. Finally, extensive comparative experiments against eight other commonly used methods are conducted on multiple microarray datasets with varying missing rates. Results indicate the power of sparse regularization techniques and the superiority of elastic net over its competitors in terms of statistical analysis metrics.
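The abstract's core procedure (select neighbor genes, fit an elastic net regularized local least squares model on the target gene's observed entries, and process targets in ascending order of missing rate) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation. The function names `soft_threshold`, `elastic_net_fit`, and `impute`, the parameters `k`, `lam`, and `alpha`, the use of Euclidean distance as the similarity metric, the coordinate-descent solver, and the particular penalty parametrization are all assumptions made for illustration.

```python
def soft_threshold(value, threshold):
    """Soft-thresholding operator used by the L1 part of the penalty."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def elastic_net_fit(X, y, lam=0.1, alpha=0.5, n_iter=200):
    """Coordinate descent for the (assumed) objective
    (1/2n)||y - b - Xw||^2 + lam * (alpha*||w||_1 + (1-alpha)/2*||w||_2^2).
    X is a list of rows (samples) and y a list of targets."""
    n, p = len(X), len(X[0])
    x_mean = [sum(row[j] for row in X) / n for j in range(p)]
    y_mean = sum(y) / n
    # Center the data so the intercept can be recovered after fitting.
    Xc = [[row[j] - x_mean[j] for j in range(p)] for row in X]
    residual = [yi - y_mean for yi in y]
    w = [0.0] * p
    for _ in range(n_iter):
        for j in range(p):
            z = sum(Xc[i][j] ** 2 for i in range(n)) / n
            if z == 0.0:
                continue  # constant feature carries no information
            rho = sum(Xc[i][j] * (residual[i] + Xc[i][j] * w[j])
                      for i in range(n)) / n
            new_w = soft_threshold(rho, lam * alpha) / (z + lam * (1.0 - alpha))
            delta = new_w - w[j]
            if delta != 0.0:
                for i in range(n):
                    residual[i] -= Xc[i][j] * delta
                w[j] = new_w
    intercept = y_mean - sum(w[j] * x_mean[j] for j in range(p))
    return w, intercept


def impute(data, k=3, lam=0.1, alpha=0.5):
    """Fill None entries of a genes-by-samples matrix (list of lists).
    Targets are processed in ascending order of their missing count, and
    a gene that has been filled can serve as a neighbor for later targets
    (an assumed reading of the ordering step)."""
    filled = [list(row) for row in data]
    n_cols = len(data[0])
    targets = [g for g, row in enumerate(data) if any(v is None for v in row)]
    targets.sort(key=lambda g: sum(v is None for v in data[g]))
    for g in targets:
        obs = [c for c in range(n_cols) if filled[g][c] is not None]
        mis = [c for c in range(n_cols) if filled[g][c] is None]
        complete = [h for h, row in enumerate(filled)
                    if h != g and all(v is not None for v in row)]
        if not obs or not complete:
            continue  # nothing to learn from; leave the gene unfilled
        # Assumed similarity filter: Euclidean distance on observed columns.
        complete.sort(key=lambda h: sum((filled[h][c] - filled[g][c]) ** 2
                                        for c in obs))
        neighbors = complete[:k]
        # Rows are observed samples, columns are neighbor genes.
        X = [[filled[h][c] for h in neighbors] for c in obs]
        y = [filled[g][c] for c in obs]
        w, b = elastic_net_fit(X, y, lam, alpha)
        for c in mis:
            filled[g][c] = b + sum(w[j] * filled[neighbors[j]][c]
                                   for j in range(len(neighbors)))
    return filled
```

Calling `impute(matrix, k=2, lam=0.01)` on a small list-of-lists matrix with `None` marking missing entries returns a new matrix with those entries estimated, leaving the input untouched. Setting `alpha=1.0` reduces the penalty to the lasso and `alpha=0.0` to ridge, which is one way to compare the regularizers the abstract contrasts.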
Computer Science