TOBMI: Trans-omics block missing data imputation using a k-Nearest Neighbor weighted approach.

Xuesi Dong,Lijuan Lin,Ruyang Zhang,Yang Zhao,David C Christiani,Yongyue Wei,Feng Chen
DOI: https://doi.org/10.1093/bioinformatics/bty796
IF: 5.8
2019-01-01
Bioinformatics
Abstract:Motivation: Stitching together trans-omics data is a powerful approach to assess the complex mechanisms of cancer occurrence, progression and treatment. However, the integration process suffers from the 'block missing' phenomena when part of individuals lacks some omics data. Results: We proposed a k-nearest neighbor (kNN) weighted imputation method for trans-omics block missing data (TOBMIkNN) to handle gene-absence individuals in RNA-seq datasets using external information obtained from DNA methylation probe datasets. Referencing to multi-hot deck, mean imputation and missing cases deletion, we assess the relative error, absolute error, interomics correlation structure change and variable selection. The proposed method, TOBMIkNN reliably imputed RNA-seq data by borrowing information from DNA methylation data, and showed superiority over the other three methods in imputation error and stability of correlation structure. Our study indicates that TOBMIkNN can be used as an advisable method for trans-omics block missing data imputation. Availability and implementation: TOBMIkNN is freely available at https://github.com/XuesiDong/TOBMI.
What problem does this paper attempt to address?