LncRNA–disease association prediction through combining linear and non-linear features with matrix factorization and deep learning techniques

Min Zeng,Chengqian Lu,Fuhao Zhang,Zhangli Lu,Fang-Xiang Wu,Yaohang Li,Min Li
DOI: https://doi.org/10.1109/BIBM47256.2019.8983279
2019-01-01
Abstract:Long non-coding RNAs (lncRNAs) are the foundation for understanding mechanisms of many human diseases. Considering the limited number of known experimentally verified associations between lncRNAs and diseases, it is appealing to develop accurate and effective computational methods to identify lncRNA-disease associations. Conventional matrix factorization-based methods cannot model complicated associations between lncRNAs and diseases. In this study, we propose a novel computational framework, through combining linear and non-linear features, which is used for lncRNA-disease association prediction. In our model, a conventional matrix factorization method is applied to extract linear features between lncRNAs and diseases. Deep learning techniques (fully connected layers) are applied to extract nonlinear features between lncRNAs and diseases. Finally, linear and non-linear features are fused to improve predictive performance. Compared to previous studies, our model can take advantages of the combination of linear and non-linear features between lncRNAs and diseases, and thus can effectively identify potential lncRNA-disease associations. The results show that our method achieves state-of-the-art performance in the leave-one-out cross-validation. The source codes of our method can be found at https://github.com/CSUBioGroup/DMFLDA2.
What problem does this paper attempt to address?