A gene prioritization method based on a swine multi-omics knowledgebase and a deep learning model

Yuhua Fu,Jingya Xu,Zhenshuang Tang,Lu Wang,Dong Yin,Yu Fan,Dongdong Zhang,Fei Deng,Yanping Zhang,Haohao Zhang,Haiyan Wang,Wenhui Xing,Lilin Yin,Shilin Zhu,Mengjin Zhu,Mei Yu,Xinyun Li,Xiaolei Liu,Xiaohui Yuan,Shuhong Zhao
DOI: https://doi.org/10.1038/s42003-020-01233-4
IF: 6.548
2020-09-10
Communications Biology
Abstract:Abstract The analyses of multi-omics data have revealed candidate genes for objective traits. However, they are integrated poorly, especially in non-model organisms, and they pose a great challenge for prioritizing candidate genes for follow-up experimental verification. Here, we present a general convolutional neural network model that integrates multi-omics information to prioritize the candidate genes of objective traits. By applying this model to Sus scrofa , which is a non-model organism, but one of the most important livestock animals, the model precision was 72.9%, recall 73.5%, and F1-Measure 73.4%, demonstrating a good prediction performance compared with previous studies in Arabidopsis thaliana and Oryza sativa . Additionally, to facilitate the use of the model, we present ISwine ( http://iswine.iomics.pro/ ), which is an online comprehensive knowledgebase in which we incorporated almost all the published swine multi-omics data. Overall, the results suggest that the deep learning strategy will greatly facilitate analyses of multi-omics integration in the future.
biology
What problem does this paper attempt to address?