A Gene-disease Association Prediction Algorithm Based on Multi-source Data Fusion

Fei Wang, Hohhot Vocational College Information Management Center
DOI: https://doi.org/10.7546/ijba.2022.26.1.000870
2022-03-01
International Journal Bioautomation
Abstract: Accurate gene-disease association prediction is the basis for effective diagnosis and treatment of complex genetic diseases. However, existing studies on this topic generally face two problems: the original data are large in volume and diverse in type, and the data are difficult to fuse. Therefore, this paper studied a gene-disease association prediction algorithm based on multi-source data fusion. First, it processed the multi-dimensional gene phenotype data, analyzed the gene-disease associations of different phenotypes, and completed the selection of disease gene loci under multi-dimensional phenotypes. Then, it fused multi-source data comprising gene expression data, gene sequence data, gene interaction data, and transcriptome sequencing data, and established the corresponding gene-disease association prediction model. Finally, the effectiveness of the constructed prediction model was verified by experimental results. The results obtained in this paper can improve the low utilization of gene datasets, restore the main features of the datasets to the greatest extent, reasonably handle data noise, effectively enhance the robustness of the model, and further improve the classification accuracy of disease-causing gene prediction.
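The abstract does not state how the four data sources are fused, so the following is only a minimal sketch of one common option, feature-level (early) fusion, and not the paper's method. It assumes each source has already been turned into a numeric feature matrix with one row per gene, in the same gene order. The function names (`zscore_columns`, `fuse_sources`), the source labels, and the toy values are all hypothetical.

```python
from statistics import mean, pstdev


def zscore_columns(matrix):
    """Standardize each column of a row-major matrix (one row per gene)."""
    stats = []
    for col in zip(*matrix):
        sd = pstdev(col)
        # A constant column has zero spread; divide by 1.0 to avoid division by zero.
        stats.append((mean(col), sd if sd > 0 else 1.0))
    return [[(v - m) / s for v, (m, s) in zip(row, stats)] for row in matrix]


def fuse_sources(sources):
    """Early fusion: standardize each source separately, then concatenate per gene."""
    matrices = list(sources.values())
    n_genes = len(matrices[0])
    if any(len(m) != n_genes for m in matrices):
        raise ValueError("every source must describe the same genes in the same order")
    normalized = [zscore_columns(m) for m in matrices]
    # For gene i, join its standardized feature rows from all sources into one vector.
    return [sum((src[i] for src in normalized), []) for i in range(n_genes)]


# Toy, made-up values: 3 genes described by two hypothetical sources.
sources = {
    "expression": [[1.0, 10.0], [2.0, 20.0], [3.0, 30.0]],
    "interaction": [[0.0], [1.0], [1.0]],
}
fused = fuse_sources(sources)
```

Standardizing each source before concatenation keeps a source with large raw values from dominating the fused vector. The fused rows could then be passed to any classifier that labels genes as disease-associated or not. That step is left out here because the abstract does not name the model.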