Biomedical evidence engineering for data-driven discovery

Sendong Zhao,Aobo Wang,Bing Qin,Fei Wang
DOI: https://doi.org/10.1093/bioinformatics/btac675
IF: 5.8
2022-10-13
Bioinformatics
Abstract:Abstract Motivation With the rapid development of precision medicine, a large amount of health data (such as electronic health records, gene sequencing, medical images, etc.) has been produced. It encourages more and more interest in data-driven insight discovery from these data. A reasonable way to verify the derived insights is by checking evidence from biomedical literature. However, manual verification is inefficient and not scalable. Therefore, an intelligent technique is necessary to solve this problem. Results This article introduces a framework for biomedical evidence engineering, addressing this problem more effectively. The framework consists of a biomedical literature retrieval module and an evidence extraction module. The retrieval module ensembles several methods and achieves state-of-the-art performance in biomedical literature retrieval. A BERT-based evidence extraction model is proposed to extract evidence from literature in response to queries. Moreover, we create a dataset with 1 million examples of biomedical evidence, 10 000 of which are manually annotated. Availability and implementation Datasets are available at https://github.com/SendongZhao.
biochemical research methods,biotechnology & applied microbiology,mathematical & computational biology
What problem does this paper attempt to address?