BioCaster in 2021: automatic disease outbreaks detection from global news media

Zaiqiao Meng,Anya Okhmatovskaia,Maxime Polleri,Yannan Shen,Guido Powell,Zihao Fu,Iris Ganser,Meiru Zhang,Nicholas B King,David Buckeridge,Nigel Collier
DOI: https://doi.org/10.1093/bioinformatics/btac497
IF: 5.8
2022-09-15
Bioinformatics
Abstract:Summary: BioCaster was launched in 2008 to provide an ontology-based text mining system for early disease detection from open news sources. Following a 6-year break, we have re-launched the system in 2021. Our goal is to systematically upgrade the methodology using state-of-the-art neural network language models, whilst retaining the original benefits that the system provided in terms of logical reasoning and automated early detection of infectious disease outbreaks. Here, we present recent extensions such as neural machine translation in 10 languages, neural classification of disease outbreak reports and a new cloud-based visualization dashboard. Furthermore, we discuss our vision for further improvements, including combining risk assessment with event semantics and assessing the risk of outbreaks with multi-granularity. We hope that these efforts will benefit the global public health community. Availability and implementation: BioCaster web-portal is freely accessible at http://biocaster.org.
What problem does this paper attempt to address?