Development and validation of HBV surveillance models using big data and machine learning

Weinan Dong,Cecilia Clara Da Roza,Dandan Cheng,Dahao Zhang,Yuling Xiang,Wai Kay Seto,William C. W. Wong
DOI: https://doi.org/10.1080/07853890.2024.2314237
IF: 5.348
2024-02-12
Annals of Medicine
Abstract:Background The construction of a robust healthcare information system is fundamental to enhancing countries' capabilities in the surveillance and control of hepatitis B virus (HBV). Making use of China's rapidly expanding primary healthcare system, this innovative approach using big data and machine learning (ML) could help towards the World Health Organization's (WHO) HBV infection elimination goals of reaching 90% diagnosis and treatment rates by 2030. We aimed to develop and validate HBV detection models using routine clinical data to improve the detection of HBV and support the development of effective interventions to mitigate the impact of this disease in China.
medicine, general & internal
What problem does this paper attempt to address?
The paper aims to address the issue of Hepatitis B Virus (HBV) detection and monitoring in China. Specifically, the research objective is to develop and validate an HBV monitoring model using big data and machine learning techniques to improve HBV detection rates and support the formulation of effective interventions to mitigate the impact of the disease in China. By building such a system, the researchers hope to contribute to achieving the World Health Organization's (WHO) goal of reaching a 90% HBV diagnosis and treatment rate by 2030. The study utilized routine clinical data from China's rapidly developing primary healthcare system, structured the data using advanced natural language processing techniques, and employed various machine learning methods to establish an HBV risk assessment model. Ultimately, the research team developed a simplified model that performed well in terms of discrimination ability (AUC=0.78) and goodness-of-fit, showing potential for deployment in clinical practice.