Drug Safety Signal Detection in a Regional Healthcare Database Using the Tree-Based Scan Statistic and Comparison to 3 Other Mining Methods

Hailong Li,Houyu Zhao,Hongbo Lin,Peng Shen,Siyan Zhan
DOI: https://doi.org/10.1111/bcp.15810
2023-01-01
British Journal of Clinical Pharmacology
Abstract:AimsTo evaluate and compare the relative performance of the tree-based scan statistic (TreeScan) with the crude cohort study, Bayesian confidence propagation neural network (BCPNN) and Gamma Poisson Shrinker (GPS) in detecting statin-related adverse events (AEs) in an electronic healthcare database. MethodsData from a Chinese healthcare database from 2010 to 2016 were evaluated. We identified statin users based on prescription information in their out-/in-patient records, and AEs were defined according to the ICD-10 codes in patients' diagnosis records. TreeScan was applied to detect AE signals related to statin use and was compared with 3 other methods based on sensitivity, specificity, positive predictive value, negative predictive value, accuracy, the Youden index, area under the precision-recall curve and the area under the receiver operating characteristic curve. ResultsA total of 224 187 patients were enrolled and divided into 85 758 statin users and 138 429 nonusers. TreeScan generated 29 positive signals, of which 9 were known AEs. The sensitivities of TreeScan, BCPNN and GPS were all 69.2%, which was higher than that of the crude cohort study (46%). The specificity (82.3%), positive predictive value (31.0%), negative predictive value (95.9%), accuracy (81.0%), Youden index (51.5%) and area under the receiver operating characteristic curve (75.8%) of TreeScan were the highest among the 4 methods. ConclusionTreeScan outperformed the crude cohort, BCPNN and GPS in detecting statin-related AEs in an electronic healthcare database. Therefore, it can be used as a complementary tool for other signal detection methods in drug safety surveillance.
What problem does this paper attempt to address?