An Automation Framework for Clinical Codelist Development Validated with UK Data from Patients with Multiple Long-term Conditions

Asra Aslam,Lauren Walker,Micheal Abaho,Harriet Cant,Maurice O’Connell,A S Abuzour,Layik Hama,Pieta Schofield,Frances S Mair,Roy Ruddle,Olusegun Popoola,Matthew Sperrin,Jung Yin Tsang,Eduard Shantsila,Mark Gabbay,Andrew Clegg,Alan Woodall,Iain E Buchan,Samuel Relton
DOI: https://doi.org/10.1101/2024.09.25.24314215
2024-09-26
Abstract:Background: Codelists play a crucial role in ensuring accurate and standardized communication within healthcare. However, preparation of high-quality codelists is a rigorous and time-consuming process. The literature focuses on transparency of clinical codelists and overlooks the utility of automation. Method and Automated Framework Design: Here we present a Codelist Generation Framework that can automate generation of codelists with minimal input from clinical experts. We demonstrate the process using a specific project, DynAIRx, producing appropriate codelists and a framework allowing future projects to take advantage of automated codelist generation. Both the framework and codelist are publicly available. Use-case: DynAIRx: DynAIRx is an NIHR-funded project aiming to develop AIs to help optimise prescribing of medicines in patients with multiple long-term conditions. DynAIRx requires complex codelists to describe the trajectory of each patient, and the interaction between their conditions. We promptly generated ~200 codelists for DynAIRx using the proposed framework and validated them with a panel of experts, significantly reducing the amount of time required by making effective use of automation. Findings and Conclusion: The framework reduced the clinician time required to validate codes, automatically shrunk codelists using trusted sources and added new codes for review against existing codelists. In the DynAIRx case study, a codelist of ~9600 codes required only 7-9 hours of clinician's time in the end (while existing methods takes months), and application of the automation framework reduced the workload by >80%.
What problem does this paper attempt to address?