Automatic Extraction of Medication Data from Semi-Structured Prescriptions

Johannes Benedict Oehm, Oliver Wenning, Michael Storck, Xiaoyi Jiang, Julian Varghese
DOI: https://doi.org/10.3233/SHTI240749
2024-08-22
Abstract: In many healthcare facilities, drugs are prescribed only in a semi-structured manner, using free-text fields in which different kinds of information are often mixed. Automatic processing, especially for secondary use such as research, is therefore often challenging. This paper compares several approaches for identifying and classifying the parts of these German-language free-text fields: simple Levenshtein-based, rule-based, and CRF (conditional random field)-based approaches. Our results show that an F1-score above 90% can be achieved with both the rule-based and the CRF-based approach, with the CRF-based approach reaching nearly 95%.
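
To make the simplest of the three approaches concrete, the following is a minimal sketch of Levenshtein-based token labeling. It is not the authors' implementation: the abstract gives no implementation details, so the lexicon entries, the distance threshold of 2, the whitespace tokenization, and the label names DRUG and OTHER are all assumptions chosen for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance (insertions, deletions, substitutions) via dynamic programming."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute ca -> cb (or match)
            ))
        previous = current
    return previous[-1]


# Hypothetical mini-lexicon; a real system would need a full drug dictionary.
DRUG_LEXICON = ["ibuprofen", "metoprolol", "pantoprazol"]


def classify_token(token: str, max_distance: int = 2):
    """Label a token as DRUG if it is within max_distance of a lexicon entry."""
    word = token.lower()
    best = min(DRUG_LEXICON, key=lambda entry: levenshtein(word, entry))
    if levenshtein(word, best) <= max_distance:
        return ("DRUG", best)
    return ("OTHER", None)


def label_prescription(text: str):
    """Split a free-text field on whitespace and label each token."""
    return [(token, *classify_token(token)) for token in text.split()]


if __name__ == "__main__":
    # A misspelled drug name is still matched to its lexicon entry.
    for row in label_prescription("Ibuprophen 600mg 1-0-1"):
        print(row)
```

A fixed threshold like this tolerates typos but cannot tell a dose from a dosing schedule, since neither appears in the lexicon. That kind of gap is what rule-based and sequence-labeling approaches such as a CRF are meant to cover.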