Evaluating MedDRA-to-ICD terminology mappings

Xinyuan Zhang, Yixue Feng, Fang Li, Jin Ding, Danyal Tahseen, Ezekiel Hinojosa, Yong Chen, Cui Tao
DOI: https://doi.org/10.1186/s12911-023-02375-1
IF: 3.298
2024-02-10
BMC Medical Informatics and Decision Making
Abstract: In this era of big data, data harmonization is an important step to ensure reproducible, scalable, and collaborative research. Thus, terminology mapping is a necessary step to harmonize heterogeneous data. Take the Medical Dictionary for Regulatory Activities (MedDRA) and the International Classification of Diseases (ICD), for example: the mapping between them is essential for drug safety and pharmacovigilance research. Our main objective is to provide a quantitative and qualitative analysis of the mapping status between MedDRA and ICD.
medical informatics
What problem does this paper attempt to address?
This paper sets out to evaluate and improve the quality of term mapping between the Medical Dictionary for Regulatory Activities (MedDRA) and the International Classification of Diseases (ICD). Specifically:

1. **Evaluation of term mapping coverage**: The paper first uses the Unified Medical Language System (UMLS) and the Observational Medical Outcomes Partnership Common Data Model (OMOP CDM) to summarize the current mapping statistics between MedDRA and ICD. The MedDRA Preferred Terms (PTs) mapped through these two resources cover only 27.23% of all MedDRA PTs.
2. **Evaluation of mapping quality**: A systematic quality analysis was carried out on the mapped term pairs. Among the mapping pairs provided by UMLS, only 51.44% were considered exact matches.
3. **Evaluation of unmapped terms**: To improve mapping coverage, the researchers used a self-developed algorithm to recommend the best mapping candidates for unmapped terms. 100 unmapped MedDRA PTs were randomly selected from each System Organ Class (SOC) for evaluation. The results showed that 56 unmapped MedDRA PTs had exact matches in ICD, indicating that the mapping can be expanded further.
4. **Analysis of mapping relationships**: The study also examined the categories of relationship found among mapped pairs, such as "MedDRA terms are broader than ICD terms", "MedDRA terms are narrower than ICD terms", and "partial overlap". These analyses help explain the differences between the two terminologies and offer guidance for future mapping improvements.

Overall, the paper evaluates the current state of term mapping between MedDRA and ICD through quantitative and qualitative methods, and proposes ways to improve mapping quality and coverage. This matters for drug safety and pharmacoepidemiology research.
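The two headline statistics above (coverage of MedDRA PTs and the share of exact matches among reviewed pairs) can be illustrated with a minimal Python sketch. This is not the authors' code: the function names, the `Relation` enum, and the toy identifiers (`PT1`, `ICD_A`, and so on) are all hypothetical, and the sketch assumes that coverage is computed over the union of the PTs mapped by each source.

```python
from collections import Counter
from enum import Enum


class Relation(Enum):
    """Relationship categories named in the summary (labels are illustrative)."""
    EXACT = "exact match"
    MEDDRA_BROADER = "MedDRA term broader than ICD term"
    MEDDRA_NARROWER = "MedDRA term narrower than ICD term"
    PARTIAL_OVERLAP = "partial overlap"


def coverage(all_pts, *mapped_sources):
    """Fraction of PTs mapped by at least one source.

    Assumption: coverage is taken over the union of all sources.
    """
    all_pts = set(all_pts)
    mapped = set().union(*mapped_sources) & all_pts
    return len(mapped) / len(all_pts)


def relation_shares(reviewed_pairs):
    """Share of each relation category among manually reviewed pairs."""
    counts = Counter(relation for _, _, relation in reviewed_pairs)
    total = sum(counts.values())
    return {relation: counts[relation] / total for relation in Relation}


# Toy data only: placeholder identifiers, not real MedDRA or ICD codes.
all_pts = {"PT1", "PT2", "PT3", "PT4"}
umls_mapped = {"PT1", "PT2"}   # hypothetical PTs mapped via UMLS
omop_mapped = {"PT2", "PT3"}   # hypothetical PTs mapped via OMOP CDM

reviewed_pairs = [
    ("PT1", "ICD_A", Relation.EXACT),
    ("PT2", "ICD_B", Relation.EXACT),
    ("PT2", "ICD_C", Relation.MEDDRA_BROADER),
    ("PT3", "ICD_D", Relation.PARTIAL_OVERLAP),
]

pt_coverage = coverage(all_pts, umls_mapped, omop_mapped)
shares = relation_shares(reviewed_pairs)

print(f"Coverage: {pt_coverage:.2%}")
for relation, share in shares.items():
    print(f"{relation.value}: {share:.2%}")
```

Using sets keeps a PT that appears in both sources from being counted twice, which matters when two resources overlap heavily. The relation labels would in practice come from manual expert review, as the summary describes, rather than from code.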