HeNeCOn: An ontology for integrative research in Head and Neck cancer

Liss Hernández,Estefanía Estévez-Priego,Laura López-Pérez,María Fernanda Cabrera-Umpiérrez,María Teresa Arredondo,Giuseppe Fico,BD2Decide Consortium,Tito Poli,Silvia Rossi,Elena Martinelli,Lisa Licitra,Stefano Cavalieri,Loris De Cecco,Silvana Canevari,Kathrin Scheckenbach,Ruud H Brakenhoff,Irene Nauta,Frank J P Hoebers,Frederik W R Wesseling,Annalisa Trama,Gemma Gatta
DOI: https://doi.org/10.1016/j.ijmedinf.2023.105284
Abstract:Background: Head and Neck Cancer (HNC) has a high incidence and prevalence in the worldwide population. The broad terminology associated with these diseases and their multimodality treatments generates large amounts of heterogeneous clinical data, which motivates the construction of a high-quality harmonization model to standardize this multi-source clinical data in terms of format and semantics. The use of ontologies and semantic techniques is a well-known approach to face this challenge. Objective: This work aims to provide a clinically reliable data model for HNC processes during all phases of the disease: prognosis, treatment, and follow-up. Therefore, we built the first ontology specifically focused on the HNC domain, named HeNeCOn (Head and Neck Cancer Ontology). Methods: First, an annotated dataset was established to provide a formal reference description of HNC. Then, 170 clinical variables were organized into a taxonomy, and later expanded and mapped to formalize and integrate multiple databases into the HeNeCOn ontology. The outcomes of this iterative process were reviewed and validated by clinicians and statisticians. Results: HeNeCOn is an ontology consisting of 502 classes, a taxonomy with a hierarchical structure, semantic definitions of 283 medical terms and detailed relations between them, which can be used as a tool for information extraction and knowledge management. Conclusion: HeNeCOn is a reusable, extendible and standardized ontology which establishes a reference data model for terminology structure and standard definitions in the Head and Neck Cancer domain. This ontology allows handling both current and newly generated knowledge in Head and Neck cancer research, by means of data linking and mapping with other public ontologies.
What problem does this paper attempt to address?