Active Learning Pipeline to Identify Candidate Terms for a CDSS Ontology
Xia Jing,Rohan Goli,Keerthana Komatineni,Shailesh Alluri,Nina Hubig,Hua Min,Yang Gong,Dean F Sittig,Paul Biondich,David Robinson,Christian Nøhr,Arild Faxvaag,Adam Wright,Timothy Law,Lior Rennert,Ronald Gimbel
DOI: https://doi.org/10.3233/SHTI240660
2024-08-22
Abstract:Ontology is essential for achieving health information and information technology application interoperability in the biomedical fields and beyond. Traditionally, ontology construction is carried out manually by human domain experts (HDE). Here, we explore an active learning approach to automatically identify candidate terms from publications, with manual verification later as a part of a deep learning model training and learning process. We introduce the overall architecture of the active learning pipeline and present some preliminary results. This work is a critical and complementary component in addition to manually building the ontology, especially during the long-term maintenance stage.