LLMs4OL: Large Language Models for Ontology Learning

Hamed Babaei Giglou,Jennifer D'Souza,Sören Auer
DOI: https://doi.org/10.48550/arXiv.2307.16648
2023-08-02
Abstract:We propose the LLMs4OL approach, which utilizes Large Language Models (LLMs) for Ontology Learning (OL). LLMs have shown significant advancements in natural language processing, demonstrating their ability to capture complex language patterns in different knowledge domains. Our LLMs4OL paradigm investigates the following hypothesis: \textit{Can LLMs effectively apply their language pattern capturing capability to OL, which involves automatically extracting and structuring knowledge from natural language text?} To test this hypothesis, we conduct a comprehensive evaluation using the zero-shot prompting method. We evaluate nine different LLM model families for three main OL tasks: term typing, taxonomy discovery, and extraction of non-taxonomic relations. Additionally, the evaluations encompass diverse genres of ontological knowledge, including lexicosemantic knowledge in WordNet, geographical knowledge in GeoNames, and medical knowledge in UMLS.
Artificial Intelligence,Computation and Language,Information Theory,Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to use large - language models (LLMs) for ontology learning (OL). Specifically, the paper explores the following hypothesis: whether large - language models can effectively apply their ability to capture complex language patterns in natural - language processing to ontology - learning tasks, which include automatically extracting and structuring knowledge from natural - language texts. To test this hypothesis, the author conducted a comprehensive evaluation, using the zero - sample prompting method to evaluate nine different large - language - model families for three main ontology - learning tasks: term classification, type - classification discovery, and extraction of non - classificatory relationships. In addition, the evaluation also covered ontology knowledge in different fields, such as lexical - semantic knowledge (WordNet), geographical knowledge (GeoNames), and medical knowledge (UMLS). Through these experiments, the author aims to verify whether large - language models can be used as effective auxiliary tools to alleviate the knowledge - acquisition bottleneck in ontology construction. The research results show that basic large - language models are not entirely suitable for ontology - construction tasks that require high - level reasoning ability and domain - specific knowledge, but after effective fine - tuning, they may become suitable auxiliary tools.