An Instance Learning Approach for Automatic Semantic Annotation

Wang Shu,Chen Enhong
DOI: https://doi.org/10.1007/978-3-540-30497-5_148
2004-01-01
Abstract:Currently there appear only few practical semantic web applications. The reason is mainly in that a large number of existed web documents contain only machine-unreadable information on which software agent can do nothing. There have been some works devoting to web document annotation manually or semi-automatically to solve this problem. This paper presents an automatic approach for web document annotation based on specific domain ontology. Because complete semantic annotation of web document is still a tough task, we simplify the problem by annotating ontology concept instances on web documents and propose an Ontology Instance Learning (OIL) method to extract instances from structure and free text of web documents. These instances of the ontology concept will be used to annotate web pages in the related domain. Our OIL method exhibits quite good performance in real life web documents as shown in our experiment.
What problem does this paper attempt to address?