Improved semantic annotation method for documents based on ontology

陈叶旺,李文,彭鑫,赵文耘
DOI: https://doi.org/10.3969/j.issn.1001-0505.2009.06.005
2009-01-01
Abstract:Based on the semantic context and the structural info of a document, an improved semantic annotation method is proposed. The correlation between the ontology entity and the document and the co-appearance of the label-words frequents and the semantic context in local window are analysed and calculated. Firstly, this method extracts the text content from the document, and then decomposes it into a sub-sentences set, a sentences set and a paragraphs set. For each knowledge item in ontology, the context information of the item is extracted, and then the correlation between these information and those decomposed documents sets is calculated. Finally, the final correlation between the knowledge item and the document in the range of all document base and ontology base are obtained. The experimental results show that based on domain ontology, this method can annotate unstructured documents in web automatically and effectively.
What problem does this paper attempt to address?