Word Sense Disambiguation of Tags in XML Documents Based on WordNet

潘有能,滕海明
DOI: https://doi.org/10.13833/j.cnki.is.2014.03.008
IF: 8.1
2014-01-01
Information Sciences
Abstract:The tags are important to represent and control the content of XML documents, but it is common that there is semantic ambiguity in user-defined tags. Word Sense Disambiguation is useful to calculate the semantic similarity of XML documents, and it's also the foundation of XML document clustering and classification. Differ from traditional dictionaries, WordNet arranges the words with hierarchical structure like a tree and provides advantage to Word Sense Disambiguation. The paper introduces the existing algorithms of Word Sense Disambiguation, then analyzes the possibility of word sense disambiguation of XML documents tags based on WordNet, and explains the procedures in detail. The experimental result proves that this method has a high accuracy rate in Word Sense Disambiguation.
What problem does this paper attempt to address?