Enriching a Text by Semantic Disambiguation for Information Extraction

Bernard Jacquemin, Caroline Brun, Claude Roux
DOI: https://doi.org/10.48550/arXiv.cs/0506048
2005-06-13
Abstract: External linguistic resources have long been used in information extraction. Methods that rely on them enrich a document with semantically equivalent data in order to improve recall. For instance, some of these methods use synonym dictionaries to enrich a sentence with words that have a similar meaning. However, these methods have a serious drawback: words are usually synonyms only in restricted contexts. The method we propose here uses word sense disambiguation (WSD) rules to restrict the selection of synonyms to only those that match a specific syntactico-semantic context. We show how WSD rules are built and how information extraction techniques can benefit from the application of these rules.
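The idea of context-restricted synonym enrichment can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the rule format (`WsdRule`), the relation names, the sense labels, and the toy data are hypothetical and are not taken from the authors' system.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class WsdRule:
    """Hypothetical WSD rule: a word gets a sense when it appears in a given syntactic context."""
    word: str           # ambiguous word the rule applies to
    relation: str       # syntactic relation linking the word to its context
    context_lemma: str  # lemma required at the other end of the relation
    sense: str          # sense label selected when the rule matches


# Toy rules and synonym lists (illustrative data only).
RULES = [
    WsdRule("bank", "object_of", "rob", "bank/financial"),
    WsdRule("bank", "modifier", "river", "bank/shore"),
]
SYNONYMS = {
    "bank/financial": ["financial institution"],
    "bank/shore": ["shore", "riverside"],
}


def expand(word, dependencies):
    """Return synonyms for `word` only when a rule matches its context.

    `dependencies` is a set of (relation, lemma) pairs observed for `word`
    in the sentence being enriched.
    """
    for rule in RULES:
        if rule.word == word and (rule.relation, rule.context_lemma) in dependencies:
            return SYNONYMS.get(rule.sense, [])
    # No rule matched: add nothing rather than risk an out-of-context synonym.
    return []


print(expand("bank", {("object_of", "rob")}))
print(expand("bank", {("subject_of", "close")}))
```

In this sketch a sentence is enriched only when a rule matches the observed context; a plain dictionary lookup would instead add every synonym of "bank" regardless of sense.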
Information Retrieval