A Context-Aware Approach for the Identification of Complex Words in Natural Language Texts

Elnaz Davoodi, Leila Kosseim, Matthew Mongrain
DOI: https://doi.org/10.1109/icsc.2017.9
2017-01-01
Abstract: This paper evaluates the effect of context on the identification of complex words in natural language texts. The approach automatically tags each word as complex or not complex using two sets of features: base features, which pertain only to the target word, and contextual features, which take the target word's context into account. We experimented with several supervised machine learning models, and trained and tested the approach on the SemEval-2016 dataset. Results show that contextual features significantly improve the identification of complex words: the F-measure reaches 0.260 with them, compared to 0.184 without.
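The split between base and contextual features can be illustrated with a minimal sketch. The abstract does not list the actual features, so every feature below (word length, vowel count, average neighbor length, sentence length), the window size, and all function names are hypothetical placeholders rather than the authors' method. The F-measure helper assumes the balanced F1 score on the "complex" class, which the abstract does not specify.

```python
from typing import Dict, List


def base_features(tokens: List[str], i: int) -> Dict[str, float]:
    """Features that depend only on the target word (hypothetical choices)."""
    word = tokens[i]
    return {
        "length": float(len(word)),
        "vowels": float(sum(ch in "aeiou" for ch in word.lower())),
    }


def contextual_features(tokens: List[str], i: int, window: int = 2) -> Dict[str, float]:
    """Features computed from the words around the target (hypothetical choices)."""
    left = tokens[max(0, i - window):i]
    right = tokens[i + 1:i + 1 + window]
    neighbours = left + right
    # Average length of nearby words; 0.0 when the target has no neighbours.
    avg_len = sum(len(w) for w in neighbours) / len(neighbours) if neighbours else 0.0
    return {
        "ctx_avg_length": avg_len,
        "sentence_length": float(len(tokens)),
    }


def f_measure(gold: List[int], pred: List[int]) -> float:
    """Balanced F1 for the positive (complex = 1) class; an assumed definition."""
    tp = sum(1 for g, p in zip(gold, pred) if g == 1 and p == 1)
    fp = sum(1 for g, p in zip(gold, pred) if g == 0 and p == 1)
    fn = sum(1 for g, p in zip(gold, pred) if g == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    tokens = ["The", "cat", "perambulated", "home"]
    # A context-aware feature vector merges both dictionaries for the target word.
    features = {**base_features(tokens, 2), **contextual_features(tokens, 2)}
    print(features)
    print(f_measure([1, 0, 1, 0], [1, 1, 0, 0]))
```

In a setup like this, a baseline classifier would be trained on the output of `base_features` alone and a context-aware one on the merged dictionary, and the two F-measures would then be compared.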