Joint Learning of Frequency and Word Embeddings for Multilingual Readability Assessment.

Dieu-Thu Le, Cam-Tu Nguyen, Xiaoliang Wang
DOI: https://doi.org/10.18653/v1/w18-3714
2018-01-01
Abstract: This paper describes two models that employ word frequency embeddings for readability assessment in multiple languages. The task is to determine the difficulty level of a given document, i.e., how hard it is for a reader to fully comprehend the text. The proposed models show how frequency information can be integrated to improve readability assessment. Experimental results on both English and Chinese datasets show that the proposed models notably improve on models that use only traditional word embeddings.