Named Entity Resolution in Chinese News Comments on the Web

Liang Zong,Xiaojun Wan,Lihong Zhao,Jianwu Yang,Yuqian Wu
DOI: https://doi.org/10.1109/apweb.2010.20
2010-01-01
Abstract:News comment is a new text genre which people use to express their opinions on recent news events. Different from normal text corpus, news comments have some particular properties. The named entities in the news comments usually use some wrongly written words, informal abbreviations or aliases, which bring great difficulties for machine detection and understanding. This paper addresses the issue of named entity resolution in Chinese news comments on the web, which is a special case of coreference resolution. Traditional resolution algorithms have some limitations for this special task. In this paper, we first define the special task, and then propose a novel resolution algorithm with new features to improve the resolution performance. We manually labeled a benchmark dataset with 60 pieces of news and their corresponding comments downloaded from a popular Chinese news portal and the experimental results on the dataset show that our algorithm is effective for this special task.
What problem does this paper attempt to address?