An Automatic Online News Topic Keyphrase Extraction System

Canhui Wang,Min Zhang,Liyun Ru,Shaoping Ma
DOI: https://doi.org/10.1109/wiiat.2008.225
2008-01-01
Abstract:News Topics are related to a set of keywords or keyphrases. Topic keyphrases briefly describe the key content of topics and help users decide whether to do further reading about them. Moreover, keyphrases of a news topic can be considered as a cluster of related terms, which provides term relationship information that can be integrated into information retrieval models. In this paper, an automatic online news topic keyphrase extraction system is proposed. News stories are organized into topics. Keyword candidates are firstly extracted from single news stories and filtered with topic information. Then a phrase identification process combines keywords into phrases using position information. Finally, the phrases are ranked and top ones are selected as topic keyphrases. Experiments performed on practical Web datasets show that the proposed system works effectively, with a performance of precision=70.61% and recall=67.94%.
What problem does this paper attempt to address?