HITIR's Update Summary at TAC 2008: Extractive Content Selection for Language Independence.

Ruifang He,Yang Liu,Bing Qin,Ting Liu,Sheng Li
2008-01-01
Theory and applications of categories
Abstract:The update summary aims to capture evolving information of a single topic changing over time. It delivers salient and novel information to a user who has already read a set of older documents covering the same topic. According to the new challenges brought by update summary, we propose the evolutionary manifold-ranking algorithm, and further integrate the sub-topics partition with spectral clustering to have a content selection, which is completely language independence. Three systems: 11, 41 and 62 are submitted. Our best system ranks three top 1 under average modified (pyramid) score, average numSCUs and macro-average modified score with 3 models of PYRAMID, ranks 13 th in ROUGE-2, ranks 15 th in ROUGE-SU4 and ranks 17 th in BE. Though the evaluation results show the interesting performance of the proposed method, yet the problem is far from solved.
What problem does this paper attempt to address?