Chinese Text Summarization Algorithm Based on Word2vec

Xu Chengzhang,Liu Dan
DOI: https://doi.org/10.1088/1742-6596/976/1/012006
2018-02-01
Journal of Physics: Conference Series
Abstract:In order to extract some sentences that can cover the topic of a Chinese article, a Chinese text summarization algorithm based on Word2vec is used in this paper. Words in an article are represented as vectors trained by Word2vec, the weight of each word, the sentence vector and the weight of each sentence are calculated by combining word-sentence relationship with graph-based ranking model. Finally the summary is generated on the basis of the final sentence vector and the final weight of the sentence. The experimental results on real datasets show that the proposed algorithm has a better summarization quality compared with TF-IDF and TextRank.
What problem does this paper attempt to address?