Improving text similarity measurement by critical sentence vector model

Wei Li,Kam-Fai Wong,Chunfa Yuan,Wenjie Li,Yunqing Xia
DOI: https://doi.org/10.1007/11562382_44
2005-01-01
Abstract:We propose the Critical Sentence Vector Model (CSVM), a novel model to measure text similarity. The CSVM accounts for the structural and semantic information of the document. Compared to existing methods based on keyword vector, e.g. Vector Space Model (VSM), CSVM measures documents similarity by measuring similarity between critical sentence vectors extracted from documents. Experiments show that CSVM outperforms VSM in calculation of text similarity.
What problem does this paper attempt to address?