A Clustering Chunking Method Based on Manifold Geodesic Distance

LEI Lin,XIONG Wei,JING Ning,XIAO Jianfu
DOI: https://doi.org/10.13209/j.0479-8023.2013.019
2013-01-01
Abstract:Regarding the Chinese chunker analysis as a procedure of inner-sentence word clustering and chunker type labeling,a grammar function space is constructed at first,and then embedded in a lower dimension space by applying ISOMAP to observe the distribution feature of Chinese word in the embedding space.In the hierarchical clustering algorithm which is aiming at partitioning word into different clusters,the manifold geodesic distance is employed instead of Euclidean distance to measure the similarity between words.The algorithm facilitates the increment of Chinese chunker analysis performance under the condition of appropriate algorithm complexity.
What problem does this paper attempt to address?