Handwritten Text Line Segmentation by Spectral Clustering

Xuecheng Han, Hui Yao, Guoqiang Zhong
DOI: https://doi.org/10.1117/12.2266982
2017-01-01
Abstract: Because handwritten text lines are often skewed and not clearly separated from one another, text line segmentation of handwritten document images remains a challenging problem. In this paper, we propose a novel text line segmentation algorithm based on spectral clustering. Given a handwritten document image, we first convert it to a binary image and then compute the adjacency matrix of the pixel points. We apply spectral clustering to this similarity matrix and use the orthogonal k-means clustering algorithm to group the pixels into text lines. Experiments on a Chinese handwritten document database (HIT-MW) demonstrate the effectiveness of the proposed method.
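The pipeline described in the abstract (binarize, build a pixel affinity matrix, embed spectrally, cluster) can be illustrated with a minimal sketch. This is not the authors' implementation. The abstract does not say how the adjacency matrix is computed, so the Gaussian affinity on pixel coordinates and the bandwidth `sigma` are assumptions. Plain k-means with a farthest-point initialization stands in for the orthogonal k-means step, and the function names `segment_text_lines` and `_kmeans` are hypothetical. The input is assumed to be already binarized, and the dense affinity matrix limits the sketch to small images.

```python
import numpy as np


def _kmeans(points, k, n_iter=50):
    """Plain Lloyd's k-means (a stand-in for orthogonal k-means)."""
    # Deterministic farthest-point initialization.
    centers = [points[0]]
    for _ in range(1, k):
        d = np.min([((points - c) ** 2).sum(1) for c in centers], axis=0)
        centers.append(points[np.argmax(d)])
    centers = np.array(centers)
    for _ in range(n_iter):
        d = ((points[:, None, :] - centers[None, :, :]) ** 2).sum(-1)
        labels = d.argmin(1)
        new = np.array([
            points[labels == j].mean(0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(new, centers):
            break
        centers = new
    return labels


def segment_text_lines(binary, n_lines, sigma=3.0):
    """Group ink pixels of a binary image into n_lines clusters.

    binary: 2D array, nonzero entries are ink pixels (assumption).
    Returns (coords, labels): integer (row, col) pairs and a cluster id each.
    """
    coords = np.argwhere(binary > 0).astype(float)
    n = len(coords)

    # Assumed affinity: Gaussian kernel on Euclidean pixel distance.
    diff = coords[:, None, :] - coords[None, :, :]
    W = np.exp(-(diff ** 2).sum(-1) / (2.0 * sigma ** 2))
    np.fill_diagonal(W, 0.0)

    # Symmetric normalized Laplacian: L = I - D^{-1/2} W D^{-1/2}.
    d_inv_sqrt = 1.0 / np.sqrt(np.maximum(W.sum(1), 1e-12))
    L = np.eye(n) - d_inv_sqrt[:, None] * W * d_inv_sqrt[None, :]

    # Eigenvectors of the n_lines smallest eigenvalues form the embedding.
    _, vecs = np.linalg.eigh(L)
    U = vecs[:, :n_lines]
    U = U / np.maximum(np.linalg.norm(U, axis=1, keepdims=True), 1e-12)

    return coords.astype(int), _kmeans(U, n_lines)
```

On a small synthetic image with two well-separated horizontal strokes, calling `segment_text_lines(img, n_lines=2)` should assign each stroke its own label. Real handwriting with touching or skewed lines would likely need a more careful affinity and a sparse eigensolver.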