Comprehensive Printed Tibetan/English Mixed Text Segmentation Method

H Wang, Xq Ding
DOI: https://doi.org/10.1117/12.528949
2004-01-01
Abstract: Text segmentation plays a crucial role in a text recognition system. A comprehensive method is proposed for segmenting printed Tibetan/English mixed text. Two algorithms, one based on Tibetan inter-syllabic tshegs and the other on a discriminant function, are presented to perform skew detection before text line separation. A dynamic recursive character segmentation algorithm that integrates multi-level information is then developed. Encouraging experimental results on a large-scale Tibetan/English mixed text set show the validity of the proposed method.
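
The abstract does not describe how text lines are separated once skew has been corrected. As a rough illustration only, the sketch below uses a generic horizontal projection profile, a common baseline for this step and not necessarily the authors' technique. It assumes a deskewed, binarized page in which ink pixels are 1 and background pixels are 0; the function name `separate_text_lines` and the `min_ink` threshold are hypothetical.

```python
from typing import List, Tuple


def separate_text_lines(image: List[List[int]], min_ink: int = 1) -> List[Tuple[int, int]]:
    """Return (top, bottom) row ranges, bottom exclusive, one per text line.

    Generic projection-profile baseline, not the paper's algorithm.
    Assumes `image` is a deskewed binary page (1 = ink, 0 = background).
    """
    # Horizontal projection profile: amount of ink in each pixel row.
    profile = [sum(row) for row in image]

    lines: List[Tuple[int, int]] = []
    start = None  # first row of the text line currently being scanned
    for y, ink in enumerate(profile):
        if ink >= min_ink and start is None:
            start = y                  # entering a text line
        elif ink < min_ink and start is not None:
            lines.append((start, y))   # leaving a text line
            start = None
    if start is not None:              # a line that touches the bottom edge
        lines.append((start, len(profile)))
    return lines


# Tiny synthetic page: two ink bands separated by one blank row.
page = [
    [0, 0, 0, 0],
    [1, 1, 0, 1],
    [1, 0, 1, 1],
    [0, 0, 0, 0],
    [0, 1, 1, 0],
]
print(separate_text_lines(page))  # [(1, 3), (4, 5)]
```

A plain profile like this depends on the page being deskewed first, which is consistent with the abstract placing skew detection before text line separation.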