Text Extraction Algorithm Based on Binary Clustering

戴维,张申生
DOI: https://doi.org/10.3724/sp.j.1087.2009.00057
2009-01-01
Journal of Computer Applications
Abstract:To deal with the gradient problem in the clustering process of text extraction, an algorithm based on binary clustering was proposed. The original image was converted to binary bitmap after preprocessing. The background blocks of the image were clustered by the region features, and then text blocks were recognized by the distribution features. The experiment shows this method achieves satisfactory result on various kinds of images.
What problem does this paper attempt to address?