Unified HMM-based Layout Analysis Framework and Algorithm

Chen Ming, Ding Xiaoqing, Wu Youshou
DOI: https://doi.org/10.1360/02yf0135
2003-01-01
Science in China Series F Information Sciences
Abstract: To address the layout analysis problem for complex or irregular document images, this paper presents a unified HMM-based layout analysis framework. Based on the multi-resolution wavelet analysis of the document image, we use HMMs in both the inner-scale image model and the trans-scale context model to classify pixel regions as text, picture, or background. At each scale, an HMM direct segmentation method is used to obtain a better inner-scale classification result. A second HMM then fuses the inner-scale results across scales to obtain a better final segmentation result. The optimized algorithm applies a stop rule in the coarse-to-fine multi-scale segmentation process, which improves speed remarkably. Experiments demonstrate the efficiency of the proposed algorithm.
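To make the inner-scale classification step more concrete, the following is a minimal sketch of HMM decoding over a single row of image blocks, with the three region classes named in the abstract as hidden states. It is not the authors' method: the paper works on wavelet features in two dimensions and across scales, whereas this example is a plain one-dimensional Viterbi decoder. The function name `viterbi`, the helper `to_log`, and all probability values are assumptions chosen for illustration only.

```python
import math

STATES = ("text", "picture", "background")


def viterbi(log_emissions, log_trans, log_prior):
    """Return the most likely label sequence for one row of image blocks.

    log_emissions[t][s]: log-likelihood of block t's features under state s
    log_trans[p][s]:     log-probability of moving from state p to state s
    log_prior[s]:        log-probability of starting in state s
    """
    n_states = len(log_prior)
    score = [log_prior[s] + log_emissions[0][s] for s in range(n_states)]
    backptrs = []
    for obs in log_emissions[1:]:
        new_score, ptr = [], []
        for s in range(n_states):
            # Best predecessor for state s at this block.
            best_prev = max(range(n_states),
                            key=lambda p: score[p] + log_trans[p][s])
            ptr.append(best_prev)
            new_score.append(score[best_prev] + log_trans[best_prev][s] + obs[s])
        score = new_score
        backptrs.append(ptr)
    # Trace back from the best final state.
    state = max(range(n_states), key=lambda s: score[s])
    path = [state]
    for ptr in reversed(backptrs):
        state = ptr[state]
        path.append(state)
    path.reverse()
    return [STATES[s] for s in path]


def to_log(rows):
    """Convert a table of probabilities to log-probabilities."""
    return [[math.log(p) for p in row] for row in rows]


# Illustrative (made-up) model: neighbouring blocks tend to share a label.
log_prior = [math.log(1 / 3)] * 3
log_trans = to_log([[0.8, 0.1, 0.1],
                    [0.1, 0.8, 0.1],
                    [0.1, 0.1, 0.8]])

# The middle block weakly resembles a picture; its neighbours look like text.
ambiguous_row = to_log([[0.8, 0.1, 0.1],
                        [0.4, 0.5, 0.1],
                        [0.8, 0.1, 0.1]])
labels = viterbi(ambiguous_row, log_trans, log_prior)
print(labels)
```

In this toy setting the transition model acts as the spatial context: the middle block, which on its own would be labelled "picture", is decoded as "text" because its neighbours are strongly text-like. A per-block classifier without the HMM would not perform this smoothing.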