Analysis, Understanding and Representation of Chinese Newspaper with Complex Layout

M Chen, XQ Ding, J Liang
DOI: https://doi.org/10.1109/icip.2000.899500
2001-01-01
Abstract: Layout analysis, understanding, and representation are important problems when transforming paper documents into electronic versions. A bottom-up layout analysis algorithm based on nearest-neighbor connect-strength and line confidence is proposed for Chinese newspapers with complex layouts. We also propose a rule-based growth algorithm for layout understanding. The implementation of layout representation is also discussed. These algorithms, together with a Chinese character recognition engine, were used to build a complete system for automatic electronic publishing. Experimental results and a system in practical operation show that the algorithms are efficient and practical.