Segmentation and Analysis of Double-Sided Handwritten Archival Documents

Ruini Cao, Chew Lim Tan, Qian Wang, Peiyi Shen
2000-01-01
Abstract: Historical handwritten documents are preserved in good condition in many national archives and libraries. One problem that many archivists face is the seeping of ink through the pages of certain double-sided handwritten documents after long periods of storage. This paper addresses this problem and develops a novel algorithm to extract clear textual images from interfering and overlapping areas. Based on the critical observation that the edges of strokes seeping through from the reverse side are not as sharp as those of strokes on the front side, we adopt an edge detection approach to suppress unwanted background patterns. First, an improved Canny edge detector with an edge orientation constraint is proposed. These improvements link more weak foreground edges without introducing noise. Second, a new edge expansion model is presented for recovering broken edges of words or characters on the front side. Finally, the outline of the whole document analysis system is illustrated. Segmentation results on real images are shown and evaluated.
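The observation behind the method, that strokes showing through from the reverse side have softer edges than strokes written on the front, can be illustrated with a minimal sketch. This is not the paper's improved Canny detector or its edge expansion model. It only thresholds a central-difference gradient along a 1-D scan line, and the pixel values, the threshold, and the function names are all assumptions chosen for illustration.

```python
def gradient_magnitude(profile):
    """Central-difference gradient magnitude of a 1-D intensity profile."""
    return [abs(profile[i + 1] - profile[i - 1]) / 2.0
            for i in range(1, len(profile) - 1)]


def has_sharp_edge(profile, threshold):
    """True if any gradient magnitude along the profile exceeds the threshold."""
    return max(gradient_magnitude(profile)) > threshold


# Hypothetical scan lines across a stroke boundary (255 = paper, 0 = ink).
front_stroke = [255, 255, 255, 0, 0, 0]         # abrupt transition
reverse_stroke = [255, 230, 190, 140, 100, 80]  # gradual, blurred transition

THRESHOLD = 60.0  # assumed value, not taken from the paper

front_is_edge = has_sharp_edge(front_stroke, THRESHOLD)
reverse_is_edge = has_sharp_edge(reverse_stroke, THRESHOLD)
```

With these assumed values the front-side profile is kept as an edge while the blurred reverse-side profile is suppressed. A plain threshold like this would also drop faint front-side strokes, which is the kind of gap the abstract's edge linking and edge expansion steps are meant to address.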