U-Net Architecture for Ancient Handwritten Chinese Character Detection in Han Dynasty Wooden Slips

Hojun SHIMOYAMA, Soh YOSHIDA, Takao FUJITA, Mitsuji MUNEYASU
DOI: https://doi.org/10.1587/transfun.2023smp0007
2023-01-01
Abstract: Recent character detectors based on deep neural networks have achieved high performance in tasks such as text detection in natural scenes and character detection in historical documents. However, existing methods cannot achieve high detection accuracy on wooden slips because the characters vary widely in size and aspect ratio, are densely packed, and lie close to one another. In this study, we propose a new U-Net-based character detection and localization framework that learns both character regions and the boundaries between characters. The proposed method improves the learning of character regions by simultaneously learning the vertical and horizontal boundaries between characters. Furthermore, simple, low-cost post-processing that uses the learned boundary regions makes it possible to locate closely neighboring characters more accurately. We also construct a wooden slip dataset. Experiments demonstrate that the proposed method outperforms existing character detection methods, including state-of-the-art character detection methods for historical documents.
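The abstract does not spell out the post-processing step, so the following is only a minimal sketch of one plausible reading: pixels predicted as character region are kept unless either boundary map marks them as a boundary, and each remaining connected blob becomes one bounding box. The function name `detect_characters`, the parameter names `region`, `v_boundary`, `h_boundary`, the threshold `thr`, and the use of 4-connected components are all assumptions made for illustration, not details taken from the paper.

```python
from collections import deque


def detect_characters(region, v_boundary, h_boundary, thr=0.5):
    """Split a predicted character-region map into per-character boxes.

    All three inputs are same-sized 2D lists of scores in [0, 1]
    (hypothetical network outputs). Returns a list of inclusive boxes
    (top, left, bottom, right), in raster-scan order of first pixel.
    """
    rows, cols = len(region), len(region[0])

    # Keep a pixel only if it is predicted as character region AND
    # neither boundary map flags it as a boundary between characters.
    mask = [
        [
            region[r][c] >= thr
            and v_boundary[r][c] < thr
            and h_boundary[r][c] < thr
            for c in range(cols)
        ]
        for r in range(rows)
    ]

    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if not mask[r][c] or seen[r][c]:
                continue
            # Breadth-first search over one 4-connected component,
            # tracking its bounding box as pixels are visited.
            queue = deque([(r, c)])
            seen[r][c] = True
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (
                        0 <= ny < rows
                        and 0 <= nx < cols
                        and mask[ny][nx]
                        and not seen[ny][nx]
                    ):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


# Toy example: two stacked characters whose regions touch (5 rows x 3 cols).
region = [[1.0] * 3 for _ in range(5)]
no_boundary = [[0.0] * 3 for _ in range(5)]
# Assumed boundary prediction: a horizontal line on row 2 between the two.
h_boundary = [[1.0 if r == 2 else 0.0 for _ in range(3)] for r in range(5)]

merged = detect_characters(region, no_boundary, no_boundary)
split = detect_characters(region, no_boundary, h_boundary)
```

In this toy case the region map alone yields a single merged box, while carving out the boundary row yields two separate boxes, which is the kind of separation of closely spaced characters the abstract attributes to the boundary-based post-processing.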
computer science, information systems, engineering, electrical & electronic, hardware & architecture