A Cross-Font Image Retrieval Network for Recognizing Undeciphered Oracle Bone Inscriptions

Zhicong Wu, Qifeng Su, Ke Gu, Xiaodong Shi
2024-09-10
Abstract: Oracle Bone Inscription (OBI) is the earliest mature writing system known in China to date, and it represents a crucial stage in the development of hieroglyphs. Nevertheless, the substantial number of undeciphered OBI characters remains a persistent challenge for scholars, and conventional methods of ancient script research are both time-consuming and labor-intensive. In this paper, we propose a cross-font image retrieval network (CFIRN) to decipher OBI characters by establishing associations between OBI characters and other script forms, simulating the interpretive behavior of paleography scholars. Concretely, our network employs a siamese framework to extract deep features from character images of various fonts, fully exploring structural clues at different resolutions through a purpose-designed multiscale feature integration (MFI) module and a multiscale refinement classifier (MRC). Extensive experiments on three challenging cross-font image retrieval datasets demonstrate that, given undeciphered OBI characters, our CFIRN can effectively achieve accurate matches with characters from other gallery fonts.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem this paper addresses is the recognition of undeciphered characters in Oracle Bone Inscriptions (OBI). OBI is the earliest known mature writing system in China and has significant historical and cultural value. However, owing to the complexity and diversity of the script, a large number of oracle bone characters remain undeciphered. Traditional manual interpretation is both time-consuming and labor-intensive. To address this, the authors propose a Cross-Font Image Retrieval Network (CFIRN), which establishes associations between oracle bone characters and other script forms, simulating the interpretive behavior of paleography scholars. CFIRN uses deep learning to extract features from character images of different fonts and uses these features to match undeciphered oracle bone characters with characters in known fonts, thereby automating part of the decipherment process.

### Main contributions

1. **First proposal**: To the authors' knowledge, this is the first work to apply a unified cross-font image retrieval network to the task of oracle bone character decipherment.
2. **Innovative modules**: The Multiscale Feature Integration (MFI) module and the Multiscale Refinement Classifier (MRC) are introduced to enhance the extraction of multiscale and subtle semantic information.
3. **Superior performance**: CFIRN compares favorably with existing image retrieval models on three challenging ancient-font image retrieval datasets.

### Method overview

The overall architecture of CFIRN is based on a Siamese network. A ConvNeXt encoder extracts multiscale features in two branches (the oracle bone branch and the gallery font branch). These features are then fused by the MFI module and further refined by the MRC.
Finally, the network is trained with a combination of cross-entropy loss, KL-divergence loss, and triplet loss, minimizing the total loss function:

\[ L = \text{CEL} + \text{KL} + \alpha\,\text{TL} \]

where:

- \(\text{CEL}\) is the cross-entropy loss, which is used to optimize network parameters:

  \[ \text{CEL}_i = -\sum_{n=1}^{N} p(x_i, n) \log q(x_i, n) \]

- \(\text{KL}\) is the KL-divergence loss.
- \(\text{TL}\) is the triplet loss, which is used to reduce the distance between feature vectors of the same category:

  \[ \text{TL}_i = \max\left(\|V_i - V_p\|_2 - \|V_i - V_n\|_2 + M,\ 0\right) \]

  Here \(V_i\), \(V_p\), and \(V_n\) are the feature vectors of the anchor, a positive sample (same category), and a negative sample (different category), and \(M\) is the margin.
- \(\alpha\) is a weight coefficient, which is set to 5.

Through this method, CFIRN can establish effective associations between characters from different historical periods, thereby supporting efficient interpretation of oracle bone characters.
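
To make the training objective concrete, the loss terms can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names are hypothetical, the margin value `1.0` in the usage note is an arbitrary choice (the summary does not give \(M\)), and the summary does not say which two distributions the KL term compares, so `kl_divergence` simply takes any two probability vectors.

```python
import math

def cross_entropy(p, q, eps=1e-12):
    """CEL_i = -sum_n p(x_i, n) * log q(x_i, n), for target p and prediction q."""
    return -sum(pn * math.log(qn + eps) for pn, qn in zip(p, q))

def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) = sum_n p_n * log(p_n / q_n); terms with p_n = 0 contribute 0.

    Assumption: which distributions the paper feeds into its KL term is not
    stated in the summary, so p and q are generic probability vectors here.
    """
    return sum(pn * math.log((pn + eps) / (qn + eps))
               for pn, qn in zip(p, q) if pn > 0)

def l2_distance(u, v):
    """Euclidean distance ||u - v||_2 between two feature vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

def triplet_loss(anchor, positive, negative, margin):
    """TL_i = max(||V_i - V_p||_2 - ||V_i - V_n||_2 + M, 0)."""
    return max(l2_distance(anchor, positive)
               - l2_distance(anchor, negative) + margin, 0.0)

def total_loss(cel, kl, tl, alpha=5.0):
    """L = CEL + KL + alpha * TL, with alpha = 5 as reported in the summary."""
    return cel + kl + alpha * tl
```

For example, with an anchor at `[0, 0]`, a positive at `[3, 4]` (distance 5), and a negative at `[0, 1]` (distance 1), `triplet_loss(..., margin=1.0)` evaluates to `5 - 1 + 1 = 5`: the negative is closer to the anchor than the positive, so the loss is large. When the negative is farther from the anchor than the positive by at least the margin, the loss is clamped to zero.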