Multi-scene ancient Chinese text recognition with deep coupled alignments

Kaili Wang,Yaohua Yi,Ziwei Tang,Jibing Peng
DOI: https://doi.org/10.1016/j.asoc.2021.107475
IF: 8.7
2021-09-01
Applied Soft Computing
Abstract:The task of multi-scene ancient Chinese text recognition (MACR) is challenging due to large-scale categories, high intra-class variance and inter-class similarity and complicated backgrounds. Little effort has been devoted to MACR research due to insufficient datasets and language barrier. Because the sub-dataset generation process of sub-dataset is mutually blind, there are discrepancies in the class category number, deep feature representation and class center distribution after the dataset statistics and character analysis are performed. The general deep learning method that assumes that data are independent and identically distributed is inappropriate. The deep coupled alignments (CA) module based on domain adaptation is presented to alleviate domain and class center shifts. In addition, a cross-domain fusion (CF) module is proposed to mitigate negative transfer in partial domain adaptation by updating the target domain with the full-class and augmenting the source domain with pseudo labeled samples. Extensive experiments of the proposed method are conducted, and the results illustrate the superiority of CA–CF to previous methods in terms of the model size and recognition accuracy.
computer science, artificial intelligence, interdisciplinary applications
What problem does this paper attempt to address?