TMCR: A Twin Matching Networks for Chinese Scene Text Retrieval

Zhiheng Peng,Ming Shao,Siyu Xia
DOI: https://doi.org/10.1007/978-3-031-18913-5_24
2022-01-01
Abstract:In this paper, we focus on a critical task of retrieving common style in Chinese scene text: given an image of style text, the system returns all the images matching the queried text image. To that, a novel twin Transformer based matching network is proposed, which is featured by the integration of anchor-free detection, text recognition, and similarity matching networks. On the fly, our model retrieves the similarity of text features in the text area and evaluates it through recognition. Our experiments demonstrate that the proposed model outperforms the state-of-the-art in terms of both processing speed and accuracy. Additional experiments show that our model generalizes well on various benchmarks, including a self-constructed Chinese query data set with complex Chinese scenes in the real world.
What problem does this paper attempt to address?