CFS: Character Feature Summarization Model for Real-time End-to-end Text Spotting

Chuanyang Gong, Heifei Mei, Heqian Qiu, Xinpeng Hao, Jian Jiao, Shiyuan Tang, Hongliang Li
DOI: https://doi.org/10.1109/vcip59821.2023.10402719
2023-01-01
Abstract: Most real-time end-to-end text spotting methods employ sequence models as their recognition heads. However, these models generate characters one at a time, which is inefficient for text instances with many characters. To solve this problem, we propose a Character Feature Summarization (CFS) Model, which predicts a fixed-length character sequence in parallel, regardless of the text length. Specifically, we propose a Character Feature Summarization Module (CFSM), consisting of a Global Feature Capture and a Historical Feature Summarizer, that extracts and summarizes global character features so that characters can be obtained by simple linear prediction. We use Multi-stage Testing, which cascades multiple CFSMs to obtain multi-stage summarized global character features and thus several predictions, for better convergence. A Result Selector then selects the most likely result. Experiments on the Total-Text dataset show that CFS achieves a 3.53% improvement on the "Full" metric while being 3.6 times faster than ABCNet v2’s head.
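The parallel prediction idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the padding index, the averaging of per-character confidences, and every function name (`linear`, `predict_parallel`, `select_result`) are assumptions made for illustration only. The sketch assumes each of a fixed number of character slots already has a summarized feature vector, classifies every slot with the same linear layer independently of the others, and then picks the highest-scoring candidate among several stage outputs.

```python
import math

PAD = 0  # hypothetical padding index marking unused character slots


def linear(x, weight, bias):
    """Apply y = W x + b to a single feature vector."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weight, bias)]


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def predict_parallel(slot_features, weight, bias, charset):
    """Classify every slot independently of the other slots' outputs.

    No iteration reads a previous prediction, so the loop could be
    replaced by one batched matrix multiplication.
    """
    chars, confidences = [], []
    for feat in slot_features:
        probs = softmax(linear(feat, weight, bias))
        idx = max(range(len(probs)), key=probs.__getitem__)
        if idx != PAD:  # drop padded slots so short words stay short
            chars.append(charset[idx])
            confidences.append(probs[idx])
    # Assumed scoring rule: mean confidence of the kept characters.
    score = sum(confidences) / len(confidences) if confidences else 0.0
    return "".join(chars), score


def select_result(stage_predictions):
    """Return the (text, score) pair with the highest score."""
    return max(stage_predictions, key=lambda p: p[1])


# Toy usage: 3 slots, 3 classes, identity weights (all values are made up).
charset = ["<pad>", "a", "b"]
weight = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
bias = [0, 0, 0]
features = [[0, 5, 0], [0, 0, 5], [5, 0, 0]]

text, score = predict_parallel(features, weight, bias, charset)
best = select_result([("ab", 0.6), ("abc", 0.9)])
```

In contrast to a sequence decoder, the cost here does not grow with the number of decoding steps, because no slot waits for the previous character to be emitted.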