MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and Understanding.

Zhanghui Kuang,Hongbin Sun,Zhizhong Li,Xiaoyu Yue,Tsui Hin Lin,Jianyong Chen,Huaqiang Wei,Yiqin Zhu,Tong Gao,Wenwei Zhang,Kai Chen,Wayne Zhang,Dahua Lin
DOI: https://doi.org/10.1145/3474085.3478328
2021-01-01
Abstract:We present MMOCR-an open-source toolbox which provides a comprehensive pipeline for text detection and recognition, as well as their downstream tasks such as named entity recognition and key information extraction. MMOCR implements 14 state-of-the-art algorithms, which is significantly more than all the existing open-source OCR projects we are aware of to date. To facilitate future research and industrial applications of text recognition-related problems, we also provide a large number of trained models and detailed benchmarks to give insights into the performance of text detection, recognition and understanding. MMOCR is publicly released at this https URL.
What problem does this paper attempt to address?