Abstract:Deep convolutional neuralnetworks have achieved fairly high accuracy for single online handwritten Chinese character recognition (SOLHCCR). However, in real application scenarios, users always write multiple characters to form a complete sentence, and previous contextual information holds significant potential for improving the accuracy, robustness and efficiency of recognition. In this work, we first propose a simple and straightforward model named the vanilla compositional network (VCN) by coupling convolutional neural network with a sequence modeling architecture (i.e., a recurrent neural network or Transformer), which exploits the handwritten character’s previous contextual information. Although VCN performs much better than the previous state-of-the-art SOLHCCR models, it is a two-stage architecture in nature. It suffers from high fragility when confronting with poorly written characters such as sloppy writing, and missing or broken strokes, due to relying heavily on contextual information. To improve the robustness of the OLHCCR model, we further propose a novel deep spatial & contextual information fusion network (DSCIFN). It utilizes an autoregresssive framework pre-trained on a large-scale sentence corpora as the backbone component, and highly integrates the spatial features of handwritten characters and their previous contextual information in a multi-layer fusion module. To verify the effectiveness of models, we reorganize a new form of online Chinese handwritten character with its previous context dataset, named OHCCC. Extensive experimental results demonstrate that DSCIFN achieves state-of-the-art performance and has increased strong robustness compared to VCN and previous SOLHCCR models. The in-depth empirical analysis and case study indicate that DSCIFN can significantly improve the efficiency of handwriting input because it does not need complete strokes to recognize a handwritten Chinese character precisely.

OCR with a Convolutional Neural Networks Integration Model in Machine Vision

A Convolutional Recurrent Neural-Network-Based Machine Learning for Scene Text Recognition Application

Processing and Recognition of Characters Image in Complex Environment

OCR using CRNN: A Deep Learning Approach for Text Recognition

Deep Learning-Based Multifunctional End-to-End Model for Optical Character Classification and Denoising

Efficient, Lexicon-Free OCR using Deep Learning

General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

A Deep Learning Technology based OCR Framework for Recognition Handwritten Expression and Text

Gated Recurrent Convolution Neural Network for Ocr

Multilingual Interoperation in Cross-Country Industry 4.0 System for One Belt and One Road

Implementation of OCR using Convolutional Neural Network (CNN): A Survey

Optical Character Detection and Recognition for Image-Based in Natural Scene

OCR is All you need: Importing Multi-Modality into Image-based Defect Detection System

High-Performance Ocr On Packing Boxes In Industry Based On Deep Learning

Embossed Characters Enhancement Based on Convolutional Neural Network

UPOCR: Towards Unified Pixel-Level OCR Interface

Consecutive Convolutional Activations for Scene Character Recognition

TrOCR: Transformer-Based Optical Character Recognition with Pre-trained Models

Medical Image Character Recognition Based on Multi-scale Neural Convolutional Network

Fast and Robust Online Handwritten Chinese Character Recognition with Deep Spatial and Contextual Information Fusion Network

SuperOCR: A Conversion from Optical Character Recognition to Image Captioning