A recurrent neural network based deep learning model for text and non-text stroke classification in online handwritten Devanagari document

Rajib Ghosh
DOI: https://doi.org/10.1007/s11042-022-12767-6
IF: 2.577
2022-03-19
Multimedia Tools and Applications
Abstract:It is very common for human beings to include both text and non-text data in a handwritten document. Text portion contains alphabets, digits, and mathematical symbols, whereas non-text portion includes various graphical entities like flow chart, transition diagram, etc. This paper proposes a novel method for online handwritten text and non-text stroke classification with text written in Devanagari script, the most popular script in India, using two different architectures of artificial Recurrent Neural Network (RNN)—long-short term memory (LSTM) and bidirectional long-short term memory (BLSTM). In the present work, the classifier classifies an ink as text or non-text stroke when a sequence of strokes of any online handwritten document of both text and non-text data is presented to it. Various structural and directional features related to online handwriting have been extracted from the basic strokes of text and non-text data. The system has been trained in both LSTM and BLSTM architecture based classification platforms. Experiment has also been performed using Convolutional Neural Network (CNN) to make a comparative performance analysis with RNN classifier based results. The classification performance of the present work has been evaluated using a self-generated dataset and it outperforms the CNN based results as well as the existing studies available in the literature in this regard.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering
What problem does this paper attempt to address?