Design of a two-stage ASCII recognizer for the case-sensitive inputs in handwritten and gesticulation mode of the text-entry interface

Anish Monsley Kirupakaran,Kuldeep Singh Yadav,Naragoni Saidulu,Saharul Alom Barlaskar,Rabul Hussain Laskar
DOI: https://doi.org/10.1007/s11042-024-18261-5
IF: 2.577
2024-02-17
Multimedia Tools and Applications
Abstract:Unlike real-world objects, which remain the same irrespective of the changes in size on a fixed/varying scale, few alphanumeric characters become identical because of their case-ambiguous nature. Recognizing alphabets becomes further complex when different characters are gesticulated/handwritten with the same pattern or become similar due to the gesture/handwriting style. The generalization ability of 2D deep convolutional neural networks (DCNN) results in the misclassification of these characters. To overcome this, we propose a two-stage recognition model that comprises DCNN, advisor unit (AU), and conflict-resolving post-decision support modules (PDSM-I and PDSM-II). PDSM-I is an artificial neural network trained on the proposed handcrafted features engineered to capture size-variant and size-invariant characteristics. PDSM-II is a two-stream 1D convolutional neural network (Ca1DNet) designed to capture the learned descriptors from the convolutions of 1D kernels. Using four state-of-the-art DCNN models, experimentation is carried out on two bases: (i) the unmerged; (ii) the merged data of EMNIST, NIST handwritten, and NITS hand gesture databases. These experiments help to show the fall/gap in recognition rate because of the case-sensitive characters in unmerged data conditions. PDSM-I with DCNN results in an average accuracy of EMNIST – 87.89%; NIST – 91.63%; NITS – 96.60% over the existing DCNN models' performance in unmerged conditions. Similarly, when PDSM-II gets used with DCNN, an average accuracy of 87.96% (EMNIST), 91.79% (NIST), and 96.91% (NITS) is achieved. These results suggest that case-sensitive characters from handwritten/gestures can be recognized, provided the peripheral information of these characters is preserved.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering
What problem does this paper attempt to address?