Abstract:Handwriting is controlled by neurons in the brain’s nervous system, reflecting an individual’s personality and psychology. This unique characteristic can be used for various applications, including user authentication, assessment of neurodegenerative disorders, and classification of handedness, gender, and age groups. Traditional authentication systems require memorization, information leakage, and fingerprints, making them vulnerable to security breaches. The majority of researchers have studied the limitations of image quality, camera frames, and light effects on text and image-dependent performance. Therefore, this paper mainly focused on real-time, text-independent handwriting fine-motor data and proposed an efficient authentication system with low cost using efficient feature extraction and optimal feature selection approaches. This research utilizes two benchmark databases, including the handwriting data of 48 (24+24) participants collected via a sensor-based pen tablet. Each participant wrote the 10 words five times repeatedly, making it a total of 2400 samples. The handwriting classification of the different individuals is in 3 phases: feature extraction, feature selection, and classification. A total of 91 features (statistical, kinematic, spatial, and composite) were extracted from more accurate, real-time numerical handwriting data. The efficient and optimal features have been selected using four feature selection approaches, namely, Pearson’s r correlation, ANOVA-F, Mutual Information Gain, and PCA, among which the ANOVA-F test and PCA perform well for handwriting-extracted data. Then, 14 machine learning (ML) models and 7 deep learning (DL) models were applied to handle the problem of individual classification, with both no- and full-feature-selection scenarios considered. The experimental analysis has been conducted with different angles and perspectives, such as K-Fold cross-validation, testing system efficiency considering 5/10/15/24/48 individuals, and in the case of individual tasks. It shows that ML-based algorithms, namely, CATBOOST (99.07%) with ANOVA-F and DL-based models, namely, BiLSTM (98.31%) with PCA-selected features, provide the highest accuracy with dataset 2, among others that advocate the practicality and reliability of choosing this system for user identification.

Self-supervised Data Bootstrapping for Deep Optical Character Recognition of Identity Documents

Large-Scale Printed Chinese Character Recognition for ID Cards Using Deep Learning and Few Samples Transfer Learning

DocFace: Matching ID Document Photos to Selfies

DocFace+: ID Document to Selfie Matching

Weakly Supervised Training for Hologram Verification in Identity Documents

Offline Handwriting Signature Verification: A Transfer Learning and Feature Selection Approach

Efficient, Lexicon-Free OCR using Deep Learning

3D Rendering Framework for Data Augmentation in Optical Character Recognition

An Intelligent Hybrid Model for Identity Document Classification

IDTrust: Deep Identity Document Quality Detection with Bandpass Filtering

Face Detection in Camera Captured Images of Identity Documents under Challenging Conditions

DocXPand-25k: a large and diverse benchmark dataset for identity documents analysis

Making Old Kurdish Publications Processable by Augmenting Available Optical Character Recognition Engines

Self-supervised Character-to-Character Distillation for Text Recognition

Dynamics of Digital Pen-Tablet: Handwriting Analysis for Person Identification Using Machine and Deep Learning Techniques

Identifying People's Faces in Smart Banking Systems Using Artificial Neural Networks

Indonesian ID Card Extractor Using Optical Character Recognition and Natural Language Post-Processing

TextCaps : Handwritten Character Recognition with Very Small Datasets

Advanced Digital Image Processing Technique based Optical Character Recognition of Scanned Document

An Intelligent Knowledge Extraction Framework For Recognizing Identification Information From Real-World Id Card Images

Deep Self-Taught Learning for Handwritten Character Recognition