Vision transformer for contactless fingerprint classification
Kaplesh, Pooja,Gupta, Aastha
DOI: https://doi.org/10.1007/s11042-024-20396-4
IF: 2.577
2024-11-08
Multimedia Tools and Applications
Abstract:Contactless fingerprint recognition has gained attention due to its data security and hygiene as compared to the contact-based counterpart. In the previous years, deep learning models, mainly Convolutional Neural Networks (CNNs), have been extensively applied in this field and have given promising results. Recently, their successor, Vision Transformers (ViTs) have outperformed CNNs in various computer vision tasks due to their reliance on self-attention mechanism that focusses on capturing global and context-aware relationships between the image patches without inherent spatial bias. Due to the underlying advantageous and innovative feature extraction process, we propose to adapt ViT model for contactless fingerprint classification. To the best of our knowledge, this study would be first of its kind that applies ViT for contactless fingerprint classification. The performance of the model when evaluated on publicly available datasets, namely ISPFDv1, ISPFDv2, and UNFIT, results in average accuracies of 96%, 92%, and 91%, respectively. The comparative analysis with state-of-the-art CNN models using equal error rate (EER) metric shows that the proposed model outperforms them in classifying contactless fingerprints with a significant margin. Additionally, the Local Interpretable Model-agnostic Explanations (LIME) technique is integrated with the proposed model to enhance interpretability and explainability of the predictions.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering