SimSANet: a simple sequential attention-aided deep neural network for vehicle make and model recognition

Soumyajit Gayen,Sourajit Maity,Pawan Kumar Singh,Ram Sarkar
DOI: https://doi.org/10.1007/s00521-024-10480-z
2024-11-25
Neural Computing and Applications
Abstract:Vehicle make and model recognition (VMMR) is a pivotal task for developing automatic vehicle recognition systems. In recent decades, this field has attracted significant attention from the computer vision and artificial intelligence communities. Previous research heavily emphasized improving recognition by focusing on implementing different types of attention mechanisms. The attention mechanism has demonstrated its effectiveness in considering features and uncovering distinctive local and global intricacies. However, one significant issue with this approach is that it increases complexity, which leads to a costly and pointless computational burden. To this end, we introduce a deep neural network model, called simple sequential attention network, which concurrently blends a sequential multi-kernel approach to achieve a trade-off between complexity and performance. This method reduces the computational load and adopts a faster approach while efficiently capturing essential information from feature maps of different scales, from local to global. To demonstrate the effectiveness of the proposed method, we conduct experiments on a variety of publicly accessible VMMR datasets, including Stanford Cars, Comprehensive Cars (CompCars), Comprehensive Cars Surveillance Nature (CompCarsSV), and the Vehicle Images dataset. The suggested approach performs better in the vehicle make and model recognition task than the most advanced models. With 94.47% accuracy on Stanford Cars, 98.34% on CompCars, 99.20% on CompCarsSV, and 97.20% on the Vehicle Images dataset, our model achieves state-of-the-art performance. The implementation details with the source code can be found at: https://github.com/JUVCSE/SIMSANET.
computer science, artificial intelligence
What problem does this paper attempt to address?