CAM: A fine-grained vehicle model recognition method based on visual attention model

Ye Yu,Longdao Xu,Wei Jia,Wenjia Zhu,Yunxiang Fu,Qiang Lu
DOI: https://doi.org/10.1016/j.imavis.2020.104027
IF: 3.86
2020-12-01
Image and Vision Computing
Abstract:<p>Vehicle model recognition (VMR) is a typical fine-grained classification task in computer vision. To improve the representation power of classical CNN networks for this special task, we focus on enhancing the subtle difference of features and their spatial encoding based on the attention mechanism, and then propose a novel architectural unit, which we term the "convolutional attention model" (CAM). It adopts a two-stage attention mechanism for VMR, which includes the global feature map attention (GFMA) algorithm, applied at the lower part of the main network flow to enhance the subtle feature difference from the beginning, and the feature spatial relationship attention (FSRA) algorithm, applied at the higher part to enhance the spatial relationship of features. The experiments are conducted on the benchmark CompCars web-nature and Stanford Car datasets and demonstrate the effectiveness of CAM when integrated with some classical CNN architectures. CAM can improve the top-1 recognition accuracy by an average of 1.15% and top-5 by an average of 0.78%.</p>
computer science, artificial intelligence, theory & methods,engineering, electrical & electronic, software engineering,optics
What problem does this paper attempt to address?