Attention-shift Based Deep Neural Network for Fine-Grained Visual Categorization

Yi Niu,Yang Jiao,Guangming Shi
DOI: https://doi.org/10.1016/j.patcog.2021.107947
IF: 8
2021-01-01
Pattern Recognition
Abstract:•We re-investigate the pipeline of fine-grained visual categorization (FGVC) techniques from the view of human visual recognition system, and propose a novel Attention-Shift based Deep Neural Network (AS-DNN) for automatic parts locating and semantic correlation learning.•We propose an end-to-end trainable sub-network structure Csft to simulate the attention-shift process. Csft locates the discriminative regions automatically and encodes and decodes the semantic relations among diverse discriminative parts iteratively.•Comprehensive experiments show that AS-DNN achieves state-of-the-art performances in three widely used challenging datasets. Moreover, the visualization of located discriminative parts proves the robustness of AS-DNN in complex backgrounds and postures.
What problem does this paper attempt to address?