CapsNet meets SIFT: A robust framework for distorted target categorization

Zhongqi Lin,Wanlin Gao,Jingdun Jia,Feng Huang
DOI: https://doi.org/10.1016/j.neucom.2021.08.087
IF: 6
2021-01-01
Neurocomputing
Abstract:•We realize distorted target recognition by presenting a CapsNetSIFT architecture (cf. Fig. 1). It can simultaneously enable accurate positioning of interest regions and comprehensive learning of discriminative features whilst boasting invariance to visual distortions.•We propose a parallel MD-CapsNet in CapsNetSIFT for boosting representability. The customized structure, hyperparameter setting, and dynamic routing agreement have been demonstrated.•We propose a VM-SIFT in CapsNetSIFT to establish correspondence among capsule encoding vectors of standard images and distorted ones. This may represent the first attempt to realize feature matching of capsule vectors.•We conduct quantitative experiments of distorted target categorization on four benchmarks (CUB-200-2011, Stanford Dogs, Stanford Cars, and our hand-crafted rice growth dataset). Evaluation results reveal our advantage over state-of-the-arts.
What problem does this paper attempt to address?