Capsule Dynamic Network-Based Object Detection Algorithm

Hong Zhang, Qiang Zhi, Bongyuan Xue, Yiwen Fu, Qing Zhang, Lingfei Han, Mengyan Guo
DOI: https://doi.org/10.1109/SSCI44817.2019.9002884
2019-01-01
Abstract: In traditional convolutional neural networks, the convolution-pooling units cannot model geometric transformations, so the spatial hierarchy among features is lost. This information, however, is valuable for representing image content. When detecting objects that carry spatial location information, the extracted object features are therefore incomplete, which degrades the performance of the object detection algorithm. To solve this problem, we propose the Capsule Dynamic Single Shot MultiBox Detector (CD-SSD) object detection algorithm. The method replaces some of the convolution-pooling layers of the original Single Shot MultiBox Detector (SSD) framework with a dynamically routable capsule layer that learns spatial position feature offsets, so that the convolution units can learn the positional relationships between different features through dynamic routing, thereby improving detection performance. The algorithm is validated on the VOC2007 dataset. The experimental results and analysis show that the improved algorithm achieves 79.1% mean average precision (mAP) on the VOC2007 test set, 4.6 points above the 74.5% of the original SSD algorithm, at a detection speed of 55 frames per second (FPS), slightly lower than the 59 FPS of the original SSD.
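The abstract does not spell out how the capsule layer routes information, so the following is only a minimal sketch, assuming the routing-by-agreement procedure commonly used in capsule networks. The function names (`squash`, `dynamic_routing`), the tensor `u_hat`, the shapes, and the three routing iterations are illustrative assumptions, not details taken from the paper.

```python
import numpy as np

def squash(s, axis=-1, eps=1e-8):
    """Shrink each vector so its norm lies in [0, 1) while keeping its direction."""
    sq_norm = np.sum(s * s, axis=axis, keepdims=True)
    return (sq_norm / (1.0 + sq_norm)) * s / np.sqrt(sq_norm + eps)

def dynamic_routing(u_hat, num_iters=3):
    """Route lower-level capsule predictions to higher-level capsules.

    u_hat: array of shape (num_in, num_out, dim_out), the prediction each
           input capsule makes for each output capsule (assumed given).
    Returns the output capsule vectors and the coupling coefficients
    used in the final iteration.
    """
    num_in, num_out, _ = u_hat.shape
    b = np.zeros((num_in, num_out))  # routing logits, start uniform
    for _ in range(num_iters):
        # Softmax over output capsules: each input capsule splits its vote.
        e = np.exp(b - b.max(axis=1, keepdims=True))
        c = e / e.sum(axis=1, keepdims=True)
        # Weighted sum of predictions for each output capsule.
        s = np.einsum("ij,ijd->jd", c, u_hat)
        v = squash(s)
        # Raise logits where a prediction agrees with the output vector.
        b = b + np.einsum("ijd,jd->ij", u_hat, v)
    return v, c

# Hypothetical sizes: 8 input capsules, 4 output capsules, 16-D outputs.
rng = np.random.default_rng(0)
u_hat = rng.normal(size=(8, 4, 16))
v, c = dynamic_routing(u_hat)
```

In this sketch the length of each output vector in `v` can be read as a presence score, while the coupling coefficients in `c` show which input capsules were routed to which output capsule. How CD-SSD wires such a layer into the SSD feature maps is not described in the abstract.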