Abstract:Despite Convolutional Neural Networks (CNNs) based approaches have been successful in objects detection, they predominantly focus on positioning discriminative regions while overlooking the internal holistic part-whole associations within objects. This would ultimately lead to the neglect of feature relationships between object and its parts as well as among those parts, both of which are significantly helpful for detecting discriminative parts. In this paper, we propose to "look insider the objects" by digging into part-whole feature correlations and take the attempts to leverage those correlations endowed by the Capsule Network (CapsNet) for robust object detection. Actually, highly correlated capsules across adjacent layers share high familiarity, which will be more likely to be routed together. In light of this, we take such correlations between different capsules of the preceding training samples as an awareness to constrain the subsequent candidate voting scope during the routing procedure, and a Feature Correlation-Steered CapsNet (FCS-CapsNet) with Locally-Constrained Expectation-Maximum (EM) Routing Agreement (LCEMRA) is proposed. Different from conventional EM routing, LCEMRA stipulates that only those relevant low-level capsules (parts) meeting the requirement of quantified intra-object cohesiveness can be clustered to make up high-level capsules (objects). In doing so, part-object associations can be dug by transformation weighting matrixes between capsules layers during such "part backtracking" procedure. LCEMRA enables low-level capsules to selectively gather projections from a non-spatially-fixed set of high-level capsules. Experiments on VOC2007, VOC2012, HKU-IS, DUTS, and COCO show that FCS-CapsNet can achieve promising object detection effects across multiple evaluation metrics, which are on-par with state-of-the-arts.

Deformable Capsules for Object Detection

CapsNet based on Encoder and Decoder for Object Detection

Feature Correlation-Steered Capsule Network for Object Detection.

Capsule Dynamic Network-Based Object Detection Algorithm

Capsules for Biomedical Image Segmentation

CapsuleVOS: Semi-Supervised Video Object Segmentation Using Capsule Routing

DE-CapsNet: A Diverse Enhanced Capsule Network with Disperse Dynamic Routing

DeepCaps: Going Deeper With Capsule Networks

Scale-invariant Convolutional Capsule Network

Orthogonal Capsule Networks With Positional Information Preservation and Lightweight Feature Learning

Capsule Networks With Residual Pose Routing

DR-CapsNet with CAEMRA: Looking deep inside instance for boosting object detection effect

Capsule Network Performance on Complex Data

3D Capsule Networks for Object Classification with Weight Pruning.

Patch-Based Capsule Network for Complex Images

Hybrid Gromov-Wasserstein Embedding for Capsule Learning

A Novel Capsule Network Based on Multi-Order Descartes Extension Transformation

CapsNets Algorithm

Training Deep Capsule Networks with Residual Connections

Deep multi-prototype capsule networks

Enhancing Classification Efficiency in Capsule Networks Through Windowed Routing: Tackling Gradient Vanishing, Dynamic Routing, and Computational Complexity Challenges