Abstract:Capsule networks (CapsNets) were introduced to address convolutional neural networks limitations, learning object-centric representations that are more robust, pose-aware, and interpretable. They organize neurons into groups called capsules, where each capsule encodes the instantiation parameters of an object or one of its parts. Moreover, a routing algorithm connects capsules in different layers, thereby capturing hierarchical part-whole relationships in the data. This thesis investigates the intriguing aspects of CapsNets and focuses on three key questions to unlock their full potential. First, we explore the effectiveness of the routing algorithm, particularly in small-sized networks. We propose a novel method that anneals the number of routing iterations during training, enhancing performance in architectures with fewer parameters. Secondly, we investigate methods to extract more effective first-layer capsules, also known as primary capsules. By exploiting pruned backbones, we aim to improve computational efficiency by reducing the number of capsules while achieving high generalization. This approach reduces CapsNets memory requirements and computational effort. Third, we explore part-relationship learning in CapsNets. Through extensive research, we demonstrate that capsules with low entropy can extract more concise and discriminative part-whole relationships compared to traditional capsule networks, even with reasonable network sizes. Lastly, we showcase how CapsNets can be utilized in real-world applications, including autonomous localization of unmanned aerial vehicles, quaternion-based rotations prediction in synthetic datasets, and lung nodule segmentation in biomedical imaging. The findings presented in this thesis contribute to a deeper understanding of CapsNets and highlight their potential to address complex computer vision challenges.

CapsNet based on Encoder and Decoder for Object Detection

Deformable Capsules for Object Detection

Modified Capsule Network For Object Classification

RS-CapsNet: an Advanced Capsule Network.

Adaptive Capsule Network

A Context-aware Capsule Network for Multi-label Classification

Orthogonal Capsule Networks With Positional Information Preservation and Lightweight Feature Learning

Ddrm-Capsnet: Capsule Network Based On Deep Dynamic Routing Mechanism For Complex Data

DR-CapsNet with CAEMRA: Looking deep inside instance for boosting object detection effect

A lightweight capsule network via channel-space decoupling and self-attention routing

FSC-CapsNet: Fractionally-Strided Convolutional Capsule Network for Complex Data.

DE-CapsNet: A Diverse Enhanced Capsule Network with Disperse Dynamic Routing

VideoCapsuleNet: A Simplified Network for Action Detection

IOP-CapsNet with ISEMRA: Fetching part-to-whole topology for improving detection performance of articulated instances

Mamba Capsule Routing Towards Part-Whole Relational Camouflaged Object Detection

Hierarchical Object-Centric Learning with Capsule Networks

DeepCaps: Going Deeper With Capsule Networks

Capsule Networks With Residual Pose Routing

Capsule Network Performance on Complex Data

RGB-D salient object detection via convolutional capsule network based on feature extraction and integration

Spiking CapsNet: A spiking neural network with a biologically plausible routing rule between capsules