Abstract:Current convolutional neural networks (CNNs) suffer a lack of viewpoint equivariance, namely, they tend to fail or perform poorly when dealing with viewpoints unseen during training procedures. CNN achieves invariance with the help of pooling and improves its performance in image classification tasks. However, the pooling operation does not necessarily improve viewpoint generalization, rather relying on more data to achieve viewpoint equivariance. Capsule network (CapsNet) is proposed to tackle this issue, but it is inefficient and inaccurate when applied to complex datasets. We propose a novel CapsNet architecture called Global Routing CapsNet (GR-CapsNet) to solve this problem. Specifically, colored background in the input image can generate invalid background voting capsules to reduce the performance of CapsNet. Therefore, we first construct a dynamic linear unit (DLU), which avoids the generation of invalid background voting capsules. Then we present two extra learnable units: frequency domain unit (FDU) and spatial unit (SPU). The former is used to capture finer features in the frequency domain and aims to improve classification performance on complex datasets. The latter is applied to construct the spatial relationship between the voting capsules and component capsules and aims to enhance robustness to affine transformation. Finally, we propose a global routing mechanism to simplify the routing process for CapsNet, which obtains more feature information to improve the performance of CapsNet. Extensive experiments on nine datasets show that our method obtains better robustness and generalization and achieves SOTA performance compared to other related methods. And it has fewer the number of parameters and GPU memory consumption than these related methods. The source code is available on https://github.com/cwpl/GR-CapsNet .

E-CapsGan: Generative Adversarial Network Using Capsule Network As Feature Encoder

SA-CapsGAN: Using Capsule Networks with Embedded Self-Attention for Generative Adversarial Network

Capsule GAN Using Capsule Network for Generator Architecture

CapsuleGAN: Generative Adversarial Capsule Network

Capsules Encoder and Capsgan for Image Inpainting

A Novel GAN Based on Progressive Growing Transformer with Capsule Embedding.

DeCapsGAN: Generative Adversarial Capsule Network for Image Denoising

Adversarial Capsule Learning for Network Embedding

Comparing Generative Adversarial Network Techniques for Image Creation and Modification

The Analysis Between Traditional Convolution Neural Network and CapsuleNet

Adaptive Capsule Network

Cv-CapsNet: Complex-Valued Capsule Network

Hybrid Gromov-Wasserstein Embedding for Capsule Learning

DE-CapsNet: A Diverse Enhanced Capsule Network with Disperse Dynamic Routing

DeepCaps: Going Deeper With Capsule Networks

RS-CapsNet: an Advanced Capsule Network.

DuCaGAN: Unified Dual Capsule Generative Adversarial Network for Unsupervised Image-to-Image Translation

P-CapsNets: a General Form of Convolutional Neural Networks

Dual-Channel Capsule Generation Adversarial Network for Hyperspectral Image Classification.

Subspace Capsule Network

Global routing between capsules