Abstract: Early diagnosis of disease is crucial because it helps doctors and patients devise a treatment plan. Consequently, the use of artificial intelligence-based deep learning techniques for recognizing medical images has recently increased. The Capsule Network (CapsNet) is a promising approach for visual tasks because it preserves spatial relationships better than convolutional neural networks (CNNs). However, CapsNet's performance is limited on images with complex backgrounds. The traditional CapsNet adopts a standalone convolution (SC) as a feature extractor, a Softmax function to normalize the coupling coefficients, and a dynamic routing procedure in which active capsules make predictions that lead to the activation of higher-level capsules. The SC is not an effective feature extractor, and Softmax impedes capsules from distributing optimal coupling coefficients during routing. This paper proposes a CapsNet architecture called SqueezeCapsNet that integrates SqueezeNet and CapsNet to achieve effective feature extraction with fewer parameters. A new squash function, the parametric squash function (PSF), is proposed to reduce non-informative capsules and promote discriminative capsules. To the best of our knowledge, we are the first to integrate SqueezeNet into CapsNet. We evaluate our framework on two medical image datasets: Brain tumor and Lung & Colon cancer. Additionally, datasets with varied backgrounds (MNIST, fashion-MNIST, and CIFAR-10) were used to evaluate the robustness and generalizability of the model. SqueezeCapsNet achieves 94.85%, 99.76%, 99.87%, 93.49%, and 82.45% on the Brain tumor, Lung & Colon Cancer, MNIST, fashion-MNIST, and CIFAR-10 datasets, respectively. Experimental results show that the proposed architecture's compression techniques substantially reduce the number of parameters while enhancing stability and accuracy across all the evaluation metrics. Our results show that our method improves CapsNet and can be adopted as a computer-aided diagnostic method to support medical image diagnosis.
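For readers unfamiliar with the baseline the abstract criticizes, the following is a minimal NumPy sketch of the traditional CapsNet routing step: Softmax-normalized coupling coefficients, the standard squash function, and iterative routing by agreement. It is not the SqueezeCapsNet implementation, and it does not include the parametric squash function (PSF), whose form is not given in the abstract. The function names, tensor shapes, and the choice of three routing iterations are illustrative assumptions.

```python
import numpy as np


def squash(s, axis=-1, eps=1e-8):
    """Standard CapsNet squash: keeps direction, maps vector length into [0, 1)."""
    sq_norm = np.sum(s ** 2, axis=axis, keepdims=True)
    scale = sq_norm / (1.0 + sq_norm)
    return scale * s / np.sqrt(sq_norm + eps)


def softmax(x, axis):
    """Numerically stable softmax along the given axis."""
    x = x - np.max(x, axis=axis, keepdims=True)
    e = np.exp(x)
    return e / np.sum(e, axis=axis, keepdims=True)


def dynamic_routing(u_hat, num_iters=3):
    """Routing by agreement (baseline CapsNet, not the proposed method).

    u_hat: prediction vectors of shape (num_in, num_out, dim_out), i.e. what
           each lower-level capsule predicts for each higher-level capsule.
    Returns the higher-level capsule outputs v of shape (num_out, dim_out)
    and the coupling coefficients c of shape (num_in, num_out) used to
    compute them.
    """
    num_in, num_out, _ = u_hat.shape
    b = np.zeros((num_in, num_out))  # routing logits start at zero
    for _ in range(num_iters):
        # Coupling coefficients: each input capsule's weights sum to 1
        # across the output capsules.
        c = softmax(b, axis=1)
        # Weighted sum of predictions for each output capsule.
        s = np.sum(c[..., None] * u_hat, axis=0)
        v = squash(s)
        # Agreement (dot product) between predictions and outputs
        # increases the corresponding logits.
        b = b + np.sum(u_hat * v[None, :, :], axis=-1)
    return v, c


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    u_hat = rng.normal(size=(6, 3, 4))  # 6 input capsules, 3 output capsules, 4-D
    v, c = dynamic_routing(u_hat)
    print("output capsule lengths:", np.linalg.norm(v, axis=-1))
    print("coupling rows sum to:", c.sum(axis=1))
```

Because Softmax forces every input capsule to distribute a total weight of exactly 1 across all output capsules, even weakly agreeing capsules receive non-zero coupling. This is the behavior the abstract identifies as a limitation of the baseline.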

Self-Attention Capsule Networks for Object Classification

Research on a Capsule Network Text Classification Method with a Self-Attention Mechanism

Separable Attention Capsule Network for Signal Classification

Attentive Octave Convolutional Capsule Network for Medical Image Classification

Multi-Lane Capsule Network for Classifying Images With Complex Background

Adaptive Capsule Network

CACNN: Capsule Attention Convolutional Neural Networks for 3D Object Recognition.

RS-CapsNet: an Advanced Capsule Network.

A Context-aware Capsule Network for Multi-label Classification

Multi-branch RA Capsule Network and Its Application in Image Classification

CapsNet comparative performance evaluation for image classification

A lightweight capsule network via channel-space decoupling and self-attention routing

SqueezeCapsNet: enhancing capsule networks with squeezenet for holistic medical and complex images

Capsule Network Performance on Complex Data

TTDCapsNet: Tri Texton-Dense Capsule Network for complex and medical image recognition

SA-CapsGAN: Using Capsule Networks with Embedded Self-Attention for Generative Adversarial Network

Few-shot Fine-Grained Classification with Spatial Attentive Comparison

Feature Amplification Capsule Network for Complex Images.

Modified Capsule Network For Object Classification

Patch-Based Capsule Network for Complex Images

Diverse Capsules Network Combining Multiconvolutional Layers for Remote Sensing Image Scene Classification