SqueezeCapsNet: enhancing capsule networks with squeezenet for holistic medical and complex images
Kwabena Adu,Joojo Walker,Patrick Kwabena Mensah,Mighty Abra Ayidzoe,Michael Opoku,Samuel Boateng
DOI: https://doi.org/10.1007/s11042-023-15089-3
IF: 2.577
2023-05-16
Multimedia Tools and Applications
Abstract:Early diagnosis of patients' disease is crucial since it helps doctors and patients devise a treatment plan. Therefore, recognizing medical images using Artificial intelligence-based deep learning techniques has recently increased. Capsule Network (CapsNet) has promising methods in visual tasks due to its ability to keep a high relationship of spatial information compared to convolutional neural networks (CNNs). However, CapsNet faces a critical problem with a complex image background that limits its performance. The traditional CapsNet adopts a standalone convolution (SC) as a feature extractor, Softmax function for normalization of coupling coefficient, and dynamic routing procedure to allow active capsules to perform predictions leading to activation of high-level capsules. The SC is not an effective feature extractor, and SoftMax impedes capsules from distributing optimal coupling coefficient during routing. This paper proposes a CapsNet architecture called SqueezeCapsNet that integrates SqueezeNet and CapsNet to achieve effective feature extraction and fewer parameters. A new squash function named parametric squash function (PSF) was proposed to reduce non-informative capsules and promote discriminative capsules. To the best of our knowledge in literature, we are the first to integrate SqueezeNet into CapsNet. We evaluate our framework on two medical image datasets; Brain tumor and Lung & Colon cancer datasets. Additionally, datasets with varied backgrounds; MNIST, fashion-MNIST, CIFAR-10 were used to evaluate the robustness and generalizability of the model. The SqueezeCapsNet produces 94.85%, 99.76%, 99.87 % , 93.49 % , and 82.45 % on Brain tumor, Lung & Colon Cancer, MNIST, fashion-MNIST, and CIFAR-10 datasets, respectively. Experimental results show that the proposed architecture's compression techniques significantly provide fewer parameters while enhancing stability and accuracy across all the evaluation metrics. Our results show that our method improves CapsNet and can be adopted as a computer-aided diagnostic method to support the diagnosis of medical image tasks.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering