Abstract:With the development of deep learning, convolutional neural networks (CNNs) and Transformer-based methods have become key techniques for medical image classification tasks. However, many current neural network models have problems such as high complexity, a large number of parameters, and large model sizes; such models obtain higher classification accuracy at the expense of lightweight networks. Moreover, such larger-scale models pose a great challenge for practical clinical applications. Meanwhile, Transformer and multi-layer perceptron (MLP) methods have some shortcomings in terms of local modeling capability and high model complexity, and need to be used on larger datasets to show good performance. This makes it difficult to utilize these networks in clinical medicine. Based on this, we propose a lightweight and efficient pure CNN network for medical image classification (Eff-PCNet). On the one hand, we propose a multi-branch multi-scale CNN (M2C) module, which divides the feature map into four parallel branches along the channel dimensions by a certain scale factor and carries out a deep convolution operation using different scale convolution kernels, and this multi-branch multi-scale operation effectively replaces the large kernel convolution. This multi-branch multi-scale operation effectively replaces the large kernel convolution. It reduces the computational cost of the module while fusing the feature information between different channels and thus obtains richer feature information. Finally, the four feature maps are then spliced along the channel dimensions to fuse the multi-scale and multi-dimensional feature information. On the other hand, we introduce the structural reparameterization technique and propose the structural reparameterized CNN (Rep-C) module. Specifically, it utilizes multiple linear operators to generate different feature maps during the training process and fuses all the participants into one through parameter fusion to achieve fast inference while providing a more effective solution for feature reuse. A number of experimental results show that our Eff-PCNet performs better than current methods based on CNN, Transformer, and MLP in the classification of three publicly available medical image datasets. Among them, we achieve 87.4% Acc on the HAM10000 dataset, 91.06% Acc on the SkinCancer dataset, and 97.03% Acc on the Chest-Xray dataset. Meanwhile, our approach achieves a better trade-off between the number of parameters; computation; and other performance metrics as well.

Multi-Instance Multi-Scale CNN for Medical Image Classification

Multi-Branch Cnn and Multi-Scale Multi-Dimensional Feature Fusion Mlp for Medical Image Classification

DM-CNN: Dynamic Multi-scale Convolutional Neural Network with uncertainty quantification for medical image classification

Hybrid multiple instance learning network for weakly supervised medical image classification and localization

A Dense R-CNN Multi-Target Instance Segmentation Model and Its Application in Medical Image Processing

Multi-scale Feature Fusion Convolutional Neural Network for Multi-Modal Medical Image Fusion.

An Explainable Multi-Instance Multi-Label Classification Model for Full Slice Brain CT Images

Instance importance-Aware graph convolutional network for 3D medical diagnosis

Medical Image Classification Based on Multi-Scale Non-Negative Sparse Coding

Multi-View Convolutional Neural Networks for Mammographic Image Classification

Deep learning of feature representation with multiple instance learning for medical image analysis

Neural Networks Model Based on an Automated Multi-Scale Method for Mammogram Classification.

Multi-scale region selection network in deep features for full-field mammogram classification

Self-paced Convolutional Neural Network for Computer Aided Detection in Medical Imaging Analysis

An Efficient and Robust 3D Medical Image Classification Approach Based on 3D CNN, Time‐Distributed 2D CNN‐BLSTM Models, and mRMR Feature Selection

Real-Time Medical Image Classification with ML Framework and Dedicated CNN–LSTM Architecture

Multi-Center Brain Imaging Classification Using a Novel 3D CNN Approach.

Eff-PCNet: an Efficient Pure CNN Network for Medical Image Classification

Multi-Scale Mixed Attention Network for CT and MRI Image Fusion

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning