A deep image classification model based on prior feature knowledge embedding and application in medical diagnosis

Chen Xu,Jiangxing Wu,Fan Zhang,Jonathan Freer,Zhongqun Zhang,Yihua Cheng
DOI: https://doi.org/10.1038/s41598-024-63818-x
IF: 4.6
2024-06-11
Scientific Reports
Abstract:Aiming at the problem of image classification with insignificant morphological structural features, strong target correlation, and low signal-to-noise ratio, combined with prior feature knowledge embedding, a deep learning method based on ResNet and Radial Basis Probabilistic Neural Network (RBPNN) is proposed model. Taking ResNet50 as a visual modeling network, it uses feature pyramid and self-attention mechanism to extract appearance and semantic features of images at multiple scales, and associate and enhance local and global features. Taking into account the diversity of category features, channel cosine similarity attention and dynamic C-means clustering algorithms are used to select representative sample features in different category of sample subsets to implicitly express prior category feature knowledge, and use them as the kernel centers of radial basis probability neurons (RBPN) to realize the embedding of diverse prior feature knowledge. In the RBPNN pattern aggregation layer, the outputs of RBPN are selectively summed according to the category of the kernel center, that is, the subcategory features are combined into category features, and finally the image classification is implemented based on Softmax. The functional module of the proposed method is designed specifically for image characteristics, which can highlight the significance of local and structural features of the image, form a non-convex decision-making area, and reduce the requirements for the completeness of the sample set. Applying the proposed method to medical image classification, experiments were conducted based on the brain tumor MRI image classification public dataset and the actual cardiac ultrasound image dataset, and the accuracy rate reached 85.82% and 83.92% respectively. Compared with the three mainstream image classification models, the performance indicators of this method have been significantly improved.
multidisciplinary sciences
What problem does this paper attempt to address?
The paper attempts to address challenges in image classification such as insignificant morphological structural features, strong target correlation, and low signal-to-noise ratio. Specifically, these issues are particularly prominent in medical image classification because medical images often have blurred structural boundaries and intertwined multiple organs and lesions. This leads to insufficient attention to local key features by existing methods, ignoring potentially important features, thereby reducing the generalization ability and robustness of classification models. To tackle these challenges, the paper proposes a deep learning method based on prior feature knowledge embedding, combining ResNet and Radial Basis Probabilistic Neural Network (RBPNN). This method addresses the aforementioned issues through the following points: 1. **Feature Extraction and Enhancement**: - Using ResNet50 as the visual modeling network, it utilizes feature pyramids and self-attention mechanisms to extract appearance and semantic features of images from multiple scales, associating and enhancing local and global features. - Introducing channel cosine similarity attention and dynamic C-means clustering algorithm, it selects representative sample features from different category subsets, implicitly expressing prior category feature knowledge, and uses them as the kernel centers of Radial Basis Probabilistic Neurons (RBPN) to achieve diversified prior feature knowledge embedding. 2. **Pattern Aggregation and Classification**: - In the pattern aggregation layer of RBPNN, it selectively sums the outputs of RBPN based on the kernel centers' categories, merging subcategory features into category features, and finally achieves image classification based on Softmax. - In this way, the method can form non-convex decision regions, reduce the requirement for the completeness of the sample set, and improve the accuracy of image classification. 3. **Small Sample Set Modeling**: - This method is particularly suitable for modeling small sample sets, maintaining good generalization ability in cases of limited data. The paper validates the effectiveness of this method through experiments on public datasets of brain tumor MRI image classification and actual cardiac ultrasound image datasets, achieving accuracies of 85.82% and 83.92%, respectively, which show significant improvement compared to mainstream image classification model performance metrics.