Swin-ResNet: Research and Implementation of a Tooth-Marked Tongue Classification Method Combining ResNet-50 and Swin Transformer

Xinshen Zhao,Bo Zhao,Qiyuan Zhang,Rong Li,Xiangyong Kong,Ping Wang
DOI: https://doi.org/10.1145/3644116.3644271
2023-01-01
Abstract:Tongue diagnosis is an integral part of diagnosis by observation in Chinese medicine, and the tooth mark feature in the tongue image is an important objective indicator for the diagnosis of spleen deficiency. In order to solve the problem that the features of tooth marks are difficult to be recognized by naked eyes and reduce the error of subjective judgment, this paper designed a deep learning-based classification model for detecting and classifying tooth-marked tongues. By embedding the Swin-T module within the Resnet residual structure, we proposed and implemented a neural network integrating Resnet and Swin-Transformer (Swin-Resnet), which first down-sampled the input image by 7×7 convolution and max pooling, and then subsequently went through a stack of the improved Swin-Resnet modules for feature extraction, and finally a fully connected layer was utilized to complete high-precision classification of tooth-marked tongues. Meanwhile, we improved the proposed Swin-Resnet in terms of model running speed by transforming the convolutional layer of the original Resnet residual block into a convolutional block with smaller parameters and fewer network parameters while ensuring the network performance. Comparison experiments with other algorithms were conducted with sample datasets and the results of the experiments were evaluated, and Swin-Resnet achieved an average accuracy of 0.9959 for the three classifications, and the accuracies of 0.9832, 0.9792, and 0.9890 for each type of tooth-marked tongue classifications of Lightly Tooth, No Tooth, and Severe Tooth, respectively. The experiments showed that the classification method can identify dentition features with higher accuracy, which was important for improving the accuracy of objectivized analysis of tongue diagnosis and practical application in healthcare.
What problem does this paper attempt to address?