Enhancing Automatic Gastrointestinal Endoscopy Image Classification via Advanced Model and Adversarial Training Strategy.

Yi Luo, Wenhan Chen, Wei Chen, Fan Sun, Ming Liu
DOI: https://doi.org/10.1145/3625403.3625416
2023-01-01
Abstract: This study applies the CMT model, an advanced hybrid network based on the Transformer, to automatic gastrointestinal endoscopic image classification. The training process of the CMT model incorporates an adversarial training strategy that uses a network D to balance the distance between hidden features and real images. This strategy significantly reduces the interference of irrelevant features, allowing the model to focus on extracting key image characteristics and thereby improving classification performance. We constructed a confusion matrix and plotted the Receiver Operating Characteristic (ROC) curve for the model's output, from which we computed metrics such as accuracy and AUC to evaluate the classification model. Here, AUC is the area under the ROC curve, with larger values indicating better classification performance. In terms of both accuracy and AUC, our method outperforms traditional CNN and Visual Transformer models. However, some categories still exhibit classification errors because their image features are inherently similar. Even so, our findings highlight the potential of automated endoscopy exams and the supportive role of improved classification models in clinical decision-making. Future work will focus on expanding the datasets and optimizing the model structure to further improve classification performance and its application in clinical practice.
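As a rough illustration of the two evaluation metrics named in the abstract, the following minimal sketch computes accuracy from a confusion matrix and AUC by trapezoidal integration of the ROC curve. It is not the authors' evaluation code: the function names, the binary setting, and the toy numbers are assumptions made for the example, and a multi-class task such as this one would typically apply the AUC computation per class (one-vs-rest).

```python
def accuracy_from_confusion(matrix):
    """Accuracy = correctly classified / total, for a square confusion
    matrix with rows as true classes and columns as predicted classes."""
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    return correct / total


def roc_auc(labels, scores):
    """Binary AUC: area under the ROC curve via the trapezoidal rule.
    labels are 0/1; scores are the predicted scores for class 1."""
    pos = sum(labels)
    neg = len(labels) - pos
    # Sweep the decision threshold from the highest score to the lowest.
    pairs = sorted(zip(scores, labels), key=lambda p: -p[0])
    tp = fp = 0
    prev_tpr = prev_fpr = 0.0
    auc = 0.0
    i = 0
    while i < len(pairs):
        # Tied scores share one threshold, so process them as a group.
        j = i
        while j < len(pairs) and pairs[j][0] == pairs[i][0]:
            if pairs[j][1] == 1:
                tp += 1
            else:
                fp += 1
            j += 1
        tpr, fpr = tp / pos, fp / neg
        auc += (fpr - prev_fpr) * (tpr + prev_tpr) / 2
        prev_tpr, prev_fpr = tpr, fpr
        i = j
    return auc


# Toy values, invented purely for illustration (not results from the paper).
toy_confusion = [[8, 2],
                 [1, 9]]
toy_labels = [0, 0, 1, 1]
toy_scores = [0.1, 0.4, 0.35, 0.8]

print(accuracy_from_confusion(toy_confusion))
print(roc_auc(toy_labels, toy_scores))
```

An AUC of 1.0 corresponds to a perfect ranking of positives above negatives, while 0.5 corresponds to chance-level ranking, which is why larger values indicate better classification performance.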