Quantifying the Effect of Image Similarity on Diabetic Foot Ulcer Classification

Imran Chowdhury Dipto,Bill Cassidy,Connah Kendrick,Neil D. Reeves,Joseph M. Pappachan,Vishnu Chandrabalan,Moi Hoon Yap
DOI: https://doi.org/10.1007/978-3-031-26354-5_1
2023-04-26
Abstract:This research conducts an investigation on the effect of visually similar images within a publicly available diabetic foot ulcer dataset when training deep learning classification networks. The presence of binary-identical duplicate images in datasets used to train deep learning algorithms is a well known issue that can introduce unwanted bias which can degrade network performance. However, the effect of visually similar non-identical images is an under-researched topic, and has so far not been investigated in any diabetic foot ulcer studies. We use an open-source fuzzy algorithm to identify groups of increasingly similar images in the Diabetic Foot Ulcers Challenge 2021 (DFUC2021) training dataset. Based on each similarity threshold, we create new training sets that we use to train a range of deep learning multi-class classifiers. We then evaluate the performance of the best performing model on the DFUC2021 test set. Our findings show that the model trained on the training set with the 80\% similarity threshold images removed achieved the best performance using the InceptionResNetV2 network. This model showed improvements in F1-score, precision, and recall of 0.023, 0.029, and 0.013, respectively. These results indicate that highly similar images can contribute towards the presence of performance degrading bias within the Diabetic Foot Ulcers Challenge 2021 dataset, and that the removal of images that are 80\% similar from the training set can help to boost classification performance.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper aims to study the impact of image similarity on the task of diabetic foot ulcer classification. Specifically, the paper explores the effects of visually similar but not identical images when training deep learning classification networks. The paper points out that the presence of binary identical duplicate images in the dataset can lead to a decline in model performance, which is a known issue. However, the impact of visually similar but not identical images is a less studied topic. To this end, the authors used an open-source fuzzy matching algorithm to identify similar images in the diabetic foot ulcer dataset and created several new training sets based on different similarity thresholds. Various deep learning multi-classifiers were trained using these new training sets and their performance on the test set was evaluated. The research results show that after removing images with a similarity of 80%, the trained model improved by 0.023 in F1 score, 0.029 in precision, and 0.013 in recall. This indicates that highly similar images may lead to a decline in model performance, and removing these images from the training set helps to improve classification performance.