FUSECAPS: Investigating Feature Fusion Based Framework for Capsule Endoscopy Image Classification

Bidisha Chakraborty,Shree Mitra
2024-11-05
Abstract:In order to improve model accuracy, generalization, and class imbalance issues, this work offers a strong methodology for classifying endoscopic images. We suggest a hybrid feature extraction method that combines convolutional neural networks (CNNs), multi-layer perceptrons (MLPs), and radiomics. Rich, multi-scale feature extraction is made possible by this combination, which captures both deep and handmade representations. These features are then used by a classification head to classify diseases, producing a model with higher generalization and accuracy. In this framework we have achieved a validation accuracy of 76.2% in the capsule endoscopy video frame classification task.
Image and Video Processing,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?