Abstract:Background and objective: In renal disease research, precise glomerular disease diagnosis is crucial for treatment and prognosis. Currently reliant on invasive biopsies, this method bears risks and pathologist-dependent variability, yielding inconsistent results. There is a pressing need for innovative diagnostic tools that enhance traditional methods, streamline processes, and ensure accurate and consistent disease detection. Methods: In this study, we present an innovative Convolutional Neural Networks-Vision Transformer (CVT) model leveraging Transformer technology to refine glomerular disease diagnosis by fusing spectral and spatial data, surpassing traditional diagnostic limitations. Using interval sampling, preprocessing, and wavelength optimization, we also introduced the Gramian Angular Field (GAF) method for a unified representation of spectral and spatial characteristics. Results: We captured hyperspectral images ranging from 385.18 nm to 1009.47 nm and employed various methods to extract sample features. Initial models based solely on spectral features achieved a accuracy of 85.24 %. However, the CVT model significantly outperformed these, achieving an average accuracy of 94 %. This demonstrates the model's superior capability in utilizing sample data and learning joint feature representations. Conclusions: The CVT model not only breaks through the limitations of existing diagnostic techniques but also showcases the vast potential of non-invasive, high-precision diagnostic technology in supporting the classification and prognosis of complex glomerular diseases. This innovative approach could significantly impact future diagnostic strategies in renal disease research. Concise abstract: This study introduces a transformative hyperspectral image classification model leveraging a Transformer to significantly improve glomerular disease diagnosis accuracy by synergizing spectral and spatial data, surpassing conventional methods. Through a rigorous comparative analysis, it was determined that while spectral features alone reached a peak accuracy of 85.24 %, the novel Convolutional Neural Network-Transformer (CVT) model's integration of spatial-spectral features via the Gramian Angular Field (GAF) method markedly enhanced diagnostic precision, achieving an average accuracy of 94 %. This methodological innovation not only overcomes traditional diagnostic limitations but also underscores the potential of non-invasive, high-precision technologies in advancing the classification and prognosis of complex renal diseases, setting a new benchmark in the field.

High-Speed and Accurate Diagnosis of Gastrointestinal Disease: Learning on Endoscopy Images Using Lightweight Transformer with Local Feature Attention.

Automatic disease detection in endoscopy with light weight transformer

Real-Time Multi-Label Upper Gastrointestinal Anatomy Recognition from Gastroscope Videos

SatFormer: Saliency-Guided Abnormality-Aware Transformer for Retinal Disease Classification in Fundus Image

Gastrointestinal Disorder Detection with a Transformer Based Approach

Pathological Insights: Enhanced Vision Transformers for the Early Detection of Colorectal Cancer

Transfer Learning in Endoscopic Imaging: A Machine Vision Approach to GIT Disease Identification

Enhancing image-based diagnosis of gastrointestinal tract diseases through deep learning with EfficientNet and advanced data augmentation techniques

Transformer-Based Disease Identification for Small-Scale Imbalanced Capsule Endoscopy Dataset

Development of a multi-fusion convolutional neural network (MF-CNN) for enhanced gastrointestinal disease diagnosis in endoscopy image analysis

Efficient-gastro: optimized EfficientNet model for the detection of gastrointestinal disorders using transfer learning and wireless capsule endoscopy images

Classification of Endoscopy and Video Capsule Images using CNN-Transformer Model

Accurate classification of glomerular diseases by hyperspectral imaging and transformer

Vision Transformer for Efficient Chest X-ray and Gastrointestinal Image Classification

Celiac disease diagnosis from endoscopic images based on multi-scale adaptive hybrid architecture model

Will Transformers change gastrointestinal endoscopic image analysis? A comparative analysis between CNNs and Transformers, in terms of performance, robustness and generalization

Gastrointestinal Disease Classification in Endoscopic Images Using Attention-Guided Convolutional Neural Networks

GIFCOS-DT: One Stage Detection of Gastrointestinal Tract Lesions From Endoscopic Images With Distance Transform

Accurate multiclassification and segmentation of gastric cancer based on a hybrid cascaded deep learning model with a vision transformer from endoscopic images

AI support for colonoscopy quality control using CNN and transformer architectures

Research and implementation of multi-disease diagnosis on chest X-ray based on vision transformer