Abstract:Cognitive disorders affect various cognitive functions that can have a substantial impact on individual's daily life. Alzheimer's disease (AD) is one of such well-known cognitive disorders. Early detection and treatment of cognitive diseases using artificial intelligence can help contain them. However, the complex spatial relationships and long-range dependencies found in medical imaging data present challenges in achieving the objective. Moreover, for a few years, the application of transformers in imaging has emerged as a promising area of research. A reason can be transformer's impressive capabilities of tackling spatial relationships and long-range dependency challenges in two ways, i.e., (1) using their self-attention mechanism to generate comprehensive features, and (2) capture complex patterns by incorporating global context and long-range dependencies. In this work, a Bi-Vision Transformer (BiViT) architecture is proposed for classifying different stages of AD, and multiple types of cognitive disorders from 2-dimensional MRI imaging data. More specifically, the transformer is composed of two novel modules, namely Mutual Latent Fusion (MLF) and Parallel Coupled Encoding Strategy (PCES), for effective feature learning. Two different datasets have been used to evaluate the performance of proposed BiViT-based architecture. The first dataset contain several classes such as mild or moderate demented stages of the AD. The other dataset is composed of samples from patients with AD and different cognitive disorders such as mild, early, or moderate impairments. For comprehensive comparison, a multiple transfer learning algorithm and a deep autoencoder have been each trained on both datasets. The results show that the proposed BiViT-based model achieves an accuracy of 96.38% on the AD dataset. However, when applied to cognitive disease data, the accuracy slightly decreases below 96% which can be resulted due to smaller amount of data and imbalance in data distribution. Nevertheless, given the results, it can be hypothesized that the proposed algorithm can perform better if the imbalanced distribution and limited availability problems in data can be addressed.

Introducing Vision Transformer for Alzheimer's Disease classification task with 3D input

Vision Mamba: Cutting-Edge Classification of Alzheimer's Disease with 3D MRI Scans

Efficiently Training Vision Transformers on Structural MRI Scans for Alzheimer's Disease Detection

Enhancing MRI-Based Classification of Alzheimer's Disease with Explainable 3D Hybrid Compact Convolutional Transformers

Vision Transformers and Bi-LSTM for Alzheimer's Disease Diagnosis from 3D MRI

Multiple Inputs and Mixed Data for Alzheimer's Disease Classification Based on 3D Vision Transformer

Joint transformer architecture in brain 3D MRI classification: its application in Alzheimer's disease classification

Ensemble Vision Transformer for Dementia Diagnosis

Explainable early detection of Alzheimer's disease using ROIs and an ensemble of 138 3D vision transformers

Ensemble of vision transformer architectures for efficient Alzheimer's Disease classification

3D Transformer based on deformable patch location for differential diagnosis between Alzheimer's disease and Frontotemporal dementia

Attention-based Efficient Classification for 3D MRI Image of Alzheimer's Disease

Visualizing Convolutional Networks for MRI-based Diagnosis of Alzheimer's Disease

Video and Synthetic MRI Pre-training of 3D Vision Architectures for Neuroimage Analysis

3D transfer learning network for classification of Alzheimer’s disease with MRI

Alzheimer's Disease Diagnostics by Adaptation of 3D Convolutional Network

Classification and Visualization of Alzheimer’s Disease using Volumetric Convolutional Neural Network and Transfer Learning

FGI-CogViT: Fuzzy Granule-based Interpretable Cognitive Vision Transformer for Early Detection of Alzheimer's Disease using MRI Scan Images

Computer-aided diagnosis of Alzheimer's disease and neurocognitive disorders with multimodal Bi-Vision Transformer (BiViT)

Diagnosis of Alzheimer's disease via optimized lightweight convolution-attention and structural MRI

ViTAD: Leveraging Modified Vision Transformer for Alzheimer's Disease Multi-Stage Classification from Brain MRI Scans