RDTNet: A residual deformable attention based transformer network for breast cancer classification
Babita, Deepak Ranjan Nayak
DOI: https://doi.org/10.1016/j.eswa.2024.123569
IF: 8.5
2024-03-08
Expert Systems with Applications
Abstract: Accurate and timely detection of breast cancer plays a pivotal role in reducing the mortality rate. Deep learning models, especially CNNs, have recently shown astounding performance in detecting breast cancer from histopathological images. However, their main drawback is a limited capacity to capture subtle lesion information. Vision transformers (ViTs) have emerged as a promising technique due to their ability to capture global feature dependencies through self-attention. Nevertheless, applying ViTs in medical imaging is challenging due to the unavailability of large training data and their limited ability to capture local contextual information. To address these challenges, we propose a residual deformable attention-based transformer network (RDTNet) for breast cancer classification, which can capture local and global contextual details from the histopathological images. In RDTNet, we introduce a residual deformable transformer layer (RDTL) after a backbone network. The RDTL comprises multi-head deformable self-attention (MDSA) mechanisms and residual connections, enabling fine-grained and category-specific lesion feature extraction. The experimental results on a benchmark dataset indicate the superiority of RDTNet over state-of-the-art methods. Notably, our model achieves image-level accuracies of 99.00%, 98.87%, 98.84%, and 97.80% and patient-level accuracies of 96.41%, 94.82%, 93.91%, and 91.25% for 40×, 100×, 200×, and 400× magnifications, respectively. The improved performance of RDTNet can be attributed to the integration of RDTL with a backbone network.
computer science, artificial intelligence; engineering, electrical & electronic; operations research & management science
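
The abstract reports both image-level and patient-level accuracy but does not define them. The sketch below assumes a common convention for histopathology benchmarks: image-level accuracy is the fraction of correctly classified images, and patient-level accuracy is the mean of each patient's own fraction of correctly classified images. The function names, the record layout, and the toy data are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def image_level_accuracy(records):
    """Fraction of images classified correctly.

    records: iterable of (patient_id, true_label, predicted_label).
    """
    records = list(records)
    correct = sum(1 for _, truth, pred in records if truth == pred)
    return correct / len(records)


def patient_level_accuracy(records):
    """Mean over patients of each patient's per-image accuracy.

    Assumed definition; the paper's exact formula is not given in the abstract.
    """
    per_patient = defaultdict(lambda: [0, 0])  # patient_id -> [correct, total]
    for patient_id, truth, pred in records:
        per_patient[patient_id][0] += int(truth == pred)
        per_patient[patient_id][1] += 1
    scores = [correct / total for correct, total in per_patient.values()]
    return sum(scores) / len(scores)


# Toy data (made up for illustration): two patients with unequal image counts.
records = [
    ("P1", "malignant", "malignant"),
    ("P1", "malignant", "malignant"),
    ("P1", "malignant", "benign"),
    ("P1", "malignant", "malignant"),
    ("P2", "benign", "benign"),
    ("P2", "benign", "malignant"),
]

print(f"image-level:   {image_level_accuracy(records):.4f}")
print(f"patient-level: {patient_level_accuracy(records):.4f}")
```

Under this assumed definition the two metrics can differ whenever patients contribute unequal numbers of images, because every patient is weighted equally in the patient-level score regardless of image count.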