Towards Cross-Domain Single Blood Cell Image Classification via Large-Scale LoRA-based Segment Anything Model

Yongcheng Li,Lingcong Cai,Ying Lu,Yupeng Zhang,Jingyan Jiang,Genan Dai,Bowen Zhang,Jingzhou Cao,Xiangzhong Zhang,Xiaomao Fan
2024-08-13
Abstract:Accurate classification of blood cells plays a vital role in hematological analysis as it aids physicians in diagnosing various medical conditions. In this study, we present a novel approach for classifying blood cell images known as BC-SAM. BC-SAM leverages the large-scale foundation model of Segment Anything Model (SAM) and incorporates a fine-tuning technique using LoRA, allowing it to extract general image embeddings from blood cell images. To enhance the applicability of BC-SAM across different blood cell image datasets, we introduce an unsupervised cross-domain autoencoder that focuses on learning intrinsic features while suppressing artifacts in the images. To assess the performance of BC-SAM, we employ four widely used machine learning classifiers (Random Forest, Support Vector Machine, Artificial Neural Network, and XGBoost) to construct blood cell classification models and compare them against existing state-of-the-art methods. Experimental results conducted on two publicly available blood cell datasets (Matek-19 and Acevedo-20) demonstrate that our proposed BC-SAM achieves a new state-of-the-art result, surpassing the baseline methods with a significant improvement. The source code of this paper is available at <a class="link-external link-https" href="https://github.com/AnoK3111/BC-SAM" rel="external noopener nofollow">this https URL</a>.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper aims to address the problem of blood cell image classification, particularly the cross-domain classification issue between different datasets. Specifically, the research team proposed a new method called BC-SAM (Blood Cell Segment Anything Model), which leverages the large-scale foundational model Segment Anything Model (SAM) and fine-tunes it using LoRA technology to extract general embedding features of blood cell images. To enhance the model's applicability across different datasets, they introduced an unsupervised cross-domain autoencoder that focuses on learning the intrinsic features of images while suppressing artifacts in the images. Experimental results show that BC-SAM achieves significantly better cross-domain classification performance than existing methods on two public blood cell datasets (Matek-19 and Acevedo-20). Additionally, the effectiveness and superiority of BC-SAM were further validated by models constructed using four commonly used machine learning classifiers (Random Forest, Support Vector Machine, Artificial Neural Network, and XGBoost).