SCKansformer: Fine-Grained Classification of Bone Marrow Cells via Kansformer Backbone and Hierarchical Attention Mechanisms

Yifei Chen,Zhu Zhu,Shenghao Zhu,Linwei Qiu,Binfeng Zou,Fan Jia,Yunpeng Zhu,Chenyan Zhang,Zhaojie Fang,Feiwei Qin,Jin Fan,Changmiao Wang,Yu Gao,Gang Yu
2024-10-11
Abstract:The incidence and mortality rates of malignant tumors, such as acute leukemia, have risen significantly. Clinically, hospitals rely on cytological examination of peripheral blood and bone marrow smears to diagnose malignant tumors, with accurate blood cell counting being crucial. Existing automated methods face challenges such as low feature expression capability, poor interpretability, and redundant feature extraction when processing high-dimensional microimage data. We propose a novel fine-grained classification model, SCKansformer, for bone marrow blood cells, which addresses these challenges and enhances classification accuracy and efficiency. The model integrates the Kansformer Encoder, SCConv Encoder, and Global-Local Attention Encoder. The Kansformer Encoder replaces the traditional MLP layer with the KAN, improving nonlinear feature representation and interpretability. The SCConv Encoder, with its Spatial and Channel Reconstruction Units, enhances feature representation and reduces redundancy. The Global-Local Attention Encoder combines Multi-head Self-Attention with a Local Part module to capture both global and local features. We validated our model using the Bone Marrow Blood Cell Fine-Grained Classification Dataset (BMCD-FGCD), comprising over 10,000 samples and nearly 40 classifications, developed with a partner hospital. Comparative experiments on our private dataset, as well as the publicly available PBC and ALL-IDB datasets, demonstrate that SCKansformer outperforms both typical and advanced microcell classification methods across all datasets. Our source code and private BMCD-FGCD dataset are available at <a class="link-external link-https" href="https://github.com/JustlfC03/SCKansformer" rel="external noopener nofollow">this https URL</a>.
Image and Video Processing,Computer Vision and Pattern Recognition,Machine Learning
What problem does this paper attempt to address?
### Problems Addressed by the Paper This paper aims to address the challenges in fine-grained classification of bone marrow cells, particularly in the diagnosis of malignant tumors such as acute leukemia. Specifically, the paper targets the following issues: 1. **Low Feature Expression Capability**: Existing automated methods have insufficient feature expression capability when dealing with high-dimensional microscopic image data, leading to low classification accuracy. 2. **Poor Interpretability**: Current methods have poor model interpretability when processing complex image data, making it difficult to understand the decision-making process of the model. 3. **Redundant Feature Extraction**: Existing methods have redundant information in the feature extraction process, affecting the efficiency and accuracy of the model. To address these challenges, the authors propose a new fine-grained classification model—SCKansformer. This model improves the accuracy and efficiency of bone marrow cell classification by integrating the Kansformer encoder, SCConv encoder, and global-local attention encoder. The specific contributions include: - **Kansformer Encoder**: Utilizes the Kolmogorov-Arnold Network (KAN) to replace traditional Multi-Layer Perceptron (MLP) layers, enhancing nonlinear feature representation and model interpretability. - **Enhanced Feature Representation**: The SCConv encoder reduces feature redundancy through spatial reconstruction units and channel reconstruction units, optimizing the feature representation of target cells. The global-local attention encoder combines multi-head self-attention modules and local part modules to effectively capture global and local features of microscopic images, improving classification reliability. - **BMCD-FGCD Dataset**: In collaboration with the affiliated hospital of Zhejiang University, a fine-grained classification dataset of bone marrow cells (BMCD-FGCD) containing over 10,000 samples and nearly 40 categories was established and publicly shared with other researchers. - **Experimental Validation**: Comparative experiments were conducted on the private BMCD-FGCD dataset as well as the public PBC and ALL-IDB datasets. The results show that the SCKansformer model outperforms other methods on all datasets, especially in handling imbalanced datasets. Through these improvements, the SCKansformer model not only provides important reference standards for clinicians but also significantly enhances the accuracy of diagnosing acute leukemia and other major hematological diseases, holding significant clinical auxiliary diagnostic value.