Machine learning based on multiplatform tests assists in subtype classification of mature B‐cell neoplasms

Junwei Lin,Yafei Mu,Lingling Liu,Yuhuan Meng,Tao Chen,Xijie Fan,Jiecheng Yuan,Maoting Shen,Jianhua Pan,Yuxia Ren,Shihui Yu,Yuxin Chen
DOI: https://doi.org/10.1111/bjh.19934
2024-12-04
British Journal of Haematology
Abstract:The subtyping of mature B‐cell neoplasms (MBNs) largely depends on the expertise of clinicians and pathologists. Variability in clinical experience may lead to differences in subtyping efficiency and consistency, and incorrect subtyping of MBNs may delay optimal treatment for patients. This study collected 1721 cases of MBNs, encompassing eight major MBN subtypes and further elucidated the genetic landscape of Chinese MBN patients. We first developed a machine learning (ML) model based on next‐generation sequencing (NGS) from Chinese MBN patients, with an accuracy of 0.719, which decreased to 0.707 after model feature selection. Another ML model based on NGS and tumour cell size had an accuracy of 0.715, which increased to 0.763 after model feature selection. Both models were more accurate than models constructed using Western MBN patient databases. Furthermore, by adding flow cytometry for CD5 and CD10, the accuracy reached 0.864, which further improved to 0.872 after model feature selection. These final models are accessible via an open‐access website. Summary Mature B‐cell neoplasms (MBNs) are clonal proliferative diseases encompassing over 40 subtypes. The WHO classification (morphology, immunology, cytogenetics and molecular biology) provides comprehensive diagnostic understandings. However, MBN subtyping relies heavily on the expertise of clinicians and pathologists, and differences in clinical experience can lead to variations in subtyping efficiency and consistency. Additionally, due to the diversity in genetic backgrounds, machine learning (ML) models constructed based on Western populations may not be suitable for Chinese MBN patients. To construct a highly accurate classification model suitable for Chinese MBN patients, we first developed an ML model based on next‐generation sequencing (NGS) from Chinese MBN patients, with an accuracy of 0.719, which decreased to 0.707 after model feature selection. Another ML model based on NGS and tumour cell size had an accuracy of 0.715, which increased to 0.763 after model feature selection. Both models were more accurate than models constructed using Western MBN patient databases. Furthermore, by adding flow cytometry for CD5 and CD10, the accuracy reached 0.864, which further improved to 0.872 after model feature selection. These models are accessible via an open‐access website. Overall, ML models incorporating multiplatform tests can serve as practical auxiliary tools for MBN subtype classification.
hematology
What problem does this paper attempt to address?