Deep Learning for Morphology-Based, Bone Marrow Cell Classification

Shenghuan Sun,Jacob Cleave,Linlin Wang,Fabienne Lucas,Laura Brown,Jacob Spector,Leonardo Boiocchi,Jeeyeon Baik,Menglei Zhu,Orly Ardon,Chuanyi M. Lu,Ahmet Dogan,Dmitry Goldgof,Iain Carmichael,Sonam Prakash,Atul Butte,Gregory Mark Goldgof
DOI: https://doi.org/10.1182/blood-2023-172654
IF: 20.3
2023-11-28
Blood
Abstract:The morphological classification of cells in bone marrow aspirate (BMA) is central to the diagnosis of hematologic diseases, including leukemias. Despite being a critical task, its monotonous, time-consuming nature and dependency on highly skilled clinical experts makes it prone to human error. Such errors can lead to delays and misdiagnoses that negatively impact patient care. To counter these challenges, we curated an expansive dataset of more than 40,000 hematopathologist consensus-annotated single-cell images, extracted from BMA whole slide images (WSIs), each annotated into one of 23 distinct morphologic classes. We then utilized this data to develop DeepHeme, a convolutional neural network classifier designed for bone marrow cell typing tasks. DeepHeme achieves state-of-the-art performance in both the breadth of differentiable classes and accuracy across these classes. By comparing its performance to that of individual hematopathologists from three premier academic medical centers, using our gold standard consensus-labelled images, we found our AI algorithm either matched or surpassed the average performance across all classes. In addition, we integrated DeepHeme with internally developed region classifier and cell detection algorithms, culminating in a comprehensive diagnostic pipeline for whole slide cell differential. We next tested DeepHeme on slides from an external hospital system at a major cancer center to evaluate the generalizability of our model, a necessary precondition to widespread application. DeepHeme demonstrated a high level of generalizability, evidenced by a decrease of only 4% in the mean F-1 score, from 0.89 to 0.85, across all 23 cell classes. Lastly, to improve access to the DeepHeme algorithm results and encourage further real-world generalizability testing, we developed a web application that allows scientists and clinicians to test the DeepHeme algorithm on either test images from our study or their own user-uploaded aspirates.
hematology
What problem does this paper attempt to address?