Machine Learning-Based Flow Cytometry Diagnostics in Myelodysplastic Syndromes: Validation in the HOVON89 Clinical Trial (EudraCT 2008-002195-10)
Carolien Duetz,Sofie Van Gassen,Theresia M. Westers,Florentien in t Hout,Eline Cremers,Canan Alhan,Costa Bachas,Margot F. Van Spronsen,Heleen Visser-Wisselaar,Dana Chitu,Aniek O. De Graaf,Joop Jansen,Yvan Saeys,Arjan van de Loosdrecht
DOI: https://doi.org/10.1182/blood-2020-136719
IF: 20.3
2020-11-05
Blood
Abstract:Introduction Flow cytometry is a recommended tool in the diagnostic work-up of cytopenic patients suspected for myelodysplastic syndromes. Currently used flow cytometry scores rely on human assessment of dysplastic features in the bone marrow. Although proven useful, these methods are labor intensive and require a high level of expertise. Therefore, we previously developed a machine learning-based workflow for flow cytometry diagnostics in MDS by combining computational cell detection and a machine learning-classifier. This workflow outperformed traditional diagnostic scores with respect to accuracy (sensitivity 85-97%, specificity 93-97%), time investment (<30 seconds) and required materials (manuscript submitted). In the present study, we validated sensitivity of the workflow in a well-characterized clinical trial cohort (HOVON89 EudraCT 2008-002195-10) of lower risk MDS patients. Method Patient inclusion and characteristics Very low to intermediate risk MDS patients enrolled in the HOVON89 clinical trial (EudraCT 2008-002195-10) were included. 53 patients met the additional inclusion criteria, concerning written consent for add-on studies and availability of required flow cytometry data. Sample preparation Bone marrow samples were processed for flow cytometry analysis according to the European Leukemia Net guidelines. This study focused on the antibody combination optimized for assessment of myeloid progenitors and erythroid dysplasia (CD45, CD34, CD117, HLA-DR, CD71, CD36, CD105, CD33, sideward light scatter (SSC) and forward light scatter (FSC)). Machine learning-based workflow The machine learning-based workflow was developed in a prior study based on a reference cohort consisting of MDS patients without excess of blasts(n=67) and non-MDS cases (n=81) (Figure 1). MDS patients were diagnosed based on (cyto)morphology, cytogenetics and clinical follow-up. Non-MDS cases were patients with confirmed non-neoplastic cytopenias (n=69) and age-matched healthy individuals (n=12). Results In the validation cohort, the machine learning-based diagnostic workflow classified 49 out of 53 patients correctly, reaching a sensitivity of 92%. The workflow outperformed two currently used diagnostic tools for MDS flow cytometry, the Ogata score and integrated flow cytometry score (iFS). The former obtained 72% sensitivity (McNemar: p = 0.001) and the latter 83% sensitivity (McNemar: p = 0.06) in the validation cohort. Per patient, time required for automated analysis was less than 30 seconds. All four MDS patients that classified false negatively had a normal karyotype and (very) low risk disease according to the IPSS-r. In three out of four patients, no mutations or MDS-associated immunophenotypic features were detected. One patients was diagnosed as MDS-MLD and three patients as MDS-RS-SLD according to the WHO 2016 classification. The ten most relevant cellular features that discriminated between MDS and non-MDS patients in the reference data were confirmed in the current validation cohort. All ten features of MDS patients in the validation cohort were significantly different from non-MDS patients of the reference cohort (all features, p < 0.00001) (Figure 2). Seven out of ten features were similar in MDS patients of the validation cohort compared to those of the MDS patients of the reference cohort (p>0.05) (Figure 2). Conclusion In this validation study, we confirmed accuracy of machine learning-based flow cytometry diagnostics in lower risk MDS. The workflow obtained 92% sensitivity, which is in accordance with results from our previous study (85-97%), and outperformed currently used diagnostic flow cytometry scores for MDS (i.e. Ogata score and iFS). In our previous study specificity was 95% in both reference and test cohorts. Cellular features, most discriminative for diagnosis, were confirmed in the validation cohort, emphasizing robustness of the method. Additional benefits of this approach are the reduction in analysis time to less than thirty seconds per patient, reduction of required antibodies and increased reproducibility. Disclosures van de Loosdrecht: celgene: Honoraria; novartis: Honoraria.
hematology