Combining gene expression profiling and machine learning to diagnose B-cell non-Hodgkin lymphoma

Victor Bobée, Fanny Drieux, Vinciane Marchand, Vincent Sater, Liana Veresezan, Jean-Michel Picquenot, Pierre-Julien Viailly, Marie-Delphine Lanic, Mathieu Viennot, Elodie Bohers, Lucie Oberic, Christiane Copie-Bergman, Thierry Jo Molina, Philippe Gaulard, Corinne Haioun, Gilles Salles, Hervé Tilly, Fabrice Jardin, Philippe Ruminy
DOI: https://doi.org/10.1038/s41408-020-0322-5
IF: 9.812
2020-05-01
Blood Cancer Journal
Abstract: Non-Hodgkin B-cell lymphomas (B-NHLs) are a highly heterogeneous group of mature B-cell malignancies. Their classification thus requires skillful evaluation by expert hematopathologists, but the risk of error remains higher in these tumors than in many other areas of pathology. To facilitate diagnosis, we have thus developed a gene expression assay able to discriminate the seven most frequent B-cell NHL categories. This assay relies on the combination of ligation-dependent RT-PCR and next-generation sequencing, and addresses the expression of more than 130 genetic markers. It was designed to retrieve the main gene expression signatures of B-NHL cells and their microenvironment. The classification is handled by a random forest algorithm which we trained and validated on a large cohort of more than 400 annotated cases of different histology. Its clinical relevance was verified through its capacity to prevent important misclassification in low grade lymphomas and to retrieve clinically important characteristics in high grade lymphomas including the cell-of-origin signatures and the MYC and BCL2 expression levels. This accurate pan-B-NHL predictor, which allows a systematic evaluation of numerous diagnostic and prognostic markers, could thus be proposed as a complement to conventional histology to guide the management of patients and facilitate their stratification into clinical trials.
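The classification step described in the abstract (a random forest assigning each case to one of seven categories from the expression of about 130 markers) can be illustrated with a minimal sketch. This is not the authors' pipeline: the data below are synthetic, the cohort size, class labels, tree count, and train/test split are assumptions chosen only to mirror the orders of magnitude given in the abstract, and scikit-learn is an assumed choice of library.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Assumed dimensions, loosely mirroring the abstract:
# 7 lymphoma categories, 130 expression markers, 420 cases in total.
N_CLASSES, N_MARKERS, N_PER_CLASS = 7, 130, 60

rng = np.random.default_rng(0)

# Synthetic stand-in for expression data: each class gets its own
# mean expression profile, and each case is that profile plus noise.
class_profiles = rng.normal(0.0, 1.5, size=(N_CLASSES, N_MARKERS))
X = np.vstack([
    class_profiles[c] + rng.normal(0.0, 1.0, size=(N_PER_CLASS, N_MARKERS))
    for c in range(N_CLASSES)
])
y = np.repeat(np.arange(N_CLASSES), N_PER_CLASS)

# Hold out 30% of cases, stratified so every class appears in both sets.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

# Train the forest (200 trees is an arbitrary, hypothetical setting).
clf = RandomForestClassifier(n_estimators=200, random_state=0)
clf.fit(X_train, y_train)

# Held-out accuracy and per-class probabilities for each test case.
accuracy = clf.score(X_test, y_test)
proba = clf.predict_proba(X_test)

print(f"held-out accuracy: {accuracy:.3f}")
print("class probabilities for first test case:", np.round(proba[0], 2))
```

The per-class probabilities returned by `predict_proba` are the kind of output that could let a low-confidence call be flagged for expert review rather than reported as a firm diagnosis; whether the published assay uses them this way is not stated in the abstract.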
oncology,hematology