Integrative Imaging Informatics for Cancer Research: Workflow Automation for Neuro-Oncology (I3CR-WANO)
Satrajit Chakrabarty,Syed Amaan Abidi,Mina Mousa,Mahati Mokkarala,Isabelle Hren,Divya Yadav,Matthew Kelsey,Pamela LaMontagne,John Wood,Michael Adams,Yuzhuo Su,Sherry Thorpe,Caroline Chung,Aristeidis Sotiras,Daniel S. Marcus
DOI: https://doi.org/10.1200/cci.22.00177
2023-05-01
JCO Clinical Cancer Informatics
Abstract:PURPOSE Efforts to use growing volumes of clinical imaging data to generate tumor evaluations continue to require significant manual data wrangling, owing to data heterogeneity. Here, we propose an artificial intelligence–based solution for the aggregation and processing of multisequence neuro-oncology MRI data to extract quantitative tumor measurements. MATERIALS AND METHODS Our end-to-end framework (1) classifies MRI sequences using an ensemble classifier, (2) preprocesses the data in a reproducible manner, (3) delineates tumor tissue subtypes using convolutional neural networks, and (4) extracts diverse radiomic features. Moreover, it is robust to missing sequences and adopts an expert-in-the-loop approach in which the segmentation results may be manually refined by radiologists. After the implementation of the framework in Docker containers, it was applied to two retrospective glioma data sets collected from the Washington University School of Medicine (WUSM; n = 384) and The University of Texas MD Anderson Cancer Center (MDA; n = 30), comprising preoperative MRI scans from patients with pathologically confirmed gliomas. RESULTS The scan-type classifier yielded an accuracy of >99%, correctly identifying sequences from 380 of 384 and 30 of 30 sessions from the WUSM and MDA data sets, respectively. Segmentation performance was quantified using the Dice Similarity Coefficient between the predicted and expert-refined tumor masks. The mean Dice scores were 0.882 (±0.244) and 0.977 (±0.04) for whole-tumor segmentation for WUSM and MDA, respectively. CONCLUSION This streamlined framework automatically curated, processed, and segmented raw MRI data of patients with varying grades of gliomas, enabling the curation of large-scale neuro-oncology data sets and demonstrating high potential for integration as an assistive tool in clinical practice.