Non-invasive detection of esophageal carcinoma by integrative analysis of low-pass whole methylome sequencing of plasma cell-free DNA.

Dan Liu,Bin Jiang,Weizhi Chen,Peiyao Nie,Hongyu Xie,Tiancheng Han,Shunli Yang,Yu S. Huang,Fang Lv,Shuying He,Ying Yang,Yulong LI,Yuanyuan Hong,Jianing Yu
DOI: https://doi.org/10.1200/jco.2022.40.16_suppl.4067
IF: 45.3
2022-06-04
Journal of Clinical Oncology
Abstract:4067 Background: Cell-free DNA (cfDNA) methylation, fragmentation patterns, chromosome instability, and chromatin accessiblity have been previously shown to be valid plasma biomarkers for non-invasive cancer detection. However, conventional whole-genome bisulfite sequencing (WGBS) is unable to simultaneously profile all these biomarkers due to bisulfite-induced DNA damages. Here we developed a machine learning approach to comprehensively integrate multiple types of cancer genomic markers from enzyme-conversion-based low-pass whole-methylome sequencing (WMS) of plasma cfDNA to non-invasively detect esophageal carcinoma. Methods: Plasma cfDNA sampels from 85 patients and 568 healthy individuals were collected and were split into the discovery and independent testing cohort. The discovery cohort includes 60 cancer patients and 398 healthy individuals and the independent testing cohort includes 25 cancer patients and 170 healthy individuals. Whole methylome sequencing (WMS) libraries were generated from enzymatically converted cfDNA and were subsequently paired-end sequenced at ̃2X coverage. The genome-wide methylation density, fragmentation fingerprints, chromosome instability, and chromatin accessibility were extracted from the WMS data and individually modelled via machine learning methods such as SVM, LR, GBDT, random forest. The final predictive model is an ensemble model integrating all uni-modal models. All models were trained and fitted on the discovery cohort. Results: Data of different modalities provide complementary information in separating the cancer patients from the healthy individuals. Unsupervised clustering of the individuals showed clear separation between cancer patients and healthy individuals. The final predictive model achieved AUC =96.25% in the discovery cohort and AUC =91.15% in the independent testing cohort. Under a specificity of 95.29% (CI: 87.64% - 100.00%), sensitivity was 72.00% (CI: 56.00% - 92.00%) in the independent testing cohort. Separating the cancer patients into different stages, we found that the detection power is usuaul lower for early-stage cancer patients. Conclusions: These results demonstrate the first proof of principle on the feasibility of integrating multiple genomic cancer markers to non-invasively detect esophageal carcinoma from WMS plasma cell-free DNA. A large prospective cohort study is planned to further validate its clinical performance.
oncology
What problem does this paper attempt to address?