Letter to the Editor: An ultra-sensitive assay using cell-free DNA fragmentomics for multi-cancer early detection
Hua Bao,Zheng Wang,Xiaoji Ma,Wei Guo,Xiangyu Zhang,Wanxiangfu Tang,Xin Chen,Xinyu Wang,Yikuan Chen,Shaobo Mo,Naixin Liang,Qianli Ma,Shuyu Wu,Xiuxiu Xu,Shuang Chang,Yulin Wei,Xian Zhang,Hairong Bao,Rui Liu,Shanshan Yang,Ya Jiang,Xue Wu,Yaqi Li,Long Zhang,Fengwei Tan,Qi Xue,Fangqi Liu,Sanjun Cai,Shugeng Gao,Junjie Peng,Jian Zhou,Yang Shao
DOI: https://doi.org/10.1186/s12943-022-01594-w
IF: 37.3
2022-06-12
Molecular Cancer
Abstract:Early detection can benefit cancer patients with more effective treatments and better prognosis, but existing early screening tests are limited, especially for multi-cancer detection. This study investigated the most prevalent and lethal cancer types, including primary liver cancer (PLC), colorectal adenocarcinoma (CRC), and lung adenocarcinoma (LUAD). Leveraging the emerging cell-free DNA (cfDNA) fragmentomics, we developed a robust machine learning model for multi-cancer early detection. 1,214 participants, including 381 PLC, 298 CRC, 292 LUAD patients, and 243 healthy volunteers, were enrolled. The majority of patients ( N = 971) were at early stages (stage 0, N = 34; stage I, N = 799). The participants were randomly divided into a training cohort and a test cohort in a 1:1 ratio while maintaining the ratio for the major histology subtypes. An ensemble stacked machine learning approach was developed using multiple plasma cfDNA fragmentomic features. The model was trained solely in the training cohort and then evaluated in the test cohort. Our model showed an Area Under the Curve (AUC) of 0.983 for differentiating cancer patients from healthy individuals. At 95.0% specificity, the sensitivity of detecting all cancer reached 95.5%, while 100%, 94.6%, and 90.4% for PLC, CRC, and LUAD, individually. The cancer origin model demonstrated an overall 93.1% accuracy for predicting cancer origin in the test cohort (97.4%, 94.3%, and 85.6% for PLC, CRC, and LUAD, respectively). Our model sensitivity is consistently high for early-stage and small-size tumors. Furthermore, its detection and origin classification power remained superior when reducing sequencing depth to 1× (cancer detection: ≥ 91.5% sensitivity at 95.0% specificity; cancer origin: ≥ 91.6% accuracy). In conclusion, we have incorporated plasma cfDNA fragmentomics into the ensemble stacked model and established an ultrasensitive assay for multi-cancer early detection, shedding light on developing cancer early screening in clinical practice.
oncology,biochemistry & molecular biology