Systematically Evaluating Cell‐Free DNA Fragmentation Patterns for Cancer Diagnosis and Enhanced Cancer Detection via Integrating Multiple Fragmentation Patterns

Yuying Hou, Xiang‐Yu Meng, Xionghui Zhou
DOI: https://doi.org/10.1002/advs.202308243
IF: 15.1
2024-06-19
Advanced Science
Abstract: Cell‐free DNA within open chromatin regions is susceptible to fragmentation. This study evaluates the diagnostic performance of 10 representative fragmentation patterns in open chromatin regions across 4 datasets for cancer diagnosis. An ensemble model is constructed by combining these fragmentation patterns, demonstrating a more stable performance for cancer detection and tissue‐of‐origin determination. The model's crucial regions offer biologically interpretable insights. Cell‐free DNA (cfDNA) fragmentation patterns have immense potential for early cancer detection. However, the definition of fragmentation varies, ranging from the entire genome to specific genomic regions. These patterns have not been systematically compared, impeding broader research and practical implementation. Here, 1382 plasma cfDNA sequencing samples from 8 cancer types are collected. Considering that cfDNA within open chromatin regions is more susceptible to fragmentation, 10 fragmentation patterns within open chromatin regions are examined as features, and machine learning techniques are employed to evaluate their performance. All fragmentation patterns demonstrated discernible classification capabilities, with the end motif showing the highest diagnostic value in cross‐validation. Combining cross‐validation and independent validation results revealed that fragmentation patterns incorporating both fragment length and coverage information exhibited robust predictive capacities. Despite their diagnostic potential, the predictive power of these fragmentation patterns is unstable. To address this limitation, an ensemble classifier integrating all fragmentation patterns is developed, which demonstrated notable improvements in cancer detection and tissue‐of‐origin determination. Further functional bioinformatics investigations of significant feature intervals in the model revealed its impressive ability to identify critical regulatory regions involved in cancer pathogenesis.
materials science, multidisciplinary,nanoscience & nanotechnology,chemistry