Comprehensive analysis of clustered mutations in cancer reveals recurrent APOBEC3 mutagenesis of ecDNA

Erik N. Bergstrom, Jens Luebeck, Mia Petljak, Vineet Bafna, Paul S. Mischel, Reuben S. Harris, Ludmil B. Alexandrov
DOI: https://doi.org/10.1101/2021.05.27.445689
2021-05-27
Abstract: Clustered somatic mutations are common in cancer genomes, and prior analyses have revealed several types of clustered single-base substitutions, including doublet- and multi-base substitutions, diffuse hypermutation termed omikli, and longer strand-coordinated events termed kataegis. Here, we provide a comprehensive characterization of clustered substitutions and clustered small insertions and deletions (indels) across 2,583 whole-genome sequenced cancers from 30 cancer types. Although only 3.7% of substitutions and 0.9% of indels were clustered, they contributed 8.4% of substitution drivers and 6.9% of indel drivers. Multiple distinct mutational processes gave rise to clustered indels, including signatures enriched in tobacco smokers and in homologous-recombination-deficient cancers. Doublet-base substitutions were caused by at least 12 mutational processes, while the majority of multi-base substitutions were generated by either tobacco smoking or exposure to ultraviolet light. Omikli events, previously attributed to the activity of APOBEC3 deaminases, accounted for a large proportion of clustered substitutions. However, only 16.2% of omikli matched APOBEC3 patterns, and experimental validation confirmed that additional mutational processes give rise to omikli. Kataegis was generated by multiple mutational processes, with 76.1% of all kataegic events exhibiting AID/APOBEC3-associated mutational patterns. Co-occurrence of APOBEC3 kataegis and extrachromosomal DNA (ecDNA) was observed in 31% of samples with ecDNA. Multiple distinct APOBEC3 kataegic events were observed on most mutated ecDNA. ecDNA containing known cancer genes exhibited both positive selection and kataegic hypermutation. Our results reveal the diversity of clustered mutational processes in human cancer and the role of APOBEC3 in recurrently mutating and fueling the evolution of ecDNA.