Global mapping of cancers: The Cancer Genome Atlas and beyond
Carlo Ganini, Ivano Amelio, Riccardo Bertolo, Pierluigi Bove, Oreste Claudio Buonomo, Eleonora Candi, Chiara Cipriani, Nicola Di Daniele, Hartmut Juhl, Alessandro Mauriello, Carla Marani, John Marshall, Sonia Melino, Paolo Marchetti, Manuela Montanaro, Maria Emanuela Natale, Flavia Novelli, Giampiero Palmieri, Mauro Piacentini, Erino Angelo Rendina, Mario Roselli, Giuseppe Sica, Manfredi Tesauro, Valentina Rovella, Giuseppe Tisone, Yufang Shi, Ying Wang, Gerry Melino
DOI: https://doi.org/10.1002/1878-0261.13056
2021-01-01
Molecular Oncology
Abstract: Cancer genomes have been explored since the early 2000s through massive exome sequencing efforts, leading to the publication of The Cancer Genome Atlas in 2013. Sequencing techniques were developed alongside this project and have allowed scientists to overcome the cost limitations of whole-genome sequencing (WGS) of single specimens, enabling more accurate and extensive cancer sequencing projects, such as deep sequencing of whole genomes and transcriptomic analysis. The Pan-Cancer Analysis of Whole Genomes recently published WGS data from more than 2600 human cancers together with almost 1200 related transcriptomes. The application of WGS to a large database allowed, for the first time, a global analysis of features such as molecular signatures, large structural variations and noncoding regions of the genome, as well as the evaluation of RNA alterations in the absence of underlying DNA mutations. The vast amount of data generated still needs to be thoroughly deciphered, and the advent of machine-learning approaches will be the next step towards the generation of personalized approaches for cancer medicine. This manuscript gives a broad perspective on some of the biological evidence derived from the largest sequencing efforts on human cancers so far, discussing the advantages and limitations of this approach and its power in the era of machine learning.