Abstract:Background: Somatic Copy Number Alterations (CNAs) in human genomes are present in almost all human cancers. Systematic efforts to characterize such structural variants must effectively distinguish significant consensus events from random background aberrations. Here we introduce Significant Aberration in Cancer (SAIC), a new method for characterizing and assessing the statistical significance of recurrent CNA units. Three main features of SAIC include: (1) exploiting the intrinsic correlation among consecutive probes to assign a score to each CNA unit instead of single probes; (2) performing permutations on CNA units that preserve correlations inherent in the copy number data; and (3) iteratively detecting Significant Copy Number Aberrations (SCAs) and estimating an unbiased null distribution by applying an SCA-exclusive permutation scheme.Results: We test and compare the performance of SAIC against four peer methods (GISTIC, STAC, KC-SMART, CMDS) on a large number of simulation datasets. Experimental results show that SAIC outperforms peer methods in terms of larger area under the Receiver Operating Characteristics curve and increased detection power. We then apply SAIC to analyze structural genomic aberrations acquired in four real cancer genome-wide copy number data sets (ovarian cancer, metastatic prostate cancer, lung adenocarcinoma, glioblastoma). When compared with previously reported results, SAIC successfully identifies most SCAs known to be of biological significance and associated with oncogenes (e. g., KRAS, CCNE1, and MYC) or tumor suppressor genes (e. g., CDKN2A/B). Furthermore, SAIC identifies a number of novel SCAs in these copy number data that encompass tumor related genes and may warrant further studies.Conclusions: Supported by a well-grounded theoretical framework, SAIC has been developed and used to identify SCAs in various cancer copy number data sets, providing useful information to study the landscape of cancer genomes. Open-source and platform-independent SAIC software is implemented using C++, together with R scripts for data formatting and Perl scripts for user interfacing, and it is easy to install and efficient to use. The source code and documentation are freely available at http://www.cbil.ece.vt.edu/software.htm.

Genome-wide Identification of Significant Aberrations in Cancer Genome

Copy Number Aberrations from Affymetrix SNP 6.0 Genotyping Data-How Accurate Are Commonly Used Prediction Approaches?

TAGCNA: A Method to Identify Significant Consensus Events of Copy Number Alterations in Cancer

Genome-Wide Identification of Somatic Aberrations from Paired Normal-Tumor Samples

Comparative Analysis of Methods for Identifying Recurrent Copy Number Alterations in Cancer.

BIC-seq: a Fast Algorithm for Detection of Copy Number Alterations Based on High-Throughput Sequencing Data

SCCNAInfer: a robust and accurate tool to infer the absolute copy number on scDNA-seq data

Predicting Stage-Specific Recurrent Aberrations from Somatic Copy Number Dataset

Machine Learning Reveals Molecular Similarity and Fingerprints in Structural Aberrations of Somatic Cancer.

Copy Number Analysis Of Whole-Genome Data Using Bic-Seq2 And Its Application To Detection Of Cancer Susceptibility Variants

Using SAAS-CNV to Detect and Characterize Somatic Copy Number Alterations in Cancer Genomes from Next Generation Sequencing and SNP Array Data

Genome-wide Somatic Copy Number Alteration Analysis and Database Construction for Cervical Cancer.

SAAS-CNV: A Joint Segmentation Approach on Aggregated and Allele Specific Signals for the Identification of Somatic Copy Number Alterations with Next-Generation Sequencing Data.

An Accurate and Powerful Method for Copy Number Variation Detection

Copy Number Variation Detection In Whole-Genome Sequencing Data Using The Bayesian Information Criterion

P-Scnaclonal: Somatic Copy Number Alterations Based Tumor Subclonal Population Inferring Method

SCONCE: a method for profiling copy number alterations in cancer evolution using single-cell whole genome sequencing

Comprehensive Analysis of Clinically Relevant Copy Number Alterations (CNAs) Using a 523-Gene Next-Generation Sequencing Panel and NxClinical Software in Solid Tumors

CaSNP: a Database for Interrogating Copy Number Alterations of Cancer Genome from SNP Array Data

The landscape of somatic copy-number alteration across human cancers

Accucopy: accurate and fast inference of allele-specific copy number alterations from low-coverage low-purity tumor sequencing data