Abstract:Background AluScan combines inter- Alu PCR using multiple Alu -based primers with opposite orientations and next-generation sequencing to capture a huge number of Alu -proximal genomic sequences for investigation. Its requirement of only sub-microgram quantities of DNA facilitates the examination of large numbers of samples. However, the special features of AluScan data rendered difficult the calling of copy number variation (CNV) directly using the calling algorithms designed for whole genome sequencing (WGS) or exome sequencing. Results In this study, an AluScanCNV package has been assembled for efficient CNV calling from AluScan sequencing data employing a Geary-Hinkley transformation (GHT) of read-depth ratios between either paired test-control samples, or between test samples and a reference template constructed from reference samples, to call the localized CNVs, followed by use of a GISTIC-like algorithm to identify recurrent CNVs and circular binary segmentation (CBS) to reveal large extended CNVs. To evaluate the utility of CNVs called from AluScan data, the AluScans from 23 non-cancer and 38 cancer genomes were analyzed in this study. The glioma samples analyzed yielded the familiar extended copy-number losses on chromosomes 1p and 9. Also, the recurrent somatic CNVs identified from liver cancer samples were similar to those reported for liver cancer WGS with respect to a striking enrichment of copy-number gains in chromosomes 1q and 8q. When localized or recurrent CNV-features capable of distinguishing between liver and non-liver cancer samples were selected by correlation-based machine learning, a highly accurate separation of the liver and non-liver cancer classes was attained. Conclusions The results obtained from non-cancer and cancerous tissues indicated that the AluScanCNV package can be employed to call localized, recurrent and extended CNVs from AluScan sequences. Moreover, both the localized and recurrent CNVs identified by this method could be subjected to machine-learning selection to yield distinguishing CNV-features that were capable of separating between liver cancers and other types of cancers. Since the method is applicable to any human DNA sample with or without the availability of a paired control, it can also be employed to analyze the constitutional CNVs of individuals.

BIC-seq: a Fast Algorithm for Detection of Copy Number Alterations Based on High-Throughput Sequencing Data

Copy Number Variation Detection In Whole-Genome Sequencing Data Using The Bayesian Information Criterion

Copy Number Analysis Of Whole-Genome Data Using Bic-Seq2 And Its Application To Detection Of Cancer Susceptibility Variants

Accuracy Of Cnv Detection From Gwas Data

Copy Number Aberrations from Affymetrix SNP 6.0 Genotyping Data-How Accurate Are Commonly Used Prediction Approaches?

SeqCNV: a Novel Method for Identification of Copy Number Variations in Targeted Next-Generation Sequencing Data

Computational validation of clonal and subclonal copy number alterations from bulk tumor sequencing using CNAqc

DL-CNV: A Deep Learning Method for Identifying Copy Number Variations Based on Next Generation Target Sequencing

SCCNAInfer: a robust and accurate tool to infer the absolute copy number on scDNA-seq data

High resolution copy number inference in cancer using short-molecule nanopore sequencing

Evaluation of tools for identifying large copy number variations from ultra-low-coverage whole-genome sequencing data

SCCNV: A Software Tool for Identifying Copy Number Variation From Single-Cell Whole-Genome Sequencing

CNVbd: A Method for Copy Number Variation Detection and Boundary Search

Copy number variation analysis based on AluScan sequences

BMI-CNV: A Bayesian framework for multiple genotyping platforms detection of copy number variation

Comprehensive Analysis of Clinically Relevant Copy Number Alterations (CNAs) Using a 523-Gene Next-Generation Sequencing Panel and NxClinical Software in Solid Tumors

High-resolution detection of copy number alterations in single cells with HiScanner

Accucopy: accurate and fast inference of allele-specific copy number alterations from low-coverage low-purity tumor sequencing data

Clonecna: Detecting Subclonal Somatic Copy Number Alterations in Heterogeneous Tumor Samples from Whole-Exome Sequencing Data

Detecting copy number variations from single-cell chromatin sequencing data by AtaCNV

nbCNV: a multi-constrained optimization model for discovering copy number variants in single-cell sequencing data