Abstract:The detection of copy number abnormalities (CNAs) is of significant clinical relevance in hematological malignancy (e.g. TP53 CN loss in chronic lymphocytic leukemia). Whilst detection of CNAs is typically performed by FISH or SNP array, it is also possible from next generation sequencing (NGS) data. Given the rapid and global expansion of targeted sequencing for the diagnosis and management of hematological malignancy, we aimed to develop a copy number (CN) analysis tool specifically suited to reporting CN in hematological malignancy in the diagnostic laboratory. We describe CNspectorX, a comprehensive CNA solution from FASTQ to interactive whole genome CN visualization with significant advantages over existing bioinformatics tools. Detecting CNAs sensitively and specifically from NGS data requires a diploid baseline with minimal random variation (noise) and with non-random variation which can correct for systematic biases. An optimal baseline requires representation of recurrent technical artefacts in the reference set. Artefacts differ based on sample type and gender (amongst other variables), therefore CNspectorX creates a patient individualized reference by selecting samples from a specified cohort based on matching metadata to the analyzed sample. This corrects for systematic bias present in the diverse sample types seen in hematological malignancy, including blood, bone marrow aspirate, FFPE tissue and circulating tumor DNA (ctDNA). CNspectorX uses either a pre-built reference or can create one from samples within a single sequencing run and includes a web-based GUI (CNspector) where samples can be dynamically added to a reference to support interactive, real-time recalculation and display of CN calls. Moreover, CNspectorX computes references using a robust estimator allowing inclusion of other tumor samples in the reference set. This flexible and dynamic approach to reference generation is unique to CNspectorX when compared to callers such as CNVkit, CopywriteR and Sequenza. Despite enrichment of targeted areas, hybridization-based NGS panels contain a proportion of residual fragments which map uniquely at random locations throughout the genome. CNspectorX aligns these "off-target" reads to allow genome-wide CN in the absence of a genome-wide SNP backbone (Figure 1). Whilst resolution is lower in off-target regions, CNspectorX can detect clinically relevant aneuploidy, chromosomal arm deletions, and segmental deletion/amplification to approximately 1-3Mb in untargeted regions (Figure 1). CNspectorX also calculates and displays genome-wide B-allele frequency from off-target reads which can both provide further support for CN calls and also detect areas of copy number neutral loss of heterozygosity. CNspectorX has a user-friendly, interactive visualization interface that displays data at genome level, chromosomal arm level and gene/exon level, allowing the analyst to observe not only the local CN change but the surrounding genomic context which may be crucial for interpretation (Figure 1). Binning of CN calls is scaled at each level allowing accurate calling of CNA from chromosome to exon level. Morover, CNA calling by CNspectorX is model-free and not reliant on the measurement of allele frequencies. Hence CNspectorX works well with small gene panels where CNA calls are computed from read abundance alone. This also means that, unlike callers that use purity/ploidy models, CNspectorX is not confounded by heterogeneity that breaks assumptions of clonality. CNspectorX has been implemented in an accredited (ISO15189) diagnostic laboratory and has been used to analyze CNA in over 1500 samples from patients with hematological malignancy. With CNspectorX we have identified multiple clinically relevant somatic and germline CN abnormalities not detected by conventional testing including high level focal amplifications in CD274 (PD-L1), MYC, MCL1, XPO1/REL and CN losses involving TP53, ATM, RPS19, BIRC3 and CDKN2A/B. In summary, we have developed CNspectorX, a comprehensive CNA analysis tool that can calculate and display whole genome-level CN from targeted NGS panels. CNspectorX is able to take into account the diverse and specific technical artefacts from typical sample types (including ctDNA) tested by the diagnostic laboratory in the investigation of hematological malignancy resulting in accurate CN data for clinical interpretation. Disclosures Dickinson: GlaxoSmithKline: Consultancy, Honoraria, Membership on an entity's Board of Directors or advisory committees, Research Funding; Merck Sharpe and Dohme: Consultancy, Honoraria, Membership on an entity's Board of Directors or advisory committees, Research Funding, Speakers Bureau; Takeda: Consultancy, Honoraria, Membership on an entity's Board of Directors or advisory committees, Research Funding, Speakers Bureau; Celgene: Consultancy, Honoraria, Research Funding, Speakers Bureau; Novartis: Consultancy, Honoraria, Membership on an entity's Board of Directors or advisory committees, Research Funding, Speakers Bureau; Janssen: Consultancy, Honoraria, Membership on an entity's Board of Directors or advisory committees, Research Funding, Speakers Bureau; F. Hoffmann-La Roche Ltd: Consultancy, Honoraria, Research Funding, Speakers Bureau. Blombery:Novartis: Consultancy; Janssen: Honoraria; Invivoscribe: Honoraria.

Copy Number Aberrations from Affymetrix SNP 6.0 Genotyping Data-How Accurate Are Commonly Used Prediction Approaches?

Accuracy Of Cnv Detection From Gwas Data

Comprehensive Analysis of Clinically Relevant Copy Number Alterations (CNAs) Using a 523-Gene Next-Generation Sequencing Panel and NxClinical Software in Solid Tumors

SCCNAInfer: a robust and accurate tool to infer the absolute copy number on scDNA-seq data

Cnspectorx - Copy Number Assessment at a Genome Level from Targeted Sequence Data Optimized for Hematological Malignancy

Accucopy: accurate and fast inference of allele-specific copy number alterations from low-coverage low-purity tumor sequencing data

Comprehensive Assessment of Somatic Copy Number Variation Calling Using Next-Generation Sequencing Data

CaSNP: a Database for Interrogating Copy Number Alterations of Cancer Genome from SNP Array Data

A Remark on Copy Number Variation Detection Methods

HaplotypeCN: Copy Number Haplotype Inference with Hidden Markov Model and Localized Haplotype Clustering

Computational validation of clonal and subclonal copy number alterations from bulk tumor sequencing using CNAqc

Pivotal role of biallelic frequency analysis in identifying copy number alterations using genome-wide methods in tumors with a high level of aneuploidy

SCONCE: a method for profiling copy number alterations in cancer evolution using single-cell whole genome sequencing

Evaluation of somatic copy number variation detection by NGS technologies and bioinformatics tools on a hyper-diploid cancer genome

CNARA: reliability assessment for genomic copy number profiles

Abstract A027: Genome-wide quantification of copy-number aberration impact on gene expression in ovarian high-grade serous carcinoma

The Repertoire of Copy Number Alteration Signatures in Human Cancer

A novel signal processing approach for the detection of copy number variations in the human genome

CNAPE: A Machine Learning Method for Copy Number Alteration Prediction from Gene Expression

Evaluation of Copy Number Variation Detection for a SNP Array Platform

Analysis of acquired genomic copy number aberrations and regions of loss of heterozygosity in acute myelogenous leukemia genomes using Affymetrix SNP 6.0 arrays and supporting software tools.