Abstract:Background: A variety of diseases are caused by chromosomal abnormalities such as aneuploidies (having an abnormal number of chromosomes), microdeletions, microduplications, and uniparental disomy. High density single nucleotide polymorphism (SNP) microarrays provide information on chromosomal copy number changes, as well as genotype (heterozygosity and homozygosity). SNP array studies generate multiple types of data for each SNP site, some with more than 100,000 SNPs represented on each array. The identification of different classes of anomalies within SNP data has been challenging. Results: We have developed SNPscan, a web-accessible tool to analyze and visualize high density SNP data. It enables researchers (1) to visually and quantitatively assess the quality of user-generated SNP data relative to a benchmark data set derived from a control population, (2) to display SNP intensity and allelic call data in order to detect chromosomal copy number anomalies (duplications and deletions), (3) to display uniparental isodisomy based on loss of heterozygosity (LOH) across genomic regions, (4) to compare paired samples (e.g. tumor and normal), and (5) to generate a file type for viewing SNP data in the University of California, Santa Cruz (UCSC) Human Genome Browser. SNPscan accepts data exported from Affymetrix Copy Number Analysis Tool as its input. We validated SNPscan using data generated from patients with known deletions, duplications, and uniparental disomy. We also inspected previously generated SNP data from 90 apparently normal individuals from the Centre d'Etude du Polymorphisme Humain (CEPH) collection, and identified three cases of uniparental isodisomy, four females having an apparently mosaic X chromosome, two mislabelled SNP data sets, and one microdeletion on chromosome 2 with mosaicism from an apparently normal female. These previously unrecognized abnormalities were all detected using SNPscan. The microdeletion was independently confirmed by fluorescence in situ hybridization, and a region of homozygosity in a UPD case was confirmed by sequencing of genomic DNA. Conclusion: SNPscan is useful to identify chromosomal abnormalities based on SNP intensity (such as chromosomal copy number changes) and heterozygosity data (including regions of LOH and some cases of UPD). The program and source code are available at the SNPscan website http://pevsnerlab.kennedykrieger.org/snpscan.htm.

A simple method for comparing microarray genotype data between brain and other tissues

Accuracy Of Cnv Detection From Gwas Data

Encoding of low-quality DNA profiles as genotype probability matrices for improved profile comparisons, relatedness evaluation and database searches

Comparative linkage analysis and visualization of high-density oligonucleotide SNP array data

Identifying differentially expressed genes in human acute leukemia and mouse brain microarray datasets utilizing QTModel

A comparative review of statistical methods for discovering differentially expressed genes in replicated microarray experiments

Diffreps: Detecting Differential Chromatin Modification Sites from ChIP-seq Data with Biological Replicates.

Major Copy Proportion Analysis of Tumor Samples Using Snp Arrays

A comparison of software for analysis of rare and common short tandem repeat (STR) variation using human genome sequences from clinical and population-based samples

Analysis and visualization of chromosomal abnormalities in SNP data with SNPscan

Analysis of acquired genomic copy number aberrations and regions of loss of heterozygosity in acute myelogenous leukemia genomes using Affymetrix SNP 6.0 arrays and supporting software tools.

Calculation of reliable transcript levels of annotated genes on the basis of multiple probe-sets in Affymetrix microarrays.

A simple method for statistical analysis of intensity differences in microarray-derived gene expression data

Analysing multiple types of molecular profiles simultaneously: connecting the needles in the haystack

A study of inter-lab and inter-platform agreement of DNA microarray data

Methods for evaluating gene expression from Affymetrix microarray datasets

Cancer Sample Analysis Utilizing Single-Nucleotide Polymorphism Array and Array Comparative Genomic Hybridization

Cross-platform comparability of microarray technology: Intra-platform consistency and appropriate data analysis procedures are essential

Mixed Sequence Reader: A Program for Analyzing DNA Sequences with Heterozygous Base Calling

Optimization Methods for Genotype Data Analysis in Epidemiological Studies

Excel Template for Identifying Mouse Myeloid Cell-Types in the Central Nervous System Based on Single-Cell RNA Sequencing Data