Abstract:BackgroundBiomedical applications of high-throughput sequencing methods generate a vast amount of data in which numerous chromatin features are mapped along the genome. The results are frequently analysed by creating binary data sets that link the presence/absence of a given feature to specific genomic loci. However, the nucleosome occupancy or chromatin accessibility landscape is essentially continuous. It is currently a challenge in the field to cope with continuous distributions of deep sequencing chromatin readouts and to integrate the different types of discrete chromatin features to reveal linkages between them.ResultsHere we introduce the NucTools suite of Perl scripts as well as MATLAB- and R-based visualization programs for a nucleosome-centred downstream analysis of deep sequencing data. NucTools accounts for the continuous distribution of nucleosome occupancy. It allows calculations of nucleosome occupancy profiles averaged over several replicates, comparisons of nucleosome occupancy landscapes between different experimental conditions, and the estimation of the changes of integral chromatin properties such as the nucleosome repeat length. Furthermore, NucTools facilitates the annotation of nucleosome occupancy with other chromatin features like binding of transcription factors or architectural proteins, and epigenetic marks like histone modifications or DNA methylation. The applications of NucTools are demonstrated for the comparison of several datasets for nucleosome occupancy in mouse embryonic stem cells (ESCs) and mouse embryonic fibroblasts (MEFs).ConclusionsThe typical workflows of data processing and integrative analysis with NucTools reveal information on the interplay of nucleosome positioning with other features such as for example binding of a transcription factor CTCF, regions with stable and unstable nucleosomes, and domains of large organized chromatin K9me2 modifications (LOCKs). As potential limitations and problems we discuss how inter-replicate variability of MNase-seq experiments can be addressed.

Epigenomics coverage data extraction and aggregation in R with tidyCoverage

The tidyomics ecosystem: enhancing omic data analyses

ggcoverage: an R package to visualize and annotate genome coverage for various NGS data

epidecodeR: a functional exploration tool for epigenetic and epitranscriptomic regulation

BRGenomics for analyzing high-resolution genomics data in R

Megadepth: efficient coverage quantification for BigWigs and BAMs

MethylCallR : a comprehensive analysis framework for Illumina Methylation Beadchip

covNorm: An R package for coverage based normalization of Hi-C and capture Hi-C data

Orchestrating chromosome conformation capture analysis with Bioconductor

gghic: A Versatile R Package for Exploring and Visualizing 3D Genome Organization

Identification of copy number variants in whole-genome data using Reference Coverage Profiles

mLiftOver: harmonizing data across Infinium DNA methylation platforms

epihet for intra-tumoral epigenetic heterogeneity analysis and visualization

epiTAD: a web application for visualizing high throughput chromosome conformation capture data in the context of genetic epidemiology

genomepy: genes and genomes at your fingertips

epiCOLOC: Integrating Large-Scale and Context-Dependent Epigenomics Features for Comprehensive Colocalization Analysis

Integrative Analysis of Histone ChIP‐seq and RNA‐seq Data

NucTools: analysis of chromatin feature occupancy profiles from high-throughput sequencing data

3t-seq: automatic gene expression analysis of single-copy genes, transposable elements, and tRNAs from RNA-seq data

Scater: pre-processing, quality control, normalization and visualization of single-cell RNA-seq data in R