Epigenomics coverage data extraction and aggregation in R with tidyCoverage

Jacques Serizay, Romain Koszul
DOI: https://doi.org/10.1093/bioinformatics/btae487
IF: 5.8
2024-07-29
Bioinformatics
Abstract: The tidyCoverage R package provides a framework for intuitive investigation of collections of genomic tracks over genomic features, relying on the principle of tidy data manipulation. It defines two data structures, the CoverageExperiment and AggregatedCoverage classes, which directly extend the fundamental SummarizedExperiment class, and introduces a principled approach to exploring genome-wide data. This infrastructure facilitates the extraction and manipulation of genomic coverage track data across individual or multiple sets of thousands of genomic loci. This allows the end user to rapidly visualize track coverage at individual genomic loci or aggregated coverage profiles over sets of genomic loci. tidyCoverage seamlessly combines with the existing Bioconductor ecosystem to accelerate the integration of genome-wide track data in epigenomic analysis workflows. tidyCoverage emerges as a valuable tool, contributing to the advancement of epigenomics research by promoting consistency, reproducibility, and accessibility in data analysis.
Availability: tidyCoverage is an R package freely available from Bioconductor ≥ 3.19 (https://www.bioconductor.org/packages/tidyCoverage) for R ≥ 4.4. The software is distributed under the MIT License and is accompanied by example files and data.
Supplementary information: Additional documentation is available from https://js2264.github.io/tidyCoverage/ and https://js2264.github.io/tidyCoverage/articles/tidyCoverage.html.
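The two operations the abstract describes, extracting track coverage over a set of loci and aggregating it into a single profile, can be illustrated with a minimal conceptual sketch. This is plain Python, not the tidyCoverage R API: the function names `extract_windows` and `aggregate_windows`, the toy track, and the feature centers are all hypothetical and chosen only for illustration.

```python
from statistics import mean


def extract_windows(coverage, centers, half_width):
    """Return one fixed-width coverage window per feature center.

    Centers whose window would run off either end of the track are skipped.
    """
    windows = []
    for c in centers:
        start, end = c - half_width, c + half_width + 1
        if start >= 0 and end <= len(coverage):
            windows.append(coverage[start:end])
    return windows


def aggregate_windows(windows):
    """Average the windows position by position to get an aggregate profile."""
    return [mean(column) for column in zip(*windows)]


# Hypothetical toy track: per-base coverage along a 12 bp sequence.
track = [0, 1, 2, 5, 2, 1, 0, 1, 3, 7, 3, 1]

# Hypothetical feature centers at positions 3 and 9, with 2 bp on each side.
windows = extract_windows(track, centers=[3, 9], half_width=2)
profile = aggregate_windows(windows)
```

Here `windows` plays the role of per-locus coverage (one row per feature) and `profile` the role of an aggregated coverage profile. The package itself works on real track files and genomic ranges and stores results in SummarizedExperiment-derived objects, which this sketch does not attempt to reproduce.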
biochemical research methods, biotechnology & applied microbiology, mathematical & computational biology