Peak calling by Sparse Enrichment Analysis for CUT&RUN chromatin profiling

Michael P. Meers, Dan Tenenbaum, Steven Henikoff
DOI: https://doi.org/10.1186/s13072-019-0287-4
2019-07-12
Abstract: Background: CUT&RUN is an efficient epigenome profiling method that identifies sites of DNA binding protein enrichment genome-wide with high signal to noise and low sequencing requirements. Currently, the analysis of CUT&RUN data is complicated by its exceptionally low background, which renders programs designed for analysis of ChIP-seq data vulnerable to oversensitivity in identifying sites of protein binding. Results: Here we introduce Sparse Enrichment Analysis for CUT&RUN (SEACR), an analysis strategy that uses the global distribution of background signal to calibrate a simple threshold for peak calling. SEACR discriminates between true and false-positive peaks with near-perfect specificity from "gold standard" CUT&RUN datasets and efficiently identifies enriched regions for several different protein targets. We also introduce a web server (http://seacr.fredhutch.org) for plug-and-play analysis with SEACR that facilitates maximum accessibility across users of all skill levels. Conclusions: SEACR is a highly selective peak caller that definitively validates the accuracy of CUT&RUN for datasets with known true negatives. Its ease of use and performance in comparison with existing peak calling strategies make it an ideal choice for analyzing CUT&RUN data.
genetics & heredity
What problem does this paper attempt to address?