A data-adaptive methods in detecting exogenous methyltransferase accessible chromatin in human genome using nanopore sequencing

Kailing Tu,Xuemei Li,Qilin Zhang,Wei Huang,Dan Xie
DOI: https://doi.org/10.1093/bioinformatics/btae206
IF: 5.8
2024-04-13
Bioinformatics
Abstract:Abstract Motivation Identifying chromatin accessibility is one of the key steps in studying the regulation of eukaryotic genomes. The combination of exogenous methyltransferase and nanopore sequencing provides an strategy to identify open chromatin over long genomic ranges at the single-molecule scale. However, endogenous methylation, non-open-chromatin-specific exogenous methylation and base-calling errors limit the accuracy and hinders its application to complex genomes. Results We systematically evaluated the impact of these three influence factors, and developed a model-based computational method, methyltransferase accessible genome region finder(MAGNIFIER), to address the issues. By incorporating control data, MAGNIFIER attenuates the three influence factors with data-adaptive comparison strategy. We demonstrate that MAGNIFIER is not only sensitive to identify the open chromatin with much improved accuracy, but also able to detect the chromatin accessibility of repetitive regions that are missed by NGS-based methods. By incorporating long-read RNA-seq data, we revealed the association between the accessible Alu elements and non-classic gene isoforms. Availability Freely avaliable on web at https://github.com/Goatofmountain/MAGNIFIER Supplementary information Supplementary data are available at Bioinformatics online.
biochemical research methods,biotechnology & applied microbiology,mathematical & computational biology
What problem does this paper attempt to address?