magpie: A power evaluation method for differential RNA methylation analysis in N6-methyladenosine sequencing
Zhenxing Guo, Daoyu Duan, Wen Tang, Julia Zhu, William S. Bush, Liangliang Zhang, Xiaofeng Zhu, Fulai Jin, Hao Feng
DOI: https://doi.org/10.1371/journal.pcbi.1011875
2024-02-13
PLoS Computational Biology
Abstract: Recently, novel biotechnologies for quantifying RNA modifications have become increasingly popular among researchers who study the epitranscriptome. When studying RNA methylations such as N6-methyladenosine (m6A), researchers must make several experimental design decisions, especially the sample size needed to achieve adequate statistical power. Due to the complexity and high-throughput nature of m6A sequencing measurements, methods for power calculation and study design are currently unavailable. In this work, we propose a statistical power assessment tool, magpie, for power calculation and experimental design in epitranscriptome studies using m6A sequencing data. Our simulation-based tool borrows information from real pilot data and examines several influential factors, including sample size, sequencing depth, effect size, and basal expression range. We integrate two modules in magpie: (i) a flexible and realistic simulator module that synthesizes m6A sequencing data based on real data; and (ii) a power assessment module that examines a comprehensive set of evaluation metrics. Sample size and sequencing depth are two essential quantitative factors that must be determined before a high-throughput sequencing experiment in order to maximize statistical power under a limited budget. Due to the complex structure of data from m6A RNA methylation sequencing, analytical derivation of both quantities remains challenging in experimental design. In response to this challenge, we propose a simulation-based statistical framework, together with a user-friendly R/Bioconductor package, magpie, to comprehensively assess the power of differential m6A methylation detection at varied sample sizes, effect sizes, baseline expression levels, and sequencing depths. Using in silico synthetic data that closely mimic real data, magpie provides several major evaluation metrics to assist users in study design and statistical power evaluation.
biochemical research methods, mathematical & computational biology
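
The simulation-based power assessment described in the abstract can be illustrated with a minimal Python sketch. This is not magpie's implementation or API (magpie is an R/Bioconductor package), only a toy instance of the general idea: simulate data for two groups many times, apply a test, and report the fraction of simulations in which the difference is detected. Everything here is an assumption made for illustration: the function names (`simulate_ratios`, `detects_difference`, `estimate_power`), the beta-binomial model for a single region, the default parameter values, and the normal-approximation test on logit-transformed ratios, which is anti-conservative for very small groups.

```python
import math
import random
from statistics import NormalDist, mean, variance


def simulate_ratios(n, mean_level, depth, precision, rng):
    """Draw n per-sample methylation ratios for one hypothetical region.

    Assumed toy model: each sample's true level is Beta-distributed around
    mean_level, and the observed methylated read count is binomial given
    the sequencing depth.
    """
    a = mean_level * precision
    b = (1.0 - mean_level) * precision
    ratios = []
    for _ in range(n):
        p = rng.betavariate(a, b)
        methylated = sum(rng.random() < p for _ in range(depth))
        # Add-0.5 smoothing keeps the ratio strictly inside (0, 1).
        ratios.append((methylated + 0.5) / (depth + 1.0))
    return ratios


def detects_difference(x, y, alpha):
    """Two-sided z-test (normal approximation) on logit-transformed ratios."""
    lx = [math.log(r / (1.0 - r)) for r in x]
    ly = [math.log(r / (1.0 - r)) for r in y]
    se = math.sqrt(variance(lx) / len(lx) + variance(ly) / len(ly))
    if se == 0.0:
        return mean(lx) != mean(ly)
    z = (mean(lx) - mean(ly)) / se
    p_value = 2.0 * (1.0 - NormalDist().cdf(abs(z)))
    return p_value < alpha


def estimate_power(n_per_group, effect, depth, n_sim=500,
                   base_level=0.3, precision=20.0, alpha=0.05, seed=1):
    """Monte Carlo power: fraction of simulated experiments with a detection."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(n_sim):
        ctrl = simulate_ratios(n_per_group, base_level, depth, precision, rng)
        case = simulate_ratios(n_per_group, base_level + effect, depth,
                               precision, rng)
        hits += detects_difference(ctrl, case, alpha)
    return hits / n_sim


if __name__ == "__main__":
    # Compare estimated power across sample sizes and sequencing depths.
    for n in (2, 4, 8):
        for depth in (20, 100):
            print(n, depth, estimate_power(n, effect=0.2, depth=depth))
```

Varying `n_per_group`, `depth`, `effect`, and `base_level` in the final loop mirrors, in a much reduced form, the factors the abstract lists (sample size, sequencing depth, effect size, and baseline level). A real assessment would estimate the simulation parameters from pilot data and use a dedicated differential methylation test rather than the z-test assumed here.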