Machine learning methods for predicting guide RNA effects in CRISPR epigenome editing experiments

Wancen Mu,Tianyou Luo,Alejandro Barrera,Lexi R. Bounds,Tyler S. Klann,Maria ter Weele,Julien Bryois,Gregory E. Crawford,Patrick F. Sullivan,Charles A. Gersbach,Michael I. Love,Yun Li
DOI: https://doi.org/10.1101/2024.04.18.590188
2024-04-19
Abstract:CRISPR epigenomic editing technologies enable functional interrogation of non-coding elements. However, current computational methods for guide RNA (gRNA) design do not effectively predict the power potential, molecular and cellular impact to optimize for efficient gRNAs, which are crucial for successful applications of these technologies. We present “launch-dCas9” (machine LeArning based UNified CompreHensive framework for CRISPR-dCas9) to predict gRNA impact from multiple perspectives, including cell fitness, wild-type abundance (gauging power potential), and gene expression in single cells. Our launch-dCas9, built and evaluated using experiments involving >1 million gRNAs targeted across the human genome, demonstrates relatively high prediction accuracy (AUC up to 0.81) and generalizes across cell lines. Method-prioritized top gRNA(s) are 4.6-fold more likely to exert effects, compared to other gRNAs in the same cis-regulatory region. Furthermore, launch-dCas9 identifies the most critical sequence-related features and functional annotations from >40 features considered. Our results establish launch-dCas9 as a promising approach to design gRNAs for CRISPR epigenomic experiments.
Bioinformatics
What problem does this paper attempt to address?