Enhancing CRISPR-Cas9 gRNA efficiency prediction by data integration and deep learning
Xi Xiang,Giulia I Corsi,Christian Anthon,Kunli Qu,Xiaoguang Pan,Xue Liang,Peng Han,Zhanying Dong,Lijun Liu,Jiayan Zhong,Tao Ma,Jinbao Wang,Xiuqing Zhang,Hui Jiang,Fengping Xu,Xin Liu,Xun Xu,Jian Wang,Huanming Yang,Lars Bolund,George M Church,Lin Lin,Jan Gorodkin,Yonglun Luo
DOI: https://doi.org/10.1038/s41467-021-23576-0
2021-05-28
Abstract:The design of CRISPR gRNAs requires accurate on-target efficiency predictions, which demand high-quality gRNA activity data and efficient modeling. To advance, we here report on the generation of on-target gRNA activity data for 10,592 SpCas9 gRNAs. Integrating these with complementary published data, we train a deep learning model, CRISPRon, on 23,902 gRNAs. Compared to existing tools, CRISPRon exhibits significantly higher prediction performances on four test datasets not overlapping with training data used for the development of these tools. Furthermore, we present an interactive gRNA design webserver based on the CRISPRon standalone software, both available via https://rth.dk/resources/crispr/ . CRISPRon advances CRISPR applications by providing more accurate gRNA efficiency predictions than the existing tools.