Prediction of CRISPR/Cas9 Off-Target Activities with Mismatches and Indels Using Stacked BiGRU

Guishan Zhang,Ye Luo,Huanzeng Xie,Zhiming Dai
DOI: https://doi.org/10.3390/ijms252010945
IF: 5.6
2024-10-11
International Journal of Molecular Sciences
Abstract:CRISPR/Cas9 is a popular genome editing technology, yet its clinical application is hindered by off-target effects. Many deep learning-based methods are available for off-target prediction. However, few can predict off-target activities with insertions or deletions (indels) between single guide RNA and DNA sequence pairs. Additionally, the analysis of off-target data is challenged due to a data imbalance issue. Moreover, the prediction accuracy and interpretability remain to be improved. Here, we introduce a deep learning-based framework, named Crispr-SGRU, to predict off-target activities with mismatches and indels. This model is based on Inception and stacked BiGRU. It adopts a dice loss function to solve the inherent imbalance issue. Experimental results show our model outperforms existing methods for off-target prediction in terms of accuracy and robustness. Finally, we study the interpretability of this model through Deep SHAP and teacher–student-based knowledge distillation, and find it can provide meaningful explanations for sequence patterns regarding off-target activity.
biochemistry & molecular biology,chemistry, multidisciplinary
What problem does this paper attempt to address?