Accurate prediction of CRISPR editing outcomes in somatic cell lines and zygote with few-shot learning

Weizhong Zheng,Lu Yu,Guoliang Wang,Shaoxian Cao,Kenso Ho,Jun Song,Chen Cheng,Joshua W.K. Ho,Xueqing Liu,Meng Wu,Zhonghua Liu,Huili Wang,Pentao Liu,Guocheng Lan,Yuanhua Huang
DOI: https://doi.org/10.1101/2024.11.08.622621
2024-11-11
Abstract:The CRISPR-Cas system has revolutionized gene editing, while its outcome prediction remains unsatisfactory, especially in new cell states due to their distinct DNA repair preferences. In this study, we introduce inDecay, a flexible system for predicting CRISPR editing outcomes from target sequence, returning probabilities of nearly the full spectrum of indel events. Uniquely, inDecay utilizes informative and parameter-efficient features for each indel event and incorporates cell-type-specific repair preferences through a multi-stage design. While both inDecay and existing methods achieve accurate results for prediction within cell lines, only inDecay with transfer learning can retain the high performance for cross-cell line prediction. We then applied inDecay to mouse embryo editing using our newly generated data and observed remarkable accuracy by including as few as 30 fine-tuning embryonic samples. Notably, inDecay is the first software to predict embryonic editing. Therefore, our few-shot learning-supported system may accelerate guide RNA prioritization in mouse model generation, mammal embryonic gene editing, and cellular therapeutics.
Bioinformatics
What problem does this paper attempt to address?