Improved prediction of bacterial CRISPRi guide efficiency from depletion screens through mixed-effect machine learning and data integration

Yanying Yu,Sandra Gawlitt,Lisa Barros de Andrade e Sousa,Erinc Merdivan,Marie Piraud,Chase L. Beisel,Lars Barquist
DOI: https://doi.org/10.1186/s13059-023-03153-y
IF: 17.906
2024-01-13
Genome Biology
Abstract:CRISPR interference (CRISPRi) is the leading technique to silence gene expression in bacteria; however, design rules remain poorly defined. We develop a best-in-class prediction algorithm for guide silencing efficiency by systematically investigating factors influencing guide depletion in genome-wide essentiality screens, with the surprising discovery that gene-specific features substantially impact prediction. We develop a mixed-effect random forest regression model that provides better estimates of guide efficiency. We further apply methods from explainable AI to extract interpretable design rules from the model. This study provides a blueprint for predictive models for CRISPR technologies where only indirect measurements of guide activity are available.
genetics & heredity,biotechnology & applied microbiology
What problem does this paper attempt to address?