Krisp: A Python package to aid in the design of CRISPR and amplification-based diagnostic assays from whole genome sequencing data

Zachary S. L. Foster,Andrew S. Tupper,Caroline M. Press,Niklaus J. Grünwald
DOI: https://doi.org/10.1371/journal.pcbi.1012139
2024-05-22
PLoS Computational Biology
Abstract:Recent pandemics like COVID-19 highlighted the importance of rapidly developing diagnostics to detect evolving pathogens. CRISPR-Cas technology has recently been used to develop diagnostic assays for sequence-specific recognition of DNA or RNA. These assays have similar sensitivity to the gold standard qPCR but can be deployed as easy to use and inexpensive test strips. However, the discovery of diagnostic regions of a genome flanked by conserved regions where primers can be designed requires extensive bioinformatic analyses of genome sequences. We developed the Python package krisp to aid in the discovery of primers and diagnostic sequences that differentiate groups of samples from each other, using either unaligned genome sequences or a variant call format (VCF) file as input. Krisp has been optimized to handle large datasets by using efficient algorithms that run in near linear time, use minimal RAM, and leverage parallel processing when available. The validity of krisp results has been demonstrated in the laboratory with the successful design of a CRISPR diagnostic assay to distinguish the sudden oak death pathogen Phytophthora ramorum from closely related Phytophthora species. Krisp is released open source under a permissive license with all the documentation needed to quickly design CRISPR-Cas diagnostic assays. Pathogens continue to emerge at accelerated rates affecting animals, plants, and ecosystems. Rapid development of novel diagnostic tools is needed to monitor novel pathogen variants or groups. We developed the computational tool krisp to identify genetic regions suitable for development of CRISPR diagnostics and traditional amplification-based diagnostics such as PCR. Krisp scans whole genome sequence data for target and non-target groups to identify diagnostic regions based on DNA or RNA sequences. This computational tool has been validated using genome data for the sudden oak death pathogen Phytophthora ramorum . Krisp is released open source under a permissive license with all the documentation needed to quickly design CRISPR-Cas diagnostic assays and other amplification-based assays.
biochemical research methods,mathematical & computational biology
What problem does this paper attempt to address?