Advances in phage–host interaction prediction: in silico method enhances the development of phage therapies

Wanchun Nie, Tianyi Qiu, Yiwen Wei, Hao Ding, Zhixiang Guo, Jingxuan Qiu
DOI: https://doi.org/10.1093/bib/bbae117
IF: 9.5
2024-03-27
Briefings in Bioinformatics
Abstract: Phages can specifically recognize and kill bacteria, which lead to important application value of bacteriophage in bacterial identification and typing, livestock aquaculture and treatment of human bacterial infection. Considering the variety of human-infected bacteria and the continuous discovery of numerous pathogenic bacteria, screening suitable therapeutic phages that are capable of infecting pathogens from massive phage databases has been a principal step in phage therapy design. Experimental methods to identify phage–host interaction (PHI) are time-consuming and expensive; high-throughput computational method to predict PHI is therefore a potential substitute. Here, we systemically review bioinformatic methods for predicting PHI, introduce reference databases and in silico models applied in these methods and highlight the strengths and challenges of current tools. Finally, we discuss the application scope and future research direction of computational prediction methods, which contribute to the performance improvement of prediction models and the development of personalized phage therapy.
biochemical research methods, mathematical & computational biology
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to efficiently predict phage-host interactions (PHI) through computational methods, so as to promote the development of phage therapy. Specifically, the paper systematically reviews the existing bioinformatics methods, introduces the reference databases and computational models used in these methods, and emphasizes the advantages and challenges of current tools. It also discusses the application scope of computational prediction methods and future research directions, with the aim of improving the performance of prediction models and advancing personalized phage therapy.

### Background and the Importance of the Problem

Phages can specifically recognize and kill bacteria, so they have important application value in bacterial identification, animal husbandry, aquaculture, and the treatment of human bacterial infections. With the rapid emergence of drug-resistant bacteria, phage therapy is considered an effective means to treat drug-resistant bacterial infections. However, screening phages suitable for treatment is time-consuming and expensive. Traditional experimental methods such as plaque assays and liquid assays are accurate but inefficient. High-throughput computational methods have therefore become potential alternatives for predicting phage-host interactions.

### Main Contents of the Paper

1. **Review of Existing Methods**:
   - **Alignment-based Methods**: Identify potential shared regions by explicitly aligning the genomic sequences of phages and bacteria. Examples include the BLAST method and the CRISPR-based method.
   - **Alignment-free Methods**: Predict interactions by comparing the nucleotide or protein features of phage and bacterial genomes and using optimized machine-learning methods. Examples include abundance-based methods, k-mer frequency similarity methods based on nucleotide composition, and methods based on protein properties.
2. **Reference Databases and Tools**:
   - **NCBI RefSeq Genomic Database**: Contains 4,194 phage-prokaryote interaction records.
   - **Microbe Versus Phage Database**: Contains 26,572 virus cluster–9,245 prokaryote interaction data.
   - **Viral Host Range Database**: Contains 171,701 interaction records.
3. **Advantages and Challenges**:
   - **Advantages**: Computational methods can quickly screen out potential phage-host interactions, saving time and cost.
   - **Challenges**: Current methods still have deficiencies in prediction accuracy, sensitivity, and application scope, especially when dealing with short-fragment genomic data.
4. **Future Research Directions**:
   - **Improve the Performance of Prediction Models**: Develop more complex machine-learning models and conduct comprehensive analysis by combining multiple features (such as nucleotide composition and protein properties).
   - **Personalized Phage Therapy**: Use computational methods to screen out phages against specific pathogenic bacteria to achieve personalized treatment.

### Conclusion

By systematically reviewing and analyzing the existing computational methods, the paper points out the advantages and challenges in the current field of phage-host interaction prediction and proposes future research directions. This not only helps to improve the performance of prediction models but also provides theoretical and technical support for the development of personalized phage therapy.
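
### Illustrative Sketch: k-mer Frequency Similarity

To make the alignment-free idea from the review of existing methods more concrete, the following is a minimal Python sketch of k-mer frequency similarity based on nucleotide composition. It is not taken from the paper or from any specific tool. The function names (`kmer_profile`, `cosine_similarity`, `rank_hosts`), the choice of `k=4`, the use of cosine similarity as the comparison measure, and the toy sequences are all assumptions made for illustration. Published tools may use different distance measures, background corrections, and machine-learning models on top of such features.

```python
from collections import Counter
from itertools import product
from math import sqrt


def kmer_profile(seq, k=4):
    """Return normalized frequencies of all 4**k DNA k-mers in seq.

    Windows containing characters other than A, C, G, T are skipped.
    """
    seq = seq.upper()
    counts = Counter(
        seq[i:i + k]
        for i in range(len(seq) - k + 1)
        if set(seq[i:i + k]) <= set("ACGT")
    )
    total = sum(counts.values())
    all_kmers = ("".join(p) for p in product("ACGT", repeat=k))
    return [counts[kmer] / total if total else 0.0 for kmer in all_kmers]


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length frequency vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def rank_hosts(phage_seq, host_seqs, k=4):
    """Rank candidate hosts by k-mer profile similarity to the phage.

    host_seqs maps a (hypothetical) host name to its genome sequence.
    Returns (name, score) pairs sorted from most to least similar.
    """
    phage = kmer_profile(phage_seq, k)
    scores = {
        name: cosine_similarity(phage, kmer_profile(seq, k))
        for name, seq in host_seqs.items()
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    # Toy sequences for illustration only, not real genomes.
    phage = "ATGATTATAGATTTAGATAATGATATTA"
    hosts = {
        "host_at_rich": "TTAGATAATGATTTAATAGATATTAGAT",
        "host_gc_rich": "GCGCCGGCGGCCGCGGCGCCGGCGCGCC",
    }
    for name, score in rank_hosts(phage, hosts, k=3):
        print(f"{name}\t{score:.3f}")
```

The underlying assumption is that a phage tends to share oligonucleotide usage with its host, so the host whose profile is closest to the phage profile becomes the top candidate. A score like this gives only a ranking signal and would need validation against known interactions from the reference databases listed above.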