LazyAF, a pipeline for accessible medium-scale prediction of protein-protein interactions

Thomas C. McLean
DOI: https://doi.org/10.1101/2024.01.29.577767
2024-02-01
Abstract:Artificial intelligence has revolutionized the field of protein structure prediction. However, with more powerful and complex software being developed, it is accessibility and ease of use rather than capability that is quickly becoming a limiting factor to end users. Here, I present a Google Colaboratory-based pipeline, named LazyAF, which integrates the existing ColabFold BATCH to streamline the process of medium-scale protein-protein interaction prediction. I apply LazyAF to predict the interactome of the 76 proteins encoded on a broad-host-range multi-drug resistance plasmid RK2, demonstrating the ease and accessibility the pipeline provides.
Bioinformatics
What problem does this paper attempt to address?
The problem addressed in this paper is how to simplify and enhance the accessibility and efficiency of medium-scale protein-protein interaction (PPI) prediction, especially for researchers with limited bioinformatics experience. The authors have developed a pipeline called LazyAF, which is built on Google Colaboratory and integrates the existing ColabFold BATCH tool. LazyAF aims to automate the process of predicting interactions between proteins, enabling researchers to easily handle medium-scale datasets without the need for complex high-performance computing clusters or advanced computer skills. Through this pipeline, researchers can input a "bait" protein and a series of "candidate" protein sequences, and the system will automatically run the AlphaFold2-Multimer prediction, evaluate all possible interactions, and calculate ranking confidence scores based on predicted template modeling (pTM) and predicted interface template modeling (ipTM) scores. The system then generates a ranked list to determine high-probability protein-protein interaction pairs. The paper demonstrates the usability and capability of LazyAF by predicting the interaction network of 76 proteins encoded by the multidrug resistance plasmid RK2. Ultimately, LazyAF provides a user-friendly and cloud-hardware-supported platform that lowers the entry barrier for medium-scale PPI prediction and encourages more laboratory researchers to incorporate computational modeling into their workflows.