PPIscreenML: Structure-based screening for protein-protein interactions using AlphaFold

Victoria Mischley,Johannes Maier,Jesse Chen,John Karanicolas
DOI: https://doi.org/10.1101/2024.03.16.585347
2024-04-30
Abstract:Protein-protein interactions underlie nearly all cellular processes. With the advent of protein structure prediction methods such as AlphaFold2 (AF2), models of specific protein pairs can be built extremely accurately in most cases. However, determining the relevance of a given protein pair remains an open question. It is presently unclear how to use best structure-based tools to infer whether a pair of candidate proteins indeed interact with one another: ideally, one might even use such information to screen amongst candidate pairings to build up protein interaction networks. Whereas methods for evaluating quality of modeled protein complexes have been co-opted for determining which pairings interact (e.g., pDockQ and iPTM), there have been no rigorously benchmarked methods for this task. Here we introduce PPIscreenML, a classification model trained to distinguish AF2 models of interacting protein pairs from AF2 models of compelling decoy pairings. We find that PPIscreenML out-performs methods such as pDockQ and iPTM for this task, and further that PPIscreenML exhibits impressive performance when identifying which ligand/receptor pairings engage one another across the structurally conserved tumor necrosis factor superfamily (TNFSF). Analysis of benchmark results using complexes not seen in PPIscreenML development strongly suggest that the model generalizes beyond training data, making it broadly applicable for identifying new protein complexes based on structural models built with AF2.
Bioinformatics
What problem does this paper attempt to address?
The problem this paper attempts to address is the prediction and screening of protein-protein interactions (PPI). Specifically, the paper introduces a new method called PPIscreenML, which utilizes structure models generated by AlphaFold2 to distinguish between truly interacting protein pairs and non-interacting protein pairs. Current methods such as pDockQ and iPTM perform unsatisfactorily on this task, thus the authors developed PPIscreenML to improve the accuracy of identifying protein interaction pairs. Additionally, the paper demonstrates the application of PPIscreenML in the tumor necrosis factor superfamily (TNFSF), proving its ability to accurately identify selective ligand/receptor pairs. Overall, PPIscreenML aims to provide a more reliable and efficient tool for structure-based protein interaction screening.