Geometric Transformers for Protein Interface Contact Prediction

Alex Morehead, Chen Chen, Jianlin Cheng
DOI: https://doi.org/10.48550/arXiv.2110.02423
2022-03-05
Abstract: Computational methods for predicting the interface contacts between proteins come highly sought after for drug discovery as they can significantly advance the accuracy of alternative approaches, such as protein-protein docking, protein function analysis tools, and other computational methods for protein bioinformatics. In this work, we present the Geometric Transformer, a novel geometry-evolving graph transformer for rotation and translation-invariant protein interface contact prediction, packaged within DeepInteract, an end-to-end prediction pipeline. DeepInteract predicts partner-specific protein interface contacts (i.e., inter-protein residue-residue contacts) given the 3D tertiary structures of two proteins as input. In rigorous benchmarks, DeepInteract, on challenging protein complex targets from the 13th and 14th CASP-CAPRI experiments as well as Docking Benchmark 5, achieves 14% and 1.1% top L/5 precision (L: length of a protein unit in a complex), respectively. In doing so, DeepInteract, with the Geometric Transformer as its graph-based backbone, outperforms existing methods for interface contact prediction in addition to other graph-based neural network backbones compatible with DeepInteract, thereby validating the effectiveness of the Geometric Transformer for learning rich relational-geometric features for downstream tasks on 3D protein structures.
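The top L/5 precision quoted in the abstract can be illustrated with a minimal sketch. It assumes the model outputs a matrix of contact scores for every residue pair across the two chains, and that a binary matrix of true contacts is available. The function names, the use of floor division for L/5, and the minimum of one selected pair are illustrative assumptions, not the authors' evaluation code.

```python
def top_k_precision(scores, labels, k):
    """Fraction of the k highest-scoring residue pairs that are true contacts.

    scores[i][j]: predicted contact score for residue i of chain A and
                  residue j of chain B (hypothetical model output).
    labels[i][j]: 1 if the pair is a true interface contact, else 0.
    """
    if k <= 0:
        raise ValueError("k must be positive")
    # Flatten the score matrix into (score, i, j) triples and rank them
    # from the highest score to the lowest.
    ranked = sorted(
        ((scores[i][j], i, j)
         for i in range(len(scores))
         for j in range(len(scores[i]))),
        key=lambda triple: triple[0],
        reverse=True,
    )
    top = ranked[:k]
    hits = sum(labels[i][j] for _, i, j in top)
    return hits / len(top)


def top_l_over_5_precision(scores, labels, length):
    """Top L/5 precision, where `length` is L, the length of a protein unit.

    Assumption: L/5 is rounded down, and at least one pair is always scored.
    """
    k = max(1, length // 5)
    return top_k_precision(scores, labels, k)


if __name__ == "__main__":
    # Toy example: 2 residues in chain A, 3 residues in chain B.
    toy_scores = [[0.9, 0.1, 0.2],
                  [0.8, 0.3, 0.05]]
    toy_labels = [[1, 0, 0],
                  [0, 1, 0]]
    print(top_l_over_5_precision(toy_scores, toy_labels, length=10))
```

In the toy example, L = 10 selects the two highest-scoring pairs, of which one is a true contact. Which chain supplies L is left to the caller, since the abstract only says "a protein unit in a complex".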
Machine Learning, Biomolecules, Quantitative Methods
What problem does this paper attempt to address?
This paper addresses protein interface contact prediction: given the 3D tertiary structures of two proteins, predict which residue pairs across the two proteins are in contact when they bind to form a complex. The authors introduce the Geometric Transformer, a geometry-evolving graph transformer, for this purpose. The task matters for drug discovery because accurate interface contacts can significantly improve protein-protein docking, protein function analysis tools, and other computational methods in protein bioinformatics. The Geometric Transformer is packaged within DeepInteract, an end-to-end prediction pipeline, which the authors benchmark on challenging protein complex targets from the 13th and 14th CASP-CAPRI experiments and from Docking Benchmark 5. Because the Geometric Transformer is invariant to rotation and translation of the input 3D structures, its predictions do not depend on how the proteins are positioned or oriented in space. With it as its backbone, DeepInteract outperforms existing methods for interface contact prediction as well as other graph-based neural network backbones compatible with DeepInteract.
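What rotation and translation invariance means in practice can be shown with a small sketch. It uses pairwise distances between residue coordinates as one example of a feature that does not change under a rigid motion. This is only an illustration of the property: the abstract does not state which features the Geometric Transformer uses, and the function names and toy coordinates are hypothetical.

```python
import math


def pairwise_distances(coords):
    """Matrix of Euclidean distances between all pairs of 3D points."""
    return [[math.dist(p, q) for q in coords] for p in coords]


def rigid_transform(coords, angle, shift):
    """Rotate points about the z-axis by `angle` radians, then translate by `shift`."""
    c, s = math.cos(angle), math.sin(angle)
    return [
        (c * x - s * y + shift[0], s * x + c * y + shift[1], z + shift[2])
        for x, y, z in coords
    ]


if __name__ == "__main__":
    # Toy coordinates standing in for three residues of one chain.
    chain = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 2.0, 0.5)]
    moved = rigid_transform(chain, angle=0.7, shift=(3.0, -1.0, 2.0))

    before = pairwise_distances(chain)
    after = pairwise_distances(moved)

    # The raw coordinates differ, but every pairwise distance is preserved
    # up to floating-point rounding.
    max_diff = max(
        abs(before[i][j] - after[i][j])
        for i in range(len(chain))
        for j in range(len(chain))
    )
    print(max_diff < 1e-9)
```

A model that consumes only quantities of this kind gives the same output however the input structure is placed in space, which is the property the abstract attributes to the Geometric Transformer.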