SCREEN: a graph-based contrastive learning tool to infer catalytic residues and assess mutation tolerance in enzymes

Tong Pan,Yue Bi,Xiaoyu Wang,Ying Zhang,Geoffrey I Webb,Robin Gasser,Lukasz Kurgan,Jiangning Song
DOI: https://doi.org/10.1101/2024.06.27.601004
2024-07-02
Abstract:The accurate identification of catalytic residues contributes to our understanding of enzyme functions in biological processes and pathways. The increasing number of protein sequences necessitates computational tools for the automated prediction of catalytic residues in enzymes. Here, we introduce SCREEN, a graph neural network for the high-throughput prediction of catalytic residues via the integration of enzyme functional and structural information. SCREEN constructs residue representations based on spatial arrangements and incorporates enzyme function priors into such representations through contrastive learning. We demonstrate that SCREEN (i) consistently outperforms currently available predictors; (ii) provides accurate results when applied to inferred enzyme structures; and (iii) generalizes well to enzymes dissimilar from those in the training set. We also show that the putative catalytic residues predicted by SCREEN mimic key structural and biophysical characteristics of native catalytic residues. Moreover, using experimental data sets, we show that SCREENs predictions can be used to distinguish residues with a high mutation tolerance from those likely to cause functional loss when mutated, indicating that this tool might be used to infer disease-associated mutations.
Bioinformatics
What problem does this paper attempt to address?
This paper introduces a new tool called SCREEN, which is based on graph neural networks and contrastive learning methods to predict catalytic residues in enzymes and evaluate mutation tolerance. SCREEN integrates the functional and structural information of enzymes to construct residue representations and incorporates functional prior knowledge into these representations using contrastive learning. The study demonstrates that SCREEN outperforms existing tools in predicting catalytic residues, even when dealing with inferred enzyme structures or enzymes with significant differences from the training set, showing good generalization ability. Additionally, the predictions of SCREEN can distinguish residues with high mutation tolerance from those that may result in loss of function, which may help identify mutations related to diseases. Therefore, SCREEN provides a powerful computational tool for automating the prediction of enzyme catalysis and understanding its function.