Towards anchoring evolutionary fitness for protein stability with virtual chemical environment recovery

Zhiwei Nie,Yiming Ma,Jie Chen,Yutian Liu,Zhihong Liu,Peng Yang,Fan Xu,Xiansong Huang,Feng Yin,Zigang Li,Jie Fu,Zhixiang Ren,Wen-Bin Zhang,Wen Gao,Yonghong Tian
DOI: https://doi.org/10.1101/2024.04.22.590665
2024-04-26
Abstract:Protein stability offers valuable insights into protein folding and functionality, making it an integral component of evolutionary fitness. Previous computational methods possess both strengths and weaknesses, leading to practical and inter-pretational limitations. Here, we propose an interpretable protein stability change prediction method, S3C, to anchor evolutionary fitness for protein stability with virtual chemical environment recovery. S3C first gets rid of the shackles of high-resolution protein structure data and restores the local chemical environments of the mutations at the sequence level. Subsequently, S3C promotes the evolutionary fitness of protein stability to dominate the fitness landscape under the selective pressure. Naturally, S3C comprehensively outperforms state-of-the-art methods on benchmark datasets while showing ideal generalization when migrated to unseen protein families. More importantly, S3C is demonstrated to be interpretable at multiple scales, including high-fidelity recovery of local structure micro-environment, perception of intricate interaction reconstruction, and accurate mining of rare beneficial mutations. S3C expands the boundaries of protein evolution prediction and provides an ideal candidate for large-scale optimization of protein engineering.
Bioinformatics
What problem does this paper attempt to address?