A general temperature-guided language model to design proteins of enhanced stability and activity
Fan Jiang,Mingchen Li,Jiajun Dong,Yuanxi Yu,Xinyu Sun,Banghao Wu,Jin Huang,Liqi Kang,Yufeng Pei,Liang Zhang,Shaojie Wang,Wenxue Xu,Jingyao Xin,Wanli Ouyang,Guisheng Fan,Lirong Zheng,Yang Tan,Zhiqiang Hu,Yi Xiong,Yan Feng,Guangyu Yang,Qian Liu,Jie Song,Jia Liu,Liang Hong,Pan Tan
DOI: https://doi.org/10.1126/sciadv.adr2641
IF: 13.6
2024-11-28
Science Advances
Abstract:Designing protein mutants with both high stability and activity is a critical yet challenging task in protein engineering. Here, we introduce PRIME, a deep learning model, which can suggest protein mutants with improved stability and activity without any prior experimental mutagenesis data for the specified protein. Leveraging temperature-aware language modeling, PRIME demonstrated superior predictive ability compared to current state-of-the-art models on the public mutagenesis dataset across 283 protein assays. Furthermore, we validated PRIME's predictions on five proteins, examining the impact of the top 30 to 45 single-site mutations on various protein properties, including thermal stability, antigen-antibody binding affinity, and the ability to polymerize nonnatural nucleic acid or resilience to extreme alkaline conditions. More than 30% of PRIME-recommended mutants exhibited superior performance compared to their premutation counterparts across all proteins and desired properties. We developed an efficient and effective method based on PRIME to rapidly obtain multisite mutants with enhanced activity and stability. Hence, PRIME demonstrates broad applicability in protein engineering.
multidisciplinary sciences