Scalable protein design using optimization in a relaxed sequence space

Christopher Frank,Ali Khoshouei,Lara Fuβ,Dominik Schiwietz,Dominik Putz,Lara Weber,Zhixuan Zhao,Motoyuki Hattori,Shihao Feng,Yosta de Stigter,Sergey Ovchinnikov,Hendrik Dietz
DOI: https://doi.org/10.1126/science.adq1741
IF: 56.9
2024-10-25
Science
Abstract:Machine learning (ML)–based design approaches have advanced the field of de novo protein design, with diffusion-based generative methods increasingly dominating protein design pipelines. Here, we report a "hallucination"-based protein design approach that functions in relaxed sequence space, enabling the efficient design of high-quality protein backbones over multiple scales and with broad scope of application without the need for any form of retraining. We experimentally produced and characterized more than 100 proteins. Three high-resolution crystal structures and two cryo–electron microscopy density maps of designed single-chain proteins comprising up to 1000 amino acids validate the accuracy of the method. Our pipeline can also be used to design synthetic protein-protein interactions, as validated experimentally by a set of protein heterodimers. Relaxed sequence optimization offers attractive performance with respect to designability, scope of applicability for different design problems, and scalability across protein sizes.
multidisciplinary sciences
What problem does this paper attempt to address?