Abstract: Designing entirely new protein structures remains challenging because we do not fully understand the biophysical determinants of folding stability. Yet some protein folds are easier to design than others. Previous work identified the 43-residue αββα fold as especially challenging: the best designs had only a 2% success rate, compared to 39-87% success for other simple folds (1). This suggested the αββα fold would be a useful model system for gaining a deeper understanding of folding stability determinants and for testing new protein design methods. Here, we designed over ten thousand new αββα proteins and, using a high-throughput protease-based assay, found that over three thousand of them fold into stable structures. Nuclear magnetic resonance, hydrogen-deuterium exchange, circular dichroism, deep mutational scanning, and scrambled sequence control experiments indicated that our stable designs fold into their designed αββα structures with exceptional stability for their small size. Our large dataset enabled us to quantify the influence of universal stability determinants including nonpolar burial, helix capping, and buried unsatisfied polar atoms, as well as stability determinants unique to the αββα topology. Our work demonstrates how large-scale design and test cycles can solve challenging design problems while illuminating the biophysical determinants of folding.

Significance: Most computationally designed proteins fail to fold into their designed structures. This low success rate is a major obstacle to expanding the applications of protein design. In previous work, we discovered a small protein fold that was paradoxically challenging to design (only a 2% success rate) even though the fold itself is very simple. Here, we used a recently developed high-throughput approach to comprehensively examine the design rules for this simple fold. By designing over ten thousand proteins and experimentally measuring their folding stability, we discovered the key biophysical properties that determine the stability of these designs. Our results illustrate general lessons for protein design and also demonstrate how high-throughput stability studies can quantify the importance of different biophysical forces.

Protein superfolds are characterised as frustration-free topologies: A case study of pure parallel β-sheet topologies

Folding Lattice HP Model of Proteins Using the Bond-Fluctuation Model

The Folding Transition State of Protein L is Extensive with Nonnative Interactions (and Not Small and Polarized)

Folding Rate Optimization Promotes Frustrated Interactions in Entangled Protein Structures

Are Protein Folds Atypical?

Topological descriptions of protein folding

Protein folding ‐ seeing is deceiving

Folds from fold: Exploring topological isoforms of a single-domain protein

Probing Possible Downhill Folding: Native Contact Topology Likely Places a Significant Constraint on the Folding Cooperativity of Proteins with ∼40 Residues

The contact angle in inviscid fluid mechanics

Transferable coarse-grained potential for de novo protein folding and design

Dissecting the stability determinants of a challenging de novo protein fold using massively parallel design and experimentation

Frustration, function and folding

Sequence and structural patterns detected in entangled proteins reveal the importance of co-translational folding

Protein Folding: From Classical Issues to a New Perspective

Simple Models of the Protein Folding Problem

Theoretical Perspectives on Protein Folding

What is the Origin of Those Common Structures of Protein-Model Chains?

Protein folding, protein dynamics and the topology of self-motions

Single-molecule Force Spectroscopy Reveals a Mechanically Stable Protein Fold and the Rational Tuning of Its Mechanical Stability

Secondary structure determines protein topology.