Abstract:Amyloids are insoluble fibrillar protein aggregates. While they are commonly found in human diseases, it is becoming increasingly clear that this type of structure is essential for a range of biological functions, with prominent examples in organisms ranging from bacteria to human. Functional amyloid and pathogenic amyloid share similar physical and chemical properties. Unlike pathological amyloids, however, the structures of functional amyloids are formed by polypeptide sequences whose amyloid structure has been under a positive evolutionary selection pressure. This important distinction provides us with an opportunity to obtain structural insights from an unexpected source: the covariation of amino acids among sequences within the same family of a functional amyloid protein. There is a long history for the idea of using coevolution for molecular structure prediction, but recent growth in sequence databases and new, efficient algorithms to disentangle indirect couplings in a network, have dramatically improved our ability to predict residue-residue contacts. We used recently developed sequence analysis methods (EVcoupling, PSICOV and GREMLIN) to extract distance restraints from a multiple sequence alignment of a functional amyloid protein. Together with an efficient force field, these restraints allow us to determine atomic resolution structural models. We find that the protein forms a beta-helical structure, where each turn corresponds to previously identified repeat sequences. The proposed structure is validated by previously published solid-state NMR, electron microscopy and X-ray diffraction data, and confirms an earlier proposed model derived by complementary means. To our knowledge, this is the first time the analysis of correlated mutations and computer simulations have been used together to study the structure of a functional amyloid. The current study therefore serves as a probe into the potential applicability of the approach in this domain.

PDB_Amyloid: The Extended Live Amyloid Structure List from the PDB

On the Border of the Amyloidogenic Sequences: Prefix Analysis of the Parallel Beta Sheets in the PDB\_Amyloid Collection

What Does Evolution Tell Us About The Structure Of A Functional Amyloid Protein?

Identification of a Novel Parallel Β‐strand Conformation Within Molecular Monolayer of Amyloid Peptide

Structure of a Functional Amyloid Protein Subunit Computed Using Sequence Variation.

CPAD, Curated Protein Aggregation Database: A Repository of Manually Curated Experimental Data on Protein and Peptide Aggregation

RCSB Protein Data Bank (RCSB.org): delivery of experimentally-determined PDB structures alongside one million computed structure models of proteins from artificial intelligence/machine learning

Half a Century of Amyloids: Past, Present and Future

RCSB Protein Data Bank: powerful new tools for exploring 3D structures of biological macromolecules for basic and applied research and education in fundamental biology, biomedicine, biotechnology, bioengineering and energy sciences

Protein Data Bank: the single global archive for 3D macromolecular structure data

AmyloComp: a bioinformatic tool for prediction of amyloid co-aggregation

PDBlocal: A Web-Based Tool for Local Inspection of Biological Macromolecular 3D Structures

Facilities that make the PDB data collection more powerful

Beyond history and “on a roll”: The list of the most well‐studied human protein structures and overall trends in the protein data bank

RCSB Protein Data Bank: biological macromolecular structures enabling research and education in fundamental biology, biomedicine, biotechnology and energy

RCSB Protein Data Bank: Tools for Visualizing and Understanding Biological Macromolecules in 3D.

AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models

A series of PDB-related databanks for everyday needs

RCSB Protein Data Bank: supporting research and education worldwide through explorations of experimentally determined and computationally predicted atomic level 3D biostructures

RCSB Protein Data Bank: Enabling biomedical research and drug discovery