Abstract:Protein aggregation is a widespread phenomenon implicated in debilitating diseases like Alzheimer's, Parkinson's, and cataracts, presenting complex hurdles for the field of molecular biology. In this review, we explore the evolving realm of computational methods and bioinformatics tools that have revolutionized our comprehension of protein aggregation. Beginning with a discussion of the multifaceted challenges associated with understanding this process and emphasizing the critical need for precise predictive tools, we highlight how computational techniques have become indispensable for understanding protein aggregation. We focus on molecular simulations, notably molecular dynamics (MD) simulations, spanning from atomistic to coarse-grained levels, which have emerged as pivotal tools in unraveling the complex dynamics governing protein aggregation in diseases such as cataracts, Alzheimer's, and Parkinson's. MD simulations provide microscopic insights into protein interactions and the subtleties of aggregation pathways, with advanced techniques like replica exchange molecular dynamics, Metadynamics (MetaD), and umbrella sampling enhancing our understanding by probing intricate energy landscapes and transition states. We delve into specific applications of MD simulations, elucidating the chaperone mechanism underlying cataract formation using Markov state modeling and the intricate pathways and interactions driving the toxic aggregate formation in Alzheimer's and Parkinson's disease. Transitioning we highlight how computational techniques, including bioinformatics, sequence analysis, structural data, machine learning algorithms, and artificial intelligence have become indispensable for predicting protein aggregation propensity and locating aggregation-prone regions within protein sequences. Throughout our exploration, we underscore the symbiotic relationship between computational approaches and empirical data, which has paved the way for potential therapeutic strategies against protein aggregation-related diseases. In conclusion, this review offers a comprehensive overview of advanced computational methodologies and bioinformatics tools that have catalyzed breakthroughs in unraveling the molecular basis of protein aggregation, with significant implications for clinical interventions, standing at the intersection of computational biology and experimental research.

Massive experimental quantification of amyloid nucleation allows interpretable deep learning of protein aggregation

Experimental and Computational Protocols for Studies of Cross-Seeding Amyloid Assemblies.

Multiscale Exploration of Concentration-Dependent Amyloid-β(16-21) Amyloid Nucleation

AMYGNN: A Graph Convolutional Neural Network-Based Approach for Predicting Amyloid Formation from Polypeptides

Prediction of Amyloid Aggregation Rates by Machine Learning and Feature Selection

Prediction of Aggregation Prone Regions in Proteins Using Deep Neural Networks and Their Suppression by Computational Design

Massively parallel genetic perturbation reveals the energetic architecture of an amyloid beta nucleation reaction

Identification of a Novel Parallel Β‐strand Conformation Within Molecular Monolayer of Amyloid Peptide

iAmyP: A Multi-view Learning for Amyloidogenic Hexapeptides Identification Based on Sequence Least Squares Programming

What Does Evolution Tell Us About The Structure Of A Functional Amyloid Protein?

Enhancing protein aggregation prediction: a unified analysis leveraging graph convolutional networks and active learning

ECAmyloid: An Amyloid Predictor Based on Ensemble Learning and Comprehensive Sequence-derived Features

A generic approach to decipher the mechanistic pathway of heterogeneous protein aggregation kinetics

Computational studies of protein aggregation mediated by amyloid: Fibril elongation and secondary nucleation

Resolving the Amino Acid Sequence of Aβ1‐42 at the Single‐Residue Level Using Subnanopores in Ultrathin Films

Insights into the variability of nucleated amyloid polymerization by a minimalistic model of stochastic protein assembly

Fundamentals and exploration of aggregation-induced emission molecules for amyloid protein aggregation

Capturing the Conformational Heterogeneity of the Β-Amyloid Peptide Sequence Using the Engineered Aerolysin Confinement

Machine learning quantification of Amyloid-β deposits in the temporal lobe of 131 brain bank cases

Advanced computational approaches to understand protein aggregation

Generative AI unlocks PET insights: brain amyloid dynamics and quantification