LLMs predict protein phases

Arunima Singh
DOI: https://doi.org/10.1038/s41592-024-02421-4
IF: 48
2024-09-11
Nature Methods
Abstract:Proteins normally exist in soluble form but can undergo a phase transition to a dense liquid phase via liquid–liquid phase separation (LLPS) or to solid aggregates such as amyloids. These protein phase transitions (PPTs) have important functions: while droplet formation is often reversible and involved in cellular processes such as transcription regulation, genome organization and postsynaptic signaling, formation of solid aggregates is typically irreversible and implicated in neurogenerative diseases such as Alzheimer's and Parkinson's diseases. The ability to phase transition is believed to be encoded within a protein's sequence, and several methods based on classical machine learning that take various knowledge-based features into consideration have been developed. Most modeling methods, however, focus on the prediction of either LLPS or amyloid formation. The research group of Mark Gerstein at Yale university wanted to investigate whether a unified framework could predict propensities for both types of phase transition.
biochemical research methods
What problem does this paper attempt to address?