Disentangling the CHAOS of intrinsic disorder in human proteins

Ida de Vries,Jitske Bak,Daniel Alvarez Salmoral,Ren Xie,Razvan Borza,Maria Konijnenberg,Anastassis Perrakis
DOI: https://doi.org/10.1101/2024.10.26.620428
2024-10-29
Abstract:Most proteins consist of both folded domains and Intrinsically Disordered Regions (IDRs). However, the widespread occurrence of intrinsic disorder in human proteins, along with its characteristics, is often overlooked by the broader communities of structural and molecular biologists. Building on the MobiDB database of intrinsically disorder in proteins, here we develop a comprehensive dataset (Comprehensive analysis of Human proteins And their disOrdered Segments (CHAOS)). We implement empirical internally consistent definitions of what constitutes a disordered region, annotate general characteristics such as cellular location, essentiality, and post-translational modifications, and cross-reference to structure predictions from AlphaFold. Most proteins contain at least one disordered region, predominantly located at the protein termini. IDRs are less hydrophobic and are enriched in post-translational modifications compared to non-IDRs. Additionally, we discovered that proteins residing in different cellular locations possess distinct disorder profiles. Finally, the predicted AlphaFold models of proteins in CHAOS suggest that while protein disorder may be intrinsic, it does not have to be extrinsic. Hereby we enhance the visibility and understanding of intrinsic disorder in human proteins.
Bioinformatics
What problem does this paper attempt to address?