Metapredict enables accurate disorder prediction across the Tree of Life

Jeffrey M Lotthammer,Jorge Hernandez-Garcia,Daniel Griffith,Dolf Weijers,Alex S Holehouse,Ryan J Emenecker
DOI: https://doi.org/10.1101/2024.11.05.622168
2024-11-07
Abstract:Intrinsically disordered proteins and protein regions (collectively IDRs) are critical in numerous cellular processes. To understand how IDRs facilitate function, we need tools to accurately and rapidly identify them from sequence. While many methods for disorder prediction exist, we are currently limited by throughput and accuracy for evolutionary scale analyses. To bridge this gap, we developed metapredict V3, an updated version of our disorder predictor that enables evolutionary-scale disorder prediction. Metapredict V3 enables proteome-scale prediction with state-of-the-art accuracy in seconds and was developed with a focus on usability. It is distributed as a web server, Python software package, command-line interface, and Google Colab notebook. Here, we leverage the accuracy and throughput of metapredict V3 to predict disorder for over 20,000 proteomes to evaluate the prevalence of disorder across the kingdoms of life.
Bioinformatics
What problem does this paper attempt to address?