ProsperousPlus: a One-Stop and Comprehensive Platform for Accurate Protease-Specific Substrate Cleavage Prediction and Machine-Learning Model Construction.

Fuyi Li, Cong Wang, Xudong Guo, Tatsuya Akutsu, Geoffrey I. Webb, Lachlan J. M. Coin, Lukasz Kurgan, Jiangning Song
DOI: https://doi.org/10.1093/bib/bbad372
IF: 9.5
2023-01-01
Briefings in Bioinformatics
Abstract: Proteases contribute to a broad spectrum of cellular functions. Given a relatively limited amount of experimental data, developing accurate sequence-based predictors of substrate cleavage sites facilitates a better understanding of protease functions and substrate specificity. While many protease-specific predictors of substrate cleavage sites have been developed, these efforts are outpaced by the growth of the protease substrate cleavage data. In particular, since data for 100+ protease types are available and this number continues to grow, it becomes impractical to publish predictors for new protease types, and instead it might be better to provide a computational platform that helps users to quickly and efficiently build predictors that address their specific needs. To this end, we conceptualized, developed, tested and released a versatile bioinformatics platform, ProsperousPlus, that empowers users, even those with little or no programming or bioinformatics background, to build fast and accurate predictors of substrate cleavage sites. ProsperousPlus facilitates the use of the rapidly accumulating substrate cleavage data to train, empirically assess and deploy predictive models for user-selected substrate types. Benchmarking on test datasets shows that our platform produces predictors that on average exceed the predictive performance of current state-of-the-art approaches. ProsperousPlus is available as a webserver and a stand-alone software package at http://prosperousplus.unimelb-biotools.cloud.edu.au/.