Abstract:Kinases play a central role in regulating cellular processes, making their study essential for understanding cellular function and disease mechanisms. To investigate the regulatory state of a kinase, numerous methods have been, and continue to be, developed to infer kinase activities from phosphoproteomics data. These methods usually rely on a set of kinase targets collected from various kinase-substrate libraries. However, only a small percentage of measured phosphorylation sites can usually be attributed to an upstream kinase in these libraries, limiting the scope of kinase activity inference. In addition, the inferred activities from different methods can vary making it crucial to evaluate them for accurate interpretation. Here, we present a comprehensive evaluation of kinase activity inference methods using multiple kinase-substrate libraries combined with different inference algorithms. Additionally, we try to overcome the coverage limitations for measured targets in kinase substrate libraries by adding predicted kinase-substrate interactions for activity inference. For the evaluation, in addition to classical cell-based perturbation experiments, we introduce a tumor-based benchmarking approach that utilizes multi-omics data to identify highly active or inactive kinases per tumor type. We show that while most computational algorithms perform comparably regardless of their complexity, the choice of kinase-substrate library can highly impact the inferred kinase activities. Hereby, manually curated libraries, particularly PhosphoSitePlus, demonstrate superior performance in recapitulating kinase activities from phosphoproteomics data. Additionally, in the tumor-based evaluation, adding predicted targets from NetworKIN further boosts the performance, while normalizing sites to host protein levels reduces kinase activity inference performance. We then showcase how kinase activity inference can help in characterizing the response to kinase inhibitors in different cell lines. Overall, the selection of reliable kinase activity inference methods is important in identifying deregulated kinases and novel drug targets. Finally, to facilitate the evaluation of novel methods in the future, we provide both benchmarking approaches in the R package benchmarKIN.

Improving the Performance of Protein Kinase Identification Via High Dimensional Protein-Protein Interactions and Substrate Structure Data

Identifying Human Kinase-Specific Protein Phosphorylation Sites By Integrating Heterogeneous Information From Various Sources

Phosphopredict: A Bioinformatics Tool for Prediction of Human Kinase-Specific Phosphorylation Substrates and Sites by Integrating Heterogeneous Feature Selection

A novel algorithm for identifying protein kinases associated with phosphorylation sites based on Bayesian decision theory

A Novel Phosphorylation Site-Kinase Network-Based Method for the Accurate Prediction of Kinase-Substrate Relationships.

PKIS: computational identification of protein kinases for experimentally discovered protein phosphorylation sites

Ksrmkl: a Novel Method for Identification of Kinase–substrate Relationships Using Multiple Kernel Learning

Psphos: Pk-Specific Phosphorylation Site Prediction Using Profile Svm

Sequence-based machine learning method for predicting the effects of phosphorylation on protein-protein interactions

PhosD: Inferring Kinase-Substrate Interactions Based on Protein Domains.

Comprehensive evaluation of phosphoproteomic-based kinase activity inference

Data-driven extraction of human kinase-substrate relationships from omics datasets

Quantitative phosphoproteomics-based molecular network description for high-resolution kinase-substrate interactome analysis

Improvement Of The Quantification Accuracy And Throughput For Phosphoproteome Analysis By A Pseudo Triplex Stable Isotope Dimethyl Labeling Approach

Phosphorylated Protein Chip Combined with Artificial Intelligence Tools for Precise Drug Screening

PKSPS: a Novel Method for Predicting Kinase of Specific Phosphorylation Sites Based on Maximum Weighted Bipartite Matching Algorithm and Phosphorylation Sequence Enrichment Analysis.

Prediction of kinase-specific phosphorylation sites with sequence features by a log-odds ratio approach.

Kinase Identification with Supervised Laplacian Regularized Least Squares

Inferring the Sign of Kinase-Substrate Interactions by Combining Quantitative Phosphoproteomics with a Literature-Based Mammalian Kinome Network

Prediction of Kinase-Substrate Relations Based on Heterogeneous Networks