DLKcat cannot predict meaningful kcat values for mutants and unfamiliar enzymes

Alexander Kroll,Martin J Lercher
DOI: https://doi.org/10.1101/2023.02.06.526991
2024-07-02
Abstract:The recently published DLKcat model, a deep learning approach for predicting enzyme turnover numbers (kcat), claims to enable high-throughput kcat predictions for metabolic enzymes from any organism and to capture kcat changes for mutated enzymes. Here, we critically evaluate these claims. We show that DLKcat predictions become positively misleading for enzymes with less than 60% sequence identity to the training data, performing worse than simply assuming a mean kcat value for all reactions. Furthermore, DLKcat's ability to predict mutation effects is much weaker than implied, capturing only 3% of the experimentally observed variation across mutants not included in the training data. These findings highlight significant limitations in DLKcat's generalizability and its practical utility for predicting kcat values for novel enzyme families or mutants, which are crucial applications in fields such as metabolic modeling.
Bioinformatics
What problem does this paper attempt to address?