EnzymeNet: Residual Neural Networks model for Enzyme Commission number prediction

Naoki Watanabe,Masaki Yamamoto,Masahiro Murata,Yuki Kuriya,Michihiro Araki
DOI: https://doi.org/10.1093/bioadv/vbad173
2023-11-24
Bioinformatics Advances
Abstract:Abstract Motivation Enzymes are key targets to biosynthesize functional substances in metabolic engineering. Therefore, various machine learning models have been developed to predict Enzyme Commission numbers, one of the enzyme annotations. However, the previously reported models predict the sequences with numerous consecutive identical amino acids, which are found within unannotated sequences, as enzymes. Results Here, we propose EnzymeNet for prediction of complete Enzyme Commission numbers using Residual Neural Networks. EnzymeNet can exclude the exceptional sequences described above. Several EnzymeNet models were built and optimized to explore the best conditions for removing such sequences. As a result, the models exhibited higher prediction accuracy with Macro F1 score up to 0.850 than previously reported models. Moreover, even the enzyme sequences with low similarity to training data, which were difficult to predict using the reported models, could be predicted extensively using EnzymeNet models. The robustness of EnzymeNet models will lead to discover novel enzymes for biosynthesis of functional compounds using microorganisms. Availability The source code of EnzymeNet models is freely available at and https://github.com/nwatanbe/enzymenet. Supplementary information Supplementary data are available at Bioinformatics Advances online.
English Else
What problem does this paper attempt to address?