Protein acetylation sites with complex-valued polynomial model

Wenzheng Bao,Bin Yang
DOI: https://doi.org/10.1007/s11704-023-2640-9
IF: 2.6688
2024-01-23
Frontiers of Computer Science
Abstract:Protein acetylation refers to a process of adding acetyl groups (CH3CO-) to lysine residues on protein chains. As one of the most commonly used protein post-translational modifications, lysine acetylation plays an important role in different organisms. In our study, we developed a human-specific method which uses a cascade classifier of complex-valued polynomial model (CVPM), combined with sequence and structural feature descriptors to solve the problem of imbalance between positive and negative samples. Complex-valued gene expression programming and differential evolution are utilized to search the optimal CVPM model. We also made a systematic and comprehensive analysis of the acetylation data and the prediction results. The performances of our proposed method aie 79.15% in S p , 78.17% in S n , 78.66% in ACC 78.76% in F 1, and 0.5733 in MCC , which performs better than other state-of-the-art methods.
computer science, information systems, theory & methods, software engineering
What problem does this paper attempt to address?