ACP-ESM: A novel framework for classification of anticancer peptides using protein-oriented transformer approach

Zeynep Hilal Kilimci,Mustafa Yalcin
2024-01-04
Abstract:Anticancer peptides (ACPs) are a class of molecules that have gained significant attention in the field of cancer research and therapy. ACPs are short chains of amino acids, the building blocks of proteins, and they possess the ability to selectively target and kill cancer cells. One of the key advantages of ACPs is their ability to selectively target cancer cells while sparing healthy cells to a greater extent. This selectivity is often attributed to differences in the surface properties of cancer cells compared to normal cells. That is why ACPs are being investigated as potential candidates for cancer therapy. ACPs may be used alone or in combination with other treatment modalities like chemotherapy and radiation therapy. While ACPs hold promise as a novel approach to cancer treatment, there are challenges to overcome, including optimizing their stability, improving selectivity, and enhancing their delivery to cancer cells, continuous increasing in number of peptide sequences, developing a reliable and precise prediction model. In this work, we propose an efficient transformer-based framework to identify anticancer peptides for by performing accurate a reliable and precise prediction model. For this purpose, four different transformer models, namely ESM, ProtBert, BioBERT, and SciBERT are employed to detect anticancer peptides from amino acid sequences. To demonstrate the contribution of the proposed framework, extensive experiments are carried on widely-used datasets in the literature, two versions of AntiCp2, cACP-DeepGram, ACP-740. Experiment results show the usage of proposed model enhances classification accuracy when compared to the state-of-the-art studies. The proposed framework, ESM, exhibits 96.45 of accuracy for AntiCp2 dataset, 97.66 of accuracy for cACP-DeepGram dataset, and 88.51 of accuracy for ACP-740 dataset, thence determining new state-of-the-art.
Biomolecules,Artificial Intelligence,Computational Engineering, Finance, and Science,Machine Learning
What problem does this paper attempt to address?
The paper attempts to address the problem of classifying anticancer peptides (ACPs) in cancer treatment. Specifically, the paper proposes a new protein-oriented transformer approach framework—ACP-ESM, for accurately identifying anticancer peptides from amino acid sequences. This framework utilizes four different transformer models: ESM, ProtBert, BioBERT, and SciBERT, and has been validated using extensive datasets such as AntiCp2, cACP-DeepGram, and ACP-740. Experimental results show that this framework significantly outperforms existing techniques in terms of classification accuracy. In particular, the ESM model achieved accuracies of 96.45%, 97.66%, and 88.51% on these datasets, respectively, thus establishing new benchmarks. The main contribution of the paper lies in introducing an efficient transformer framework, demonstrating the potential of deep learning techniques in this field, and showcasing its performance advantages through comparative analysis of multiple models. Furthermore, the robust performance of this framework on multiple commonly used datasets proves its generality and adaptability.