CysPresso: a classification model utilizing deep learning protein representations to predict recombinant expression of cysteine-dense peptides

Sébastien Ouellet, Larissa Ferguson, Angus Z. Lau, Tony K. Y. Lim
DOI: https://doi.org/10.1186/s12859-023-05327-8
IF: 3.307
2023-05-18
BMC Bioinformatics
Abstract: Cysteine-dense peptides (CDPs) are an attractive pharmaceutical scaffold that displays extreme biochemical properties, low immunogenicity, and the ability to bind targets with high affinity and selectivity. While many CDPs have potential and confirmed therapeutic uses, synthesis of CDPs is a challenge. Recent advances have made the recombinant expression of CDPs a viable alternative to chemical synthesis. Moreover, identifying CDPs that can be expressed in mammalian cells is crucial in predicting their compatibility with gene therapy and mRNA therapy. Currently, we lack the ability to identify CDPs that will express recombinantly in mammalian cells without labour-intensive experimentation. To address this, we developed CysPresso, a novel machine learning model that predicts recombinant expression of CDPs based on primary sequence.
biochemical research methods, biotechnology & applied microbiology, mathematical & computational biology