Sequence-based prediction of the solubility of peptides containing non-natural amino acids
Oeller,M.,Kang,R.,Bolt,H.,Gomes dos Santos,A.,Langborg Weinmann,A.,Nikitidis,A.,Zlatoidsky,P.,Su,W.,Czechtizky,W.,De Maria,L.,Sormanni,P.,Vendruscolo,M.
DOI: https://doi.org/10.1101/2023.03.03.530952
2023-03-04
bioRxiv
Abstract:Non-natural amino acids are increasingly used as building blocks in the development of peptide-based drugs, as they expand the available chemical space to tailor function, half-life and other key properties. However, while the chemical space of modified amino acids (mAAs) is potentially vast, experimental methods for measuring the developability properties of mAA-containing peptides are expensive and time consuming. To facilitate developability programs through computational methods, we present CamSol-PTM, a method that enables the fast and reliable sequence-based prediction of the solubility of mAA-containing peptides. From a computational screening of 50,000 mAA-containing variants of three peptides, we selected five different mAAs for a total number of 30 peptide variants for experimental validation. We demonstrate the accuracy of the predictions by comparing the calculated and experimental solubility values. Our results indicate that the computational screening of mAA-containing peptides can extend by over four orders of magnitude the ability to explore the solubility chemical space of peptides. This method is available as a web server at https://www-cohsoftware.ch.cam.ac.uk/index.php/camsolptm.