peptidy: A light-weight Python library for peptide representation in machine learning

Rıza Özçelik, Laura van Weesep, Sarah de Ruiter, Francesca Grisoni
DOI: https://doi.org/10.26434/chemrxiv-2024-bm3lv
2024-04-22
Abstract: In this work, we introduce peptidy, a lightweight Python library that facilitates converting peptides (expressed as amino acid sequences) to numerical representations suited to machine learning. peptidy is free from external dependencies, integrates seamlessly into modern Python environments, and supports a range of encoding strategies suitable for both predictive and generative machine learning approaches. Additionally, peptidy supports peptides with post-translational modifications, such as phosphorylation, acetylation, and methylation, thereby extending the functionality of existing Python packages for peptides and proteins. peptidy is freely available with a permissive license on GitHub at the following URL: https://github.com/molML/peptidy
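To make "converting peptides to numerical representations" concrete, the following is a minimal, dependency-free sketch of one common encoding strategy, one-hot encoding with zero padding. It is an illustration only: the names AMINO_ACIDS, INDEX, and one_hot_encode are hypothetical and are not taken from peptidy's API, and the sketch does not handle post-translational modifications.

```python
# Illustrative sketch only: these names are hypothetical, not peptidy's API.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 canonical residues, one-letter codes
INDEX = {aa: i for i, aa in enumerate(AMINO_ACIDS)}


def one_hot_encode(peptide, max_length):
    """Return a max_length x 20 nested list; padding rows are all zeros."""
    if len(peptide) > max_length:
        raise ValueError("peptide is longer than max_length")
    rows = []
    for residue in peptide:
        if residue not in INDEX:
            raise ValueError(f"unknown residue: {residue!r}")
        row = [0] * len(AMINO_ACIDS)
        row[INDEX[residue]] = 1  # mark the position of this residue
        rows.append(row)
    # Pad with all-zero rows so every peptide has the same shape.
    rows.extend([0] * len(AMINO_ACIDS) for _ in range(max_length - len(peptide)))
    return rows


encoded = one_hot_encode("ACDK", max_length=6)
```

A fixed output shape of this kind is what most predictive models expect as input; the library itself should be consulted for the encodings it actually provides.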
Chemistry