DeLA-DrugSelf: Empowering multi-objective de novo design through SELFIES molecular representation

Domenico Alberga,Giuseppe Lamanna,Giovanni Graziano,Pietro Delre,Maria Cristina Lomuscio,Nicola Corriero,Alessia Ligresti,Dritan Siliqi,Michele Saviano,Marialessandra Contino,Angela Stefanachi,Giuseppe Felice Mangiatordi
DOI: https://doi.org/10.1016/j.compbiomed.2024.108486
IF: 7.7
2024-04-24
Computers in Biology and Medicine
Abstract:In this paper, we introduce DeLA-DrugSelf , an upgraded version of DeLA-Drug [J. Chem. Inf. Model. 62 (2022) 1411–1424], which incorporates essential advancements for automated multi-objective de novo design. Unlike its predecessor, which relies on SMILES notation for molecular representation, DeLA-DrugSelf employs a novel and robust molecular representation string named SELFIES (SELF-referencing Embedded String). The generation process in DeLA-DrugSelf not only involves substitutions to the initial string representing the starting query molecule but also incorporates insertions and deletions. This enhancement makes DeLA-DrugSelf significantly more adept at executing data-driven scaffold decoration and lead optimization strategies. Remarkably, DeLA-DrugSelf explicitly addresses the SELFIES-related collapse issue, considering only collapse-free compounds during generation. These compounds undergo a rigorous quality metrics evaluation, highlighting substantial advancements in terms of drug-likeness, uniqueness, and novelty compared to the molecules generated by the previous version of the algorithm. To evaluate the potential of DeLA-DrugSelf as a mutational operator within a genetic algorithm framework for multi-objective optimization, we employed a fitness function based on Pareto dominance. Our objectives focused on target-oriented properties aimed at optimizing known cannabinoid receptor 2 (CB2R) ligands. The results obtained indicate that DeLA-DrugSelf, available as a user-friendly web platform ( https://www.ba.ic.cnr.it/softwareic/delaself/ ), can effectively contribute to the data-driven optimization of starting bioactive molecules based on user-defined parameters.
engineering, biomedical,computer science, interdisciplinary applications,mathematical & computational biology,biology
What problem does this paper attempt to address?