canSAR 2024—an update to the public drug discovery knowledgebase

Phillip W Gingrich,Rezvan Chitsazi,Ansuman Biswas,Chunjie Jiang,Li Zhao,Joseph E Tym,Kevin M Brammer,Jun Li,Zhigang Shu,David S Maxwell,Jeffrey A Tacy,Ioan L Mica,Michael Darkoh,Patrizio di Micco,Kaitlyn P Russell,Paul Workman,Bissan Al-Lazikani
DOI: https://doi.org/10.1093/nar/gkae1050
IF: 14.9
2024-11-14
Nucleic Acids Research
Abstract:canSAR (https://cansar.ai) continues to serve as the largest publicly available platform for cancer-focused drug discovery and translational research. It integrates multidisciplinary data from disparate and otherwise siloed public data sources as well as data curated uniquely for canSAR. In addition, canSAR deploys a suite of curation and standardization tools together with AI algorithms to generate new knowledge from these integrated data to inform hypothesis generation. Here we report the latest updates to canSAR. As well as increasing available data, we provide enhancements to our algorithms to improve the offering to the user. Notably, our enhancements include a revised ligandability classifier leveraging Positive Unlabeled Learning that finds twice as many ligandable opportunities across the pocketome, and our revised chemical standardization pipeline and hierarchy better enables the aggregation of structurally related molecular records.
biochemistry & molecular biology
What problem does this paper attempt to address?