Identification of structural features in chemicals associated with cancer drug response: a systematic data-driven analysis

Suleiman A Khan,Seppo Virtanen,Olli P Kallioniemi,Krister Wennerberg,Antti Poso,Samuel Kaski,Suleiman A. Khan,Olli P. Kallioniemi
DOI: https://doi.org/10.1093/bioinformatics/btu456
IF: 5.8
2014-08-22
Bioinformatics
Abstract:MOTIVATION: Analysis of relationships of drug structure to biological response is key to understanding off-target and unexpected drug effects, and for developing hypotheses on how to tailor drug therapies. New methods are required for integrated analyses of a large number of chemical features of drugs against the corresponding genome-wide responses of multiple cell models.RESULTS: In this article, we present the first comprehensive multi-set analysis on how the chemical structure of drugs impacts on genome-wide gene expression across several cancer cell lines [Connectivity Map (CMap) database]. The task is formulated as searching for drug response components across multiple cancers to reveal shared effects of drugs and the chemical features that may be responsible. The components can be computed with an extension of a recent approach called Group Factor Analysis. We identify 11 components that link the structural descriptors of drugs with specific gene expression responses observed in the three cell lines and identify structural groups that may be responsible for the responses. Our method quantitatively outperforms the limited earlier methods on CMap and identifies both the previously reported associations and several interesting novel findings, by taking into account multiple cell lines and advanced 3D structural descriptors. The novel observations include: previously unknown similarities in the effects induced by 15-delta prostaglandin J2 and HSP90 inhibitors, which are linked to the 3D descriptors of the drugs; and the induction by simvastatin of leukemia-specific response, resembling the effects of corticosteroids.AVAILABILITY AND IMPLEMENTATION: Source Code implementing the method is available at: http://research.ics.aalto.fi/mi/software/GFAsparse.SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
biochemical research methods,biotechnology & applied microbiology,mathematical & computational biology
What problem does this paper attempt to address?