Three million images and morphological profiles of cells treated with matched chemical and genetic perturbations

Srinivas Niranj Chandrasekaran,Beth A Cimini,Amy Goodale,Lisa Miller,Maria Kost-Alimova,Nasim Jamali,John G Doench,Briana Fritchman,Adam Skepner,Michelle Melanson,Alexandr A Kalinin,John Arevalo,Marzieh Haghighi,Juan C Caicedo,Daniel Kuhn,Desiree Hernandez,James Berstler,Hamdah Shafqat-Abbasi,David E Root,Susanne E Swalley,Sakshi Garg,Shantanu Singh,Anne E Carpenter
DOI: https://doi.org/10.1038/s41592-024-02241-6
Abstract:The identification of genetic and chemical perturbations with similar impacts on cell morphology can elucidate compounds' mechanisms of action or novel regulators of genetic pathways. Research on methods for identifying such similarities has lagged due to a lack of carefully designed and well-annotated image sets of cells treated with chemical and genetic perturbations. Here we create such a Resource dataset, CPJUMP1, in which each perturbed gene's product is a known target of at least two chemical compounds in the dataset. We systematically explore the directionality of correlations among perturbations that target the same protein encoded by a given gene, and we find that identifying matches between chemical and genetic perturbations is a challenging task. Our dataset and baseline analyses provide a benchmark for evaluating methods that measure perturbation similarities and impact, and more generally, learn effective representations of cellular state from microscopy images. Such advancements would accelerate the applications of image-based profiling of cellular states, such as uncovering drug mode of action or probing functional genomics.
What problem does this paper attempt to address?