Targeted design of synthetic enhancers for selected tissues in the Drosophila embryo

Bernardo P. de Almeida,Christoph Schaub,Michaela Pagani,Stefano Secchia,Eileen E. M. Furlong,Alexander Stark
DOI: https://doi.org/10.1038/s41586-023-06905-9
IF: 64.8
2023-12-13
Nature
Abstract:Enhancers control gene expression and play crucial roles in development and homeostasis 1–3 . However, the targeted de novo design of enhancers with tissue-specific activities has remained challenging. Here, we combine deep learning and transfer learning to design tissue-specific enhancers for five tissues in the Drosophila melanogaster embryo – the central nervous system (CNS), epidermis, gut, muscle, and brain. We first train convolutional neural networks (CNNs) using genome-wide scATAC-seq datasets and then fine-tune the CNNs with smaller-scale data from in vivo enhancer activity assays, yielding models with 25% to 75% positive predictive value according to cross-validation. We designed and experimentally assessed 40 synthetic enhancers (eight per tissue) in vivo , of which 31 (78%) were active and 27 (68%) functioned in the target tissue (100% for CNS and muscle). The strategy to combine genome-wide and small-scale functional datasets by transfer learning is generally applicable and should enable the design of tissue-, cell type-, and cell state-specific enhancers in any system.
multidisciplinary sciences
What problem does this paper attempt to address?