Deep model predictive control of gene expression in thousands of single cells

Jean-Baptiste Lugagne, Caroline M. Blassick, Mary J. Dunlop
DOI: https://doi.org/10.1038/s41467-024-46361-1
IF: 16.6
2024-03-09
Nature Communications
Abstract: Gene expression is inherently dynamic, due to complex regulation and stochastic biochemical events. However, the effects of these dynamics on cell phenotypes can be difficult to determine. Researchers have historically been limited to passive observations of natural dynamics, which can preclude studies of elusive and noisy cellular events where large amounts of data are required to reveal statistically significant effects. Here, using recent advances in the fields of machine learning and control theory, we train a deep neural network to accurately predict the response of an optogenetic system in Escherichia coli cells. We then use the network in a deep model predictive control framework to impose arbitrary and cell-specific gene expression dynamics on thousands of single cells in real time, applying the framework to generate complex time-varying patterns. We also showcase the framework's ability to link expression patterns to dynamic functional outcomes by controlling expression of the tetA antibiotic resistance gene. This study highlights how deep learning-enabled feedback control can be used to tailor distributions of gene expression dynamics with high accuracy and throughput without expert knowledge of the biological system.
multidisciplinary sciences
What problem does this paper attempt to address?
The core problem this paper addresses is how to achieve precise dynamic regulation of single-cell gene expression using deep learning and model predictive control. Specifically, the researchers aim to overcome two limitations of traditional approaches to studying single-cell gene expression dynamics: insufficient data volume and the lack of effective tools for generating arbitrary gene expression dynamics.

### Specific Background of the Problem

1. **Dynamic Characteristics of Gene Expression**
   - Gene expression is inherently dynamic, shaped by complex regulatory mechanisms and stochastic biochemical events.
   - The impact of these dynamics on cell phenotypes is difficult to determine, especially when large amounts of data are required to reveal statistically significant effects.
2. **Limitations of Existing Research**
   - Previous studies have been largely limited to passively observing naturally occurring gene expression dynamics, which makes elusive and noisy cellular events difficult to study.
   - Single-cell-resolution methods (such as genomics, transcriptomics, and flow cytometry) can quantify expression with high throughput, but these measurements are usually "snapshots" that do not capture the temporal context of individual cells.
3. **Lack of Tools for Generating Arbitrary Gene Expression Dynamics**
   - Existing tools struggle to generate precise gene expression dynamics, especially because cells can show different expression patterns under the same optogenetic input.

### Solutions in the Paper

To address these problems, the researchers drew on recent advances in machine learning and control theory and developed a method based on Deep Model Predictive Control (DMPC). The specific steps are as follows:

1. **Establishment of the Experimental Platform**
   - A high-throughput experimental platform was constructed that can culture, observe, and optogenetically stimulate Escherichia coli cells at the single-cell level.
   - The CcaSR optogenetic system was used, with 535 nm green light activating gene expression and 670 nm red light inhibiting it.
2. **Training of the Deep Learning Model**
   - A deep neural network was trained to accurately predict the response of the optogenetic system.
   - The model uses an encoder-decoder structure that combines Long Short-Term Memory (LSTM) networks and Multi-Layer Perceptrons (MLP) to predict future gene expression dynamics.
3. **Real-Time Feedback Control**
   - The trained model was embedded in a deep model predictive control framework to achieve real-time feedback control.
   - Customized optogenetic stimuli were applied independently to each cell through a Digital Micromirror Device (DMD), allowing arbitrary, cell-specific gene expression dynamics to be imposed on thousands of single cells.
4. **Functional Verification**
   - The framework was applied to control expression of the antibiotic resistance gene tetA, generating complex time-varying patterns and linking expression patterns to dynamic functional outcomes.
   - High-resolution data analysis revealed the relationship between gene expression levels, growth rates, and survival.

### Conclusion

This study shows that deep model predictive control can regulate single-cell gene expression dynamics with high precision and high throughput. The method can generate arbitrary gene expression patterns and can also help reveal causal relationships between gene expression dynamics and cell function, providing new tools and ideas for understanding complex biological systems.
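
As a rough illustration of the real-time feedback control step summarized above, the sketch below shows a generic receding-horizon (model predictive control) loop for one cell with a binary light input. Everything in it is an assumption made for illustration: `toy_predict` is a hypothetical first-order model standing in for the trained neural network, the `gain` and `decay` values are arbitrary, and the exhaustive search over light sequences is one simple way to optimize, not necessarily the optimizer used in the paper.

```python
from itertools import product


def toy_predict(level, lights, gain=0.2, decay=0.1):
    """Hypothetical stand-in for the trained predictor (NOT the paper's network).

    `lights` is a sequence of 0 (red, represses) or 1 (green, activates).
    Returns the predicted expression level after each step.
    """
    trajectory = []
    for light in lights:
        level = level + gain * light - decay * level
        trajectory.append(level)
    return trajectory


def choose_light(level, targets, horizon=4):
    """Score every binary light sequence over the horizon and return
    the first input of the sequence with the lowest squared error."""
    best_cost, best_seq = float("inf"), None
    for seq in product((0, 1), repeat=horizon):
        predicted = toy_predict(level, seq)
        cost = sum((p - t) ** 2 for p, t in zip(predicted, targets))
        if cost < best_cost:
            best_cost, best_seq = cost, seq
    return best_seq[0]


def run_control(target, steps=60, horizon=4, start=0.0):
    """Receding-horizon loop: plan, apply only the first input, re-plan.

    For simplicity the simulated "cell" is the same toy model used for
    prediction; in a real experiment the new level would be measured.
    """
    level = start
    history = []
    for _ in range(steps):
        light = choose_light(level, [target] * horizon, horizon)
        level = toy_predict(level, [light])[0]
        history.append(level)
    return history


if __name__ == "__main__":
    trace = run_control(target=1.0)
    print(f"final level: {trace[-1]:.3f}")
```

Because only the first input of each planned sequence is applied before re-planning, the loop can correct for mismatch between prediction and measurement at every step. Running one such loop per cell, each with its own target trajectory, is what the summary means by cell-specific control.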