Data-driven and Physics Informed Modelling of Chinese Hamster Ovary Cell Bioreactors

Tianqi Cui, Tom S. Bertalan, Nelson Ndahiro, Pratik Khare, Michael Betenbaugh, Costas Maranas, Ioannis G. Kevrekidis
2023-05-05
Abstract: Fed-batch culture is an established operation mode for the production of biologics using mammalian cell cultures. Quantitative modeling integrates both kinetics for some key reaction steps and optimization-driven metabolic flux allocation, using flux balance analysis; this is known to lead to certain mathematical inconsistencies. Here, we propose a physically-informed data-driven hybrid model (a "gray box") to learn models of the dynamical evolution of Chinese Hamster Ovary (CHO) cell bioreactors from process data. The approach incorporates physical laws (e.g. mass balances) as well as kinetic expressions for metabolic fluxes. Machine learning (ML) is then used to (a) directly learn evolution equations (black-box modelling); (b) recover unknown physical parameters ("white-box" parameter fitting) or -- importantly -- (c) learn partially unknown kinetic expressions (gray-box modelling). We encode the convex optimization step of the overdetermined metabolic biophysical system as a differentiable, feed-forward layer into our architectures, connecting partial physical knowledge with data-driven machine learning.
Quantitative Methods, Machine Learning, Dynamical Systems
What problem does this paper attempt to address?
This paper addresses how to build a model that accurately simulates and predicts the dynamic behavior of Chinese Hamster Ovary (CHO) cell bioreactors in biopharmaceutical production. Specifically, it combines data-driven and physics-based methods to overcome the mathematical inconsistencies and uncertainties of traditional metabolic models, particularly those built on Metabolic Flux Analysis (MFA) and Flux Balance Analysis (FBA). Using machine learning, in particular neural networks, the paper proposes a hybrid modeling approach that learns the dynamical evolution of CHO cell bioreactors from process data while respecting physical laws (such as mass balances) and kinetic expressions for metabolic fluxes.

The main contributions of the paper are as follows:

1. **Proposing a physics-informed, data-driven hybrid model**: The model can learn the system's evolution equations directly from data (black-box modeling), recover unknown physical parameters (white-box parameter fitting), or learn partially unknown kinetic expressions (gray-box modeling).
2. **Embedding the convex optimization step of the overdetermined metabolic biophysical system**: This step is encoded as a differentiable feed-forward layer that connects partial physical knowledge with data-driven machine learning, with the aim of improving the accuracy and reliability of the model.
3. **Describing the model structure and optimization algorithm in detail**: This includes how to handle the constraints of the embedded optimization problem and how to compute the gradients needed for model training.

Through these methods, the paper aims to provide more accurate tools for simulating and controlling CHO cell bioreactors, and thereby to improve the efficiency and quality of biopharmaceutical production.
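To make the idea of a convex optimization step acting as a differentiable feed-forward layer more concrete, here is a minimal sketch in Python with NumPy. It is not the paper's formulation: it assumes a simplified problem with equality constraints only, in which kinetically predicted fluxes `v_kin` are reconciled with a steady-state balance `S v = 0` by least squares. The matrix `S`, the toy three-flux network, and the function names are all hypothetical, and `S` is assumed to have full row rank. Under these assumptions the optimum has a closed form, so both the forward pass and its Jacobian can be written explicitly.

```python
import numpy as np


def project_onto_balance(S, v_kin):
    """Forward pass: solve  min_v ||v - v_kin||^2  subject to  S v = 0.

    The KKT conditions give v = v_kin - S^T (S S^T)^{-1} S v_kin.
    Assumes S has full row rank so that S S^T is invertible.
    """
    lam = np.linalg.solve(S @ S.T, S @ v_kin)  # Lagrange multipliers
    return v_kin - S.T @ lam


def projection_jacobian(S):
    """Backward pass: dv/dv_kin = I - S^T (S S^T)^{-1} S.

    The layer is linear in v_kin, so its Jacobian does not depend on v_kin.
    """
    n_fluxes = S.shape[1]
    return np.eye(n_fluxes) - S.T @ np.linalg.solve(S @ S.T, S)


# Hypothetical toy network: one internal metabolite, produced by flux 1
# and consumed by fluxes 2 and 3.
S = np.array([[1.0, -1.0, -1.0]])

# Fluxes suggested by (possibly learned) kinetic expressions; they do not
# balance exactly, which is the overdetermined situation being reconciled.
v_kin = np.array([1.0, 0.5, 0.2])

v = project_onto_balance(S, v_kin)
J = projection_jacobian(S)

print("reconciled fluxes:", v)
print("balance residual: ", S @ v)
```

In a training loop, the gradient of a loss with respect to `v_kin` would be obtained by multiplying the upstream gradient by `J` transposed, which is how the layer passes gradients back to the kinetic parameters. Inequality constraints, such as flux bounds, generally remove the closed form and require differentiating through the optimality conditions of the solver instead.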