Reinforcement learning for batch bioprocess optimization

P. Petsagkourakis,I.O. Sandoval,E. Bradford,D. Zhang,E.A. del Rio-Chanona
DOI: https://doi.org/10.1016/j.compchemeng.2019.106649
2020-02-01
Abstract:<p>Bioprocesses have received a lot of attention to produce clean and sustainable alternatives to fossil-based materials. However, they are generally difficult to optimize due to their unsteady-state operation modes and stochastic behaviours. Furthermore, biological systems are highly complex, therefore plant-model mismatch is often present. To address the aforementioned challenges we propose a Reinforcement learning based optimization strategy for batch processes.</p><p>In this work we applied the Policy Gradient method from batch-to-batch to update a control policy parametrized by a recurrent neural network. We assume that a preliminary process model is available, which is exploited to obtain a preliminary optimal control policy. Subsequently, this policy is updated based on measurements from the <em>true</em> plant. The capabilities of our proposed approach were tested on three case studies (one of which is nonsmooth) using a more complex process model for the <em>true</em> system embedded with adequate process disturbance. Lastly, we discussed advantages and disadvantages of this strategy compared against current existing approaches such as nonlinear model predictive control.</p>
engineering, chemical,computer science, interdisciplinary applications
What problem does this paper attempt to address?