Abstract:Reinforcement learning (RL) is an artificial intelligence algorithm that can learn adaptive optimal control law online. In view of the fact that the previous control approaches were usually overly dependent on the model parameters of system, and most existing RL methods are based on state feedback, their application in actual industrial production is limited. Additionally, developing accurate process system models and ensuring the closed-loop system’s control performance is more challenging, as modern businesses place a premium on product quality and economic efficiency. As a result, this work introduces a novel data-driven two-dimensional (2D) off-policy Q -learning method based on output feedback is used to achieve optimal tracking control for batch process. First, the error between the actual output and the given set-point is extended to the system to ensure the good tracking performance. Second, by analyzing the relationship between the value function and the Q -function obtained from the 2D system’s performance index, the 2D Bellman equation is obtained in terms of output feedback that is independent of the model parameters. The optimal control problem can be effectively solved by the proposed method in this paper when the policy iteration is executed using only the measurement data of system along the batch and time directions. Following that, the proposed approach’s unbiasedness and convergence are strictly confirmed. Finally, the simulation results for the injection molding process demonstrate that the proposed method is capable of determining the optimal control law as the number of batches is growing increasingly.

Optimal control of batch processes via a deterministic Q-learning method

An Integrated Design for Batch Process Using Optimal Average Profit Control with Unfixed Terminal Time

Batch-To-Batch Control of Batch Processes Based on Multilayer Recurrent Fuzzy Neural Network

Dynamic Modeling And Nonlinear Predictive Control Based On Partitioned Model And Nonlinear Optimization

Batch-to-batch self-optimizing control for batch processes

ILC Based Economic Optimization for Batch Processes Using Helpful Disturbance Information

A real-time optimization approach for uncertain batch processes

Iterative Optimal Control for Batch Process Based on Generalized Predictive Control

Real-time Optimization for Chemical Processes Based on On-Line Modeling of Controlled Variables

Optimal tracking control of batch processes with time-invariant state delay: Adaptive Q-learning with two-dimensional state and control policy

Distributed Dynamic Optimization for Chemical Process Networks Based on Differential Games

A Gradient Descent Method for Optimal Batch-to-Batch Control of Unknown Linear Systems

BATCH-TO-BATCH OPTIMAL CONTROL OF BATCH PROCESSES BASED ON RECURSIVELY UPDATED NONLINEAR PARTIAL LEAST SQUARES MODELS

Novel two-dimensional off-policy Q -learning method for output feedback optimal tracking control of batch process with unknown dynamics

Iterative optimal control for batch processes based on MKPLS and SQP methods

Reinforcement learning for batch bioprocess optimization

Optimal Iterative Learning Control for Batch Processes in the Presence of Time-Varying Dynamics

Batch-to-batch Model-Based Iterative Optimisation Control for a Batch Polymerisation Reactor

Novel data-driven two-dimensional Q-learning for optimal tracking control of batch process with unknown dynamics

A batch-to-batch iterative optimal control strategy based on recurrent neural network models

A Feedback-based Optimization Method for Uncertain Batch Processes