Abstract:The intrinsic high dimension of fluid dynamics is an inherent challenge to control of aerodynamic flows, and this is further complicated by a flow's nonlinear response to strong disturbances. Deep reinforcement learning, which takes advantage of the exploratory aspects of reinforcement learning (RL) and the rich nonlinearity of a deep neural network, provides a promising approach to discover feasible control strategies. However, the typical model-free approach to reinforcement learning requires a significant amount of interaction between the flow environment and the RL agent during training, and this high training cost impedes its development and application. In this work, we propose a model-based reinforcement learning (MBRL) approach by incorporating a novel reduced-order model as a surrogate for the full environment. The model consists of a physics-augmented autoencoder, which compresses high-dimensional CFD flow field snaphsots into a three-dimensional latent space, and a latent dynamics model that is trained to accurately predict the long-time dynamics of trajectories in the latent space in response to action sequences. The robustness and generalizability of the model is demonstrated in two distinct flow environments, a pitching airfoil in a highly disturbed environment and a vertical-axis wind turbine in a disturbance-free environment. Based on the trained model in the first problem, we realize an MBRL strategy to mitigate lift variation during gust-airfoil encounters. We demonstrate that the policy learned in the reduced-order environment translates to an effective control strategy in the full CFD environment.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is the control problem in strongly perturbed and unstable airflows. Specifically, the paper focuses on how to use the Model - Based Reinforcement Learning (MBRL) method to effectively control unstable aerodynamic flows that exhibit highly nonlinear responses under strong external disturbances. Such control is crucial for improving the performance, efficiency, and safety of various systems, such as those in the energy, transportation, and biomedical science fields. ### Main Challenges 1. **High - Dimensional Problem**: The inherently high - dimensional nature of fluid dynamics makes airflow control challenging in itself. 2. **Nonlinear Response**: The nonlinear response of the airflow to strong perturbations further complicates the control problem. 3. **High Training Cost**: Traditional model - free reinforcement learning methods require a large number of interactions between the environment and the agent for training, which is very expensive in the fluid dynamics environment because each interaction may require running a Computational Fluid Dynamics (CFD) simulation. ### Solutions To address these challenges, the authors propose a Model - Based Reinforcement Learning method (MBRL) by introducing a novel reduced - order model as a substitute for the complete environment. This model consists of two main parts: - **Physics - Augmented Autoencoder (PA - AE)**: Compresses high - dimensional CFD flow field snapshots into a three - dimensional latent space and is able to reconstruct the flow field state. - **Latent Dynamics Model (LDM)**: Trained to accurately predict the long - time dynamic response of trajectories in the latent space in response to action sequences. ### Application Scenarios The authors verified the robustness and generalization ability of the model in two different fluid environments: 1. **Pitching Airfoil in a Strongly Perturbed Environment**: In this environment, the model can effectively mitigate lift changes caused by the encounter between gusts and the airfoil. 2. **Vertical - Axis Wind Turbine in an Unperturbed Environment**: In this environment, the model also shows good control effects. ### Experimental Results Based on the trained model, the authors implemented the MBRL policy and successfully transformed the policy learned in the reduced - order environment into an effective control policy in the full CFD environment, thus demonstrating the effectiveness and practicality of this method. ### Summary This paper proposes an efficient and economical Model - Based Reinforcement Learning method for controlling unstable airflows under strong perturbations by combining the Physics - Augmented Autoencoder and the Latent Dynamics Model. This method not only reduces the training cost but also improves the robustness and generalization ability of the control strategy.

Model-Based Reinforcement Learning for Control of Strongly-Disturbed Unsteady Aerodynamic Flows

Deep reinforcement learning of airfoil pitch control in a highly disturbed environment using partial observations

Single-step deep reinforcement learning for open-loop control of laminar and turbulent flows

Efficient Active Flow Control Strategy for Confined Square Cylinder Wake Using Deep Learning-Based Surrogate Model and Reinforcement Learning

Adaptive control of transonic buffet and buffeting flow with deep reinforcement learning

Physics-informed Dyna-style model-based deep reinforcement learning for dynamic control

Model-based deep reinforcement learning for accelerated learning from flow simulations

Deep reinforcement learning based synthetic jet control on disturbed flow over airfoil

Deep reinforcement transfer learning for active flow control of a 3D square cylinder under state dimension mismatch

Deep Reinforcement Learning: A New Beacon for Intelligent Active Flow Control

Deep Dynamical Modeling and Control of Unsteady Fluid Flows

Dynamic Feature-based Deep Reinforcement Learning for Flow Control of Circular Cylinder with Sparse Surface Pressure Sensing

Active flow control with rotating cylinders by an artificial neural network trained by deep reinforcement learning

Surrogate model-based deep reinforcement learning for experimental study of active flow control of circular cylinder

Deep reinforcement learning for turbulent drag reduction in channel flows

Adjoint-based machine learning for active flow control

Flow control of three-dimensional cylinders transitioning to turbulence via multi-agent reinforcement learning

Learning-Based Neural Dynamic Surface Predictive Control for MMC