A New Strategy Optimisation Method for Underwater Flapping Foil Propulsion Based on Twin-Delayed Deep Deterministic and Gaussian Process Regression

Yinghe Yang,Handi Wei,Dixia Fan,Ang Li
DOI: https://doi.org/10.1016/j.oceaneng.2024.118813
IF: 5
2024-01-01
Ocean Engineering
Abstract:In complex sea environments, underwater bionic flapping foils exhibit significant nonlinear characteristics across a high-dimensional parameter space. This study introduces a novel optimisation paradigm, G-TD3, integrating Twin-Delayed Deep Deterministic (TD3) reinforcement learning with Gaussian process regression (GPR). An automatic experimental device was developed, using which a series of flapping foil propulsion experiments were conducted, exploring both symmetric and asymmetric flutter modes. Utilizing GPR, we characterized the flapping foil's operational environment from experimental data. In addition, the TD3 algorithm was deployed to exploit this environment, enabling interactive learning in high-dimensional, continuous parameter spaces. Our integrated approach facilitates comprehensive exploration-exploitation dynamics. The optimized propulsion strategy, fine-tuned through maximizing acquisition rewards, demonstrates G-TD3's superiority over traditional brute-force methods, requiring less data for optimising both symmetric and asymmetric flutter strategies. This research provides a new optimisation method for nonlinear systems and offers valuable insights into the practical applications of bionic flapping foils.
What problem does this paper attempt to address?