ConcertoRL: An Innovative Time-Interleaved Reinforcement Learning Approach for Enhanced Control in Direct-Drive Tandem-Wing Vehicles

Minghao Zhang,Bifeng Song,Changhao Chen,Xinyu Lang

2024-05-22

Abstract:In control problems for insect-scale direct-drive experimental platforms under tandem wing influence, the primary challenge facing existing reinforcement learning models is their limited safety in the exploration process and the stability of the continuous training process. We introduce the ConcertoRL algorithm to enhance control precision and stabilize the online training process, which consists of two main innovations: a time-interleaved mechanism to interweave classical controllers with reinforcement learning-based controllers aiming to improve control precision in the initial stages, a policy composer organizes the experience gained from previous learning to ensure the stability of the online training process. This paper conducts a series of experiments. First, experiments incorporating the time-interleaved mechanism demonstrate a substantial performance boost of approximately 70% over scenarios without reinforcement learning enhancements and a 50% increase in efficiency compared to reference controllers with doubled control frequencies. These results highlight the algorithm's ability to create a synergistic effect that exceeds the sum of its parts.

Artificial Intelligence

What problem does this paper attempt to address?

### What problem does this paper attempt to solve? This paper aims to address the issue of precise control in direct-drive tandem wing aircraft, particularly improving control accuracy and safety during plug-in online training and control processes. Specifically: 1. **Control Challenges**: - Existing reinforcement learning models have limited safety during the exploration process and insufficient stability during continuous training. - The direct-drive experimental platform is affected by tandem wing interference, leading to nonlinear and unstable aerodynamic characteristics, which pose challenges for control. 2. **Proposed Solution**: - The ConcertoRL algorithm is proposed to enhance control accuracy and stabilize the online training process through two main innovations: - **Time Interleaving Mechanism**: Interleaving classical controllers with reinforcement learning-based controllers to improve control accuracy in the initial stages. - **Policy Orchestrator**: Organizing experiences gained from previous learning to ensure the stability of the online training process. 3. **Experimental Validation**: - Experiments show that with the time interleaving mechanism, performance improved by approximately 70%, and efficiency increased by 50% compared to the reference controller. - The policy orchestrator further enhanced the stability of ConcertoRL's online training. - Generalization experiments demonstrated that ConcertoRL is compatible with various classical controllers and can achieve excellent control effects under different parameters. In summary, this paper aims to solve the issues of accuracy and stability in the control process of direct-drive tandem wing aircraft through the ConcertoRL algorithm.

ConcertoRL: An Innovative Time-Interleaved Reinforcement Learning Approach for Enhanced Control in Direct-Drive Tandem-Wing Vehicles

A Plug-and-Play Fully On-the-Job Real-Time Reinforcement Learning Algorithm for a Direct-Drive Tandem-Wing Experiment Platforms Under Multiple Random Operating Conditions

Learning with Training Wheels: Speeding up Training with a Simple Controller for Deep Reinforcement Learning

Retro-RL: Reinforcing Nominal Controller With Deep Reinforcement Learning for Tilting-Rotor Drones

RL + Model-based Control: Using On-demand Optimal Control to Learn Versatile Legged Locomotion

Model-Based Safe Reinforcement Learning With Time-Varying Constraints: Applications to Intelligent Vehicles

Model-Based Safe Reinforcement Learning with Time-Varying State and Control Constraints: An Application to Intelligent Vehicles

Wing Kinematics-Based Flight Control Strategy in Insect-Inspired Flight Systems: Deep Reinforcement Learning Gives Solutions and Inspires Controller Design in Flapping MAVs

Realizing asynchronous finite-time robust tracking control of switched flight vehicles by using nonfragile deep reinforcement learning

Continuous-Time Reinforcement Learning: New Design Algorithms with Theoretical Insights and Performance Guarantees

Joint Optimization of Sensing, Decision-making and Motion-controlling for Autonomous Vehicles: A Deep Reinforcement Learning Approach

Deep Reinforcement-Learning-Based Air-Combat-Maneuver Generation Framework

How to Train Your Quadrotor: A Framework for Consistently Smooth and Responsive Flight Control via Reinforcement Learning

Reinforcement Learning Control of Hypersonic Vehicles and Performance Evaluations

Multi-Task Reinforcement Learning in Continuous Control with Successor Feature-Based Concurrent Composition

A Harmonized Approach: Beyond-the-Limit Control for Autonomous Vehicles Balancing Performance and Safety in Unpredictable Environments

A Reinforcement Learning Approach for Continuum Robot Control

End-to-End Safe Reinforcement Learning through Barrier Functions for Safety-Critical Continuous Control Tasks

Learning to Change: Choreographing Mixed Traffic Through Lateral Control and Hierarchical Reinforcement Learning

Reinforcement learning control method for real‐time hybrid simulation based on deep deterministic policy gradient algorithm

Guiding real-world reinforcement learning for in-contact manipulation tasks with Shared Control Templates