Learning-based Control for Tendon-Driven Continuum Robotic Arms

Nima Maghooli,Omid Mahdizadeh,Mohammad Bajelani,S. Ali A. Moosavian
2024-12-06
Abstract:This paper presents a learning-based approach for centralized position control of Tendon Driven Continuum Robots (TDCRs) using Deep Reinforcement Learning (DRL), with a particular focus on the Sim-to-Real transfer of control policies. The proposed control method employs the Modified Transpose Jacobian (MTJ) control strategy, with its parameters optimally tuned using the Deep Deterministic Policy Gradient (DDPG) algorithm. Classical model-based controllers encounter significant challenges due to the inherent uncertainties and nonlinear dynamics of continuum robots. In contrast, model-free control strategies require efficient gain-tuning to handle diverse operational scenarios. This research aims to develop a model-free controller with performance comparable to model-based strategies by integrating an optimal adaptive gain-tuning system. Both simulations and real-world implementations demonstrate that the proposed method significantly enhances the trajectory-tracking performance of continuum robots independent of initial conditions and paths within the operational task-space, effectively establishing a task-free controller.
Robotics,Systems and Control
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: How to achieve centralized position control of Tendon - Driven Continuum Robots (TDCRs) through Deep Reinforcement Learning (DRL), especially optimizing the control strategy in the process of Sim - to - Real transfer. Specifically, the paper focuses on the following points: 1. **Control Challenges**: - Traditional model - based controllers face significant challenges when dealing with the nonlinear dynamics and inherent uncertainties of continuum robots. - Model - based controllers are difficult to adapt to complex operating scenarios, while model - free control strategies require effective gain tuning to handle different tasks. 2. **Research Objectives**: - Develop a model - free controller whose performance can be comparable to model - based strategies, and achieve this by integrating an optimal adaptive gain tuning system. - Achieve effective transfer from simulation to reality, ensuring accurate trajectory tracking in uncertain and dynamic environments. 3. **Technical Methods**: - Propose a control strategy based on the Modified Transpose Jacobian (MTJ) and use the Deep Deterministic Policy Gradient (DDPG) algorithm to optimally tune its parameters. - Verify the proposed method through simulation and actual experiments, demonstrating a significant improvement in task - space trajectory tracking performance under different initial conditions and paths, thereby establishing a task - independent controller. 4. **Main Contributions**: - Achieve effective Sim - to - Real transfer, enabling the MTJ controller to achieve accurate trajectory tracking in uncertain and dynamic environments. - Use reinforcement learning as an optimal adaptive gain tuning system, reducing training time and improving the robustness and lightweight characteristics of the controller, making it suitable for practical applications. In summary, this paper aims to develop an efficient, robust, and practical model - free controller by combining deep reinforcement learning and the modified transpose Jacobian control strategy to solve the control problems of tendon - driven continuum robots in complex environments.