Abstract:This paper presents a study on path planning for 6-DOF free-floating space robotic manipulators using Deep Deterministic Policy Gradient-based Reinforcement Learning. The focus is the development of a novel reward function tailored to address critical requirements for efficient and effective manipulation in space. These requirements include accurate pose alignment between the end-effector and the target, collision avoidance with both the target and other links of the manipulator, smoothing of joint velocities, adaptability to strong dynamic coupling between the manipulator and its base spacecraft due to high manipulator-spacecraft mass ratio, and resilience to noise in the state observations. Uniquely, the proposed reward function employs quaternions for orientation control to reduce pose misalignments and dynamic singularities, as opposed to traditional Euler angles. Our findings demonstrate that the Reinforcement Learning algorithm, when guided by this new reward function that integrates these enhancements and constraints, not only achieves the desired path planning objectives more efficiently but also exhibits faster convergence. Furthermore, the Reinforcement Learning successfully manages significant dynamic coupling effects caused by a high mass ratio between the robotic manipulator and the base spacecraft. Even under the challenge of noisy state observations, the trained agent successfully completes the path planning task, proving the Reinforcement Learning's applicability to real-space mission designs where the noise in observation is inevitable. The study highlights the critical role of reward function design in the Reinforcement Learning training process and its consequential impact on the solution quality, providing a solid foundation for future advancements in free-floating space robotic missions.

Control of Free-Floating Space Robots to Capture Targets Using Soft Q-Learning.

Learning biped locomotion based on Q-learning and neural networks

Open-Loop Motion Control of a Hydraulic Soft Robotic Arm Using Deep Reinforcement Learning

Learning to Control Space Robots with Flexible Appendages Using Model-Based Policy Search

Autonomous Trajectory Planning of Free-floating Robot for Capturing Space Target.

A Multi-Target Trajectory Planning of a 6-Dof Free-Floating Space Robot Via Reinforcement Learning

A Learning System for Motion Planning of Free-Float Dual-Arm Space Manipulator towards Non-Cooperative Object

Reinforcement learning in dual-arm trajectory planning for a free-floating space robot

A Q-learning Control Method for a Soft Robotic Arm Utilizing Training Data from a Rough Simulator

Autonomous reinforcement learning control for space robot to capture non-cooperative targets

Improving Soft-Capture Phase Success in Space Debris Removal Missions: Leveraging Deep Reinforcement Learning and Tactile Feedback

Robust Adaptive Learning Control of Space Robot for Target Capturing Using Neural Network.

Control of Space Flexible Manipulator Using Soft Actor-Critic and Random Network Distillation

Autonomous Path Planning and Experiment Study of Free-Floating Space Robot for Spinning Satellite Capturing

A Trajectory Planning Method for Capture Operation of Space Robotic Arm Based on Deep Reinforcement Learning

Learning strategies for underwater robot autonomous manipulation control

Path Planning of 6-DOF Free-Floating Space Robotic Manipulators Using Reinforcement Learning

Trajectory Optimization and Tracking Control of Free-flying Space Robots for Capturing Non-cooperative Tumbling Objects

Reinforcement Learning with Prior Policy Guidance for Motion Planning of Dual-Arm Free-Floating Space Robot

Position Control of Cable-Driven Robotic Soft Arm Based on Deep Reinforcement Learning