Abstract:This article is concerned with the problem of planning optimal maneuver trajectories and guiding the mobile robot toward target positions in uncertain environments for exploration purposes. A hierarchical deep learning-based control framework is proposed which consists of an upper level motion planning layer and a lower level waypoint tracking layer. In the motion planning phase, a recurrent deep neural network (RDNN)-based algorithm is adopted to predict the optimal maneuver profiles for the mobile robot. This approach is built upon a recently proposed idea of using deep neural networks (DNNs) to approximate the optimal motion trajectories, which has been validated that a fast approximation performance can be achieved. To further enhance the network prediction performance, a recurrent network model capable of fully exploiting the inherent relationship between preoptimized system state and control pairs is advocated. In the lower level, a deep reinforcement learning (DRL)-based collision-free control algorithm is established to achieve the waypoint tracking task in an uncertain environment (e.g., the existence of unexpected obstacles). Since this approach allows the control policy to directly learn from human demonstration data, the time required by the training process can be significantly reduced. Moreover, a noisy prioritized experience replay (PER) algorithm is proposed to improve the exploring rate of control policy. The effectiveness of applying the proposed deep learning-based control is validated by executing a number of simulation and experimental case studies. The simulation result shows that the proposed DRL method outperforms the vanilla PER algorithm in terms of training speed. Experimental videos are also uploaded, and the corresponding results confirm that the proposed strategy is able to fulfill the autonomous exploration mission with improved motion planning performance, enhanced collision avoidance ability, and less training time.

Anti-collision Trajectory Planning for Satellite Formation Reconstruction Based on Deep Reinforcement Learning

Deep Reinforcement Learning-Based Autonomous Mission Planning Method for High and Low Orbit Multiple Agile Earth Observing Satellites

Satellite Attitude Tracking Control of Moving Targets Combining Deep Reinforcement Learning and Predefined-time Stability Considering Energy Optimization

Spacecraft Attitude Maneuver Planning Based on Deep Reinforcement Learning under Complex Constraints

Distributed Swarm Trajectory Optimization for Formation Flight in Dense Environments

Underwater Multi-agent Cooperative Formation Hunting Based on Deep Reinforcement Learning

Reinforcement learning-based satellite formation attitude control under multi-constraint

Simultaneous approach with partial error control on non-collocation points based satellite formation reconfiguration

A Fast Approach to Satellite Range Rescheduling Using Deep Reinforcement Learning

An Algorithm of Reinforcement Learning for Maneuvering Parameter Self-Tuning Applying in Satellite Cluster

DRL-Based Dynamic Destroy Approaches for Agile-Satellite Mission Planning

Trajectory Planning with Deep Reinforcement Learning in High-Level Action Spaces

A Path Planning Approach of Distributed Satellites Formation Reconfiguration

Online Trajectory Planning Method for Midcourse Guidance Phase Based on Deep Reinforcement Learning

Reinforcement Learning-enabled Satellite Constellation Reconfiguration and Retasking for Mission-Critical Applications

Deep Reinforcement Learning Based Trajectory Planning Under Uncertain Constraints

Deep Reinforcement Learning-Based 3D Trajectory Planning for Cellular Connected UAV

Spacecraft Formation Reconfiguration Trajectory Planning with Avoidance Constraints Using Adaptive Pigeon-Inspired Optimization

Trajectory Planning for Teleoperated Space Manipulators Using Deep Reinforcement Learning

Reinforcement learning for path planning of free-floating space robotic manipulator with collision avoidance and observation noise

Design and Experimental Validation of Deep Reinforcement Learning-Based Fast Trajectory Planning and Control for Mobile Robot in Unknown Environment