Abstract:Home service robots prioritize cost-effectiveness and convenience over the precision required for industrial tasks like autonomous driving, making their task execution more easily. Meanwhile, path planning tasks using Deep Reinforcement Learning(DRL) are commonly sparse reward problems with limited data utilization, posing challenges in obtaining meaningful rewards during training, consequently resulting in slow or challenging training. In response to these challenges, our paper introduces a lightweight end-to-end path planning algorithm employing with hindsight experience replay(HER). Initially, we optimize the reinforcement learning training process from scratch and map the complex high-dimensional action space and state space to the representative low-dimensional action space. At the same time, we improve the network structure to decouple the model navigation and obstacle avoidance module to meet the requirements of lightweight. Subsequently, we integrate HER and curriculum learning (CL) to tackle issues related to inefficient training. Additionally, we propose a multi-step hindsight experience replay (MS-HER) specifically for the path planning task, markedly enhancing both training efficiency and model generalization across diverse environments. To substantiate the enhanced training efficiency of the refined algorithm, we conducted tests within diverse Gazebo simulation environments. Results of the experiments reveal noteworthy enhancements in critical metrics, including success rate and training efficiency. To further ascertain the enhanced algorithm's generalization capability, we evaluate its performance in some "never-before-seen" simulation environment. Ultimately, we deploy the trained model onto a real lightweight robot for validation. The experimental outcomes indicate the model's competence in successfully executing the path planning task, even on a small robot with constrained computational resources.

Trajectory-based Split Hindsight Reverse Curriculum Learning

Ensemble Bootstrapped Deep Deterministic Policy Gradient For Vision-Based Robotic Grasping

Vision-Based Robotic Object Grasping—A Deep Reinforcement Learning Approach

Model Based Reinforcement Learning for Robot Grasping Trajectory Generation.

Learning to Regrasp Using Visual–Tactile Representation-Based Reinforcement Learning

Solving Robotic Manipulation With Sparse Reward Reinforcement Learning Via Graph-Based Diversity and Proximity

Physics-Guided Hierarchical Reward Mechanism for Learning-Based Robotic Grasping

Trajectory Planning for Teleoperated Space Manipulators Using Deep Reinforcement Learning

Learning from Trajectories via Subgoal Discovery

Hindsight States: Blending Sim and Real Task Elements for Efficient Reinforcement Learning

Learning of Long-Horizon Sparse-Reward Robotic Manipulator Tasks With Base Controllers

Diversity-based Trajectory and Goal Selection with Hindsight Experience Replay

Quantile Regression Hindsight Experience Replay

Off-road Autonomous Vehicles Traversability Analysis and Trajectory Planning Based on Deep Inverse Reinforcement Learning

Exploration via Hindsight Goal Generation

Combining Hindsight with Goal-enhanced Prediction for Multi-goal Reinforcement Learning

Spatial Repetitive Learning Control for Trajectory Learning in Human-Robot Collaboration.

Improvements on Hindsight Learning

Reinforcement learning path planning method incorporating multi-step Hindsight Experience Replay for lightweight robots

Hindsight Planner.

Towards Generalization and Data Efficient Learning of Deep Robotic Grasping