Abstract:Home service robots prioritize cost-effectiveness and convenience over the precision required for industrial tasks like autonomous driving, making their task execution more easily. Meanwhile, path planning tasks using Deep Reinforcement Learning(DRL) are commonly sparse reward problems with limited data utilization, posing challenges in obtaining meaningful rewards during training, consequently resulting in slow or challenging training. In response to these challenges, our paper introduces a lightweight end-to-end path planning algorithm employing with hindsight experience replay(HER). Initially, we optimize the reinforcement learning training process from scratch and map the complex high-dimensional action space and state space to the representative low-dimensional action space. At the same time, we improve the network structure to decouple the model navigation and obstacle avoidance module to meet the requirements of lightweight. Subsequently, we integrate HER and curriculum learning (CL) to tackle issues related to inefficient training. Additionally, we propose a multi-step hindsight experience replay (MS-HER) specifically for the path planning task, markedly enhancing both training efficiency and model generalization across diverse environments. To substantiate the enhanced training efficiency of the refined algorithm, we conducted tests within diverse Gazebo simulation environments. Results of the experiments reveal noteworthy enhancements in critical metrics, including success rate and training efficiency. To further ascertain the enhanced algorithm's generalization capability, we evaluate its performance in some "never-before-seen" simulation environment. Ultimately, we deploy the trained model onto a real lightweight robot for validation. The experimental outcomes indicate the model's competence in successfully executing the path planning task, even on a small robot with constrained computational resources.

Bias-reduced Multi-step Hindsight Experience Replay for Efficient Multi-goal Reinforcement Learning

MHER: Model-based Hindsight Experience Replay

Combining Hindsight with Goal-enhanced Prediction for Multi-goal Reinforcement Learning

MRHER: Model-based Relay Hindsight Experience Replay for Sequential Object Manipulation Tasks with Sparse Rewards

Addressing Hindsight Bias in Multigoal Reinforcement Learning

Quantile Regression Hindsight Experience Replay

Soft Hindsight Experience Replay

Learning and reusing primitive behaviours to improve Hindsight Experience Replay sample efficiency

Hindsight Planner.

ACTRCE: Augmenting Experience via Teacher's Advice For Multi-Goal Reinforcement Learning

Deep Reinforcement Learning for Complex Manipulation Tasks with Sparse Feedback

Exploration via Hindsight Goal Generation

Consistent Experience Replay in High-Dimensional Continuous Control with Decayed Hindsights

Relay Hindsight Experience Replay: Self-guided continual reinforcement learning for sequential object manipulation tasks with sparse rewards

Diversity-based Trajectory and Goal Selection with Hindsight Experience Replay

Reinforcement learning path planning method incorporating multi-step Hindsight Experience Replay for lightweight robots

Improvements on Hindsight Learning

Leveraging Efficiency Through Hybrid Prioritized Experience Replay in Door Environment.

ACDER: Augmented Curiosity-Driven Experience Replay

Hindsight States: Blending Sim and Real Task Elements for Efficient Reinforcement Learning

Efficient Multi-Goal Reinforcement Learning Via Value Consistency Prioritization