Abstract: The aim of Reinforcement Learning (RL) in real-world applications is to create systems that make autonomous decisions by learning from their environment through trial and error. This paper emphasizes the importance of reward engineering and reward shaping in improving the efficiency and effectiveness of RL algorithms. Reward engineering involves designing reward functions that accurately reflect the desired outcomes, while reward shaping provides additional feedback that guides the learning process and accelerates convergence to optimal policies. Despite significant advances in RL, several limitations persist. One key challenge is that rewards in many real-world scenarios are sparse and delayed, which can hinder learning progress. In addition, accurately modeling real-world environments is difficult, and the computational demands of RL algorithms remain a substantial obstacle. At the same time, recent advances in deep learning and neural networks have significantly improved the ability of RL systems to handle high-dimensional state and action spaces, enabling their application to complex tasks such as robotics, autonomous driving, and game playing. This paper provides a comprehensive review of the current state of RL, focusing on the methodologies and techniques used in reward engineering and reward shaping. It critically analyzes the limitations of and recent advances in the field, and offers insights into future research directions and potential applications in various domains.
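To make the idea of reward shaping concrete, the following is a minimal sketch of one widely used form, potential-based shaping, in which the shaped reward is r + gamma * phi(s') - phi(s). It is an illustration only and not a method taken from this paper: the grid-world states, the `GOAL` cell, the `negative_manhattan` potential, and the `shaped_reward` helper are all hypothetical names chosen for the example.

```python
from typing import Callable, Tuple

# Hypothetical state type: an (x, y) cell in a small grid world.
State = Tuple[int, int]

# Hypothetical goal cell, assumed only for this illustration.
GOAL: State = (3, 3)


def negative_manhattan(state: State) -> float:
    """Assumed potential function: higher (closer to 0) nearer the goal."""
    return -float(abs(state[0] - GOAL[0]) + abs(state[1] - GOAL[1]))


def shaped_reward(
    reward: float,
    state: State,
    next_state: State,
    potential: Callable[[State], float],
    gamma: float = 0.99,
    done: bool = False,
) -> float:
    """Return r + gamma * phi(s') - phi(s).

    The potential of a terminal next state is treated as zero, a common
    convention for episodic tasks.
    """
    next_phi = 0.0 if done else potential(next_state)
    return reward + gamma * next_phi - potential(state)


# With a sparse environment reward of 0, the shaping term alone signals progress.
toward = shaped_reward(0.0, (0, 0), (0, 1), negative_manhattan, gamma=1.0)
away = shaped_reward(0.0, (0, 1), (0, 0), negative_manhattan, gamma=1.0)
print(toward, away)
```

In this sketch a step toward the goal earns a positive shaping bonus and a step away earns an equal penalty, so the agent receives feedback on every transition even though the environment reward itself is zero until the goal is reached.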

Learning to Shape Rewards using a Game of Switching Controls

Learning to Shape Rewards Using a Game of Two Partners

Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping

ORSO: Accelerating Reward Design via Online Reward Selection and Policy Optimization

Shaping Reward Learning Approach from Passive Samples

Reward Shaping via Meta-Learning

Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning

Comprehensive Overview of Reward Engineering and Shaping in Advancing Reinforcement Learning Applications

Using Natural Language for Reward Shaping in Reinforcement Learning

Tactical Reward Shaping: Bypassing Reinforcement Learning with Strategy-Based Goals

Shaping in Reinforcement Learning Via Knowledge Transferred from Human-Demonstrations

Continuously Discovering Novel Strategies Via Reward-Switching Policy Optimization

Risk-Aware Reward Shaping of Reinforcement Learning Agents for Autonomous Driving

Benchmarking Potential Based Rewards for Learning Humanoid Locomotion

Best Response Shaping

Logic-based Reward Shaping for Multi-Agent Reinforcement Learning

Learning Task-Distribution Reward Shaping with Meta-Learning

Temporal Video-Language Alignment Network for Reward Shaping in Reinforcement Learning

Barrier Functions Inspired Reward Shaping for Reinforcement Learning

Generalization in Deep Reinforcement Learning for Robotic Navigation by Reward Shaping

Story Shaping: Teaching Agents Human-like Behavior with Stories