Toward Computationally Efficient Inverse Reinforcement Learning via Reward Shaping

Lauren H. Cooke,Harvey Klyne,Edwin Zhang,Cassidy Laidlaw,Milind Tambe,Finale Doshi-Velez
2023-12-18
Abstract:Inverse reinforcement learning (IRL) is computationally challenging, with common approaches requiring the solution of multiple reinforcement learning (RL) sub-problems. This work motivates the use of potential-based reward shaping to reduce the computational burden of each RL sub-problem. This work serves as a proof-of-concept and we hope will inspire future developments towards computationally efficient IRL.
Machine Learning,Artificial Intelligence
What problem does this paper attempt to address?
The paper aims to address the computational efficiency issues in Inverse Reinforcement Learning (IRL). Specifically, common IRL methods require solving multiple Reinforcement Learning (RL) subproblems, making IRL computationally very challenging. To alleviate the computational burden of each RL subproblem, the authors propose a Potential-Based Reward Shaping method. This approach can reduce the computational complexity of each RL subproblem by adjusting the reward function without altering the optimal policy. The main contribution of the paper is providing a proof-of-concept, demonstrating how to use sample trajectories of random and optimal policies to select an initial feasible reward potential function. This method is referred to as "Planning-Aware Reward Shaping." Although this paper does not directly solve the problem of making IRL more computationally efficient, the authors hope that this research will inspire further developments in this direction.