Abstract:In reinforcement learning, a reward function is a priori specified mapping that informs the learning agent how well its current actions and states are performing. From the viewpoint of training, reinforcement learning requires no labeled data and has none of the errors that are induced in supervised learning because responsibility is transferred from the loss function to the reward function. Methods that infer an approximated reward function using observations of demonstrations are termed inverse reinforcement learning or apprenticeship learning. A reward function is generated that reproduces observed behaviors. In previous studies, the reward function is implemented by estimating the maximum likelihood, Bayesian or information theoretic methods. This study proposes an inverse reinforcement learning method that has an approximated reward function as a linear combination of feature expectations, each of which plays a role in a base weak classifier. This approximated reward function is used by the agent to learn a policy, and the resultant behaviors are compared with an expert demonstration. The difference between the behaviors of the agent and those of the expert is measured using defined metrics, and the parameters for the approximated reward function are adjusted using an ensemble fuzzy method that has a boosting classification. After some interleaving iterations, the agent performs similarly to the expert demonstration. A fuzzy method is used to assign credits for the rewards in respect of the most recent decision to the neighboring states. Using the proposed method, the agent approximates the expert behaviors in fewer steps. The results of simulation demonstrate that the proposed method performs well in terms of sampling efficiency.

Sensitivity-based inverse reinforcement learning

Off-Dynamics Inverse Reinforcement Learning from Hetero-Domain

Convergence Analysis of an Incremental Approach to Online Inverse Reinforcement Learning

Gaussian processes in inverse reinforcement learning

Modified Reward Function on Abstract Features in Inverse Reinforcement Learning

Inverse Reinforcement Learning with Unknown Reward Model based on Structural Risk Minimization

A Survey of Inverse Reinforcement Learning Techniques.

Towards Theoretical Understanding of Inverse Reinforcement Learning

Hybrid Inverse Reinforcement Learning

Inverse Delayed Reinforcement Learning

Is Inverse Reinforcement Learning Harder than Standard Reinforcement Learning? A Theoretical Perspective

An Inverse Reinforcement Learning Algorithm for Semi-Markov Decision Processes

Inverse Reinforcement Learning: A Control Lyapunov Approach

Data-Driven Inverse Reinforcement Learning for Expert-Learner Zero-Sum Games

Active Learning for Risk-Sensitive Inverse Reinforcement Learning

Stable Inverse Reinforcement Learning: Policies from Control Lyapunov Landscapes

Model-Free Inverse H-Infinity Control for Imitation Learning

An Ensemble Fuzzy Approach for Inverse Reinforcement Learning

An Ensemble Method for Inverse Reinforcement Learning

Curricular Subgoals for Inverse Reinforcement Learning