D-GAN : Autonomous Driving using Generative Adversarial Networks

C. Fabbri
Abstract:We propose a framework for learning a policy directly from data in the contex of behavioral cloning. We explore environments in which a reward function R is not known. Classically, Inverse Reinforcement Learning is used to extract R to then use with Reinforcement Learning to learn a policy π. We skip this step, and train an agent that matches the policy of the behavior given by a human such that the two are indistinguishable. A focus is put on the self-driving environment, however we note that this framework is general and can be applied to any simulation for which human experience is obtainable.
Computer Science
What problem does this paper attempt to address?