Driver Modeling through Deep Reinforcement Learning and Behavioral Game Theory

Berat Mert Albaba,Yildiray Yildiz
DOI: https://doi.org/10.48550/arXiv.2003.11071
2020-03-25
Abstract:In this paper, a synergistic combination of deep reinforcement learning and hierarchical game theory is proposed as a modeling framework for behavioral predictions of drivers in highway driving scenarios. The need for a modeling framework that can address multiple human-human and human-automation interactions, where all the agents can be modeled as decision makers simultaneously, is the main motivation behind this work. Such a modeling framework may be utilized for the validation and verification of autonomous vehicles: It is estimated that for an autonomous vehicle to reach the same safety level of cars with drivers, millions of miles of driving tests are required. The modeling framework presented in this paper may be used in a high-fidelity traffic simulator consisting of multiple human decision makers to reduce the time and effort spent for testing by allowing safe and quick assessment of self-driving algorithms. To demonstrate the fidelity of the proposed modeling framework, game theoretical driver models are compared with real human driver behavior patterns extracted from traffic data.
Multiagent Systems,Machine Learning
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is to develop a modeling framework that can handle human - to - human and human - to - automation - system interactions among multiple drivers simultaneously. Specifically, the paper proposes an approach that combines deep reinforcement learning and hierarchical game theory to predict drivers' behaviors in highway driving scenarios. The main motivation for this modeling framework is to provide a tool for verifying and validating the safety of autonomous vehicles, because it is estimated that in order to reach the same safety level as human - driven cars, autonomous vehicles need to conduct millions of miles of driving tests. ### Main Contributions: 1. **Extended Scenario Modeling Capability**: By using deep Q - learning (DQN), this method can handle a wider range of scenario categories than in previous studies, especially by using deep neural networks to handle the expanded state space. 2. **Reduced Collision Rate**: By expanding the observation space and eliminating drivers' blind spots, the collision rate in the simulation is reduced to a level close to that in real life. 3. **Model Validation**: The developed driver model was compared with two sets of traffic data to statistically analyze the predictive ability of the model. ### Method Overview: - **Level - k Reasoning**: This is a hierarchical decision - making concept that assumes different people have different reasoning levels. The lowest level of reasoning is called Level - 0. These agents are non - strategic, and their decisions are not based on the actions that other agents may take. Level - 1 agents assume that all other agents are Level - 0 and make the best response accordingly. This process can continue to higher levels. - **Deep Q - learning (DQN)**: DQN is a reinforcement learning method that approximates the Q - function through a deep neural network to handle large - scale state spaces. This enables the model to make the best response according to the possible actions of other agents in a multi - step decision - making process. - **Combining Level - k Reasoning and DQN**: By combining these two methods, agents with different reasoning levels are generated to simulate multi - step strategic decisions in traffic scenarios. For example, in order to obtain a Level - 1 strategy, a traffic scenario in which all drivers are Level - 0 agents is created, and the ego - driver learns how to make the best response to the Level - 0 strategy through DQN training. Similarly, higher - level strategies can be obtained. ### Application Scenarios: - **Verification and Validation of Autonomous Vehicles**: This modeling framework can be used in high - fidelity traffic simulators, including multiple simulated human decision - makers, thereby reducing test time and cost and quickly evaluating the safety of autonomous driving algorithms. ### Conclusion: The method proposed in the paper that combines deep reinforcement learning and game theory provides a new perspective for modeling driver behaviors in complex traffic scenarios, especially in dealing with multi - agent interactions. Through this method, researchers can more effectively verify and validate the safety of autonomous driving technology.