Markov Decision Processes with Dynamic Transition Probabilities: An Analysis of Shooting Strategies in Basketball

Nathan Sandholtz,Luke Bornn
DOI: https://doi.org/10.1214/20-AOAS1348
2018-12-13
Abstract:In this paper we model basketball plays as episodes from team-specific non-stationary Markov decision processes (MDPs) with shot clock dependent transition probabilities. Bayesian hierarchical models are employed in the modeling and parametrization of the transition probabilities to borrow strength across players and through time. To enable computational feasibility, we combine lineup-specific MDPs into team-average MDPs using a novel transition weighting scheme. Specifically, we derive the dynamics of the team-average process such that the expected transition count for an arbitrary state-pair is equal to the weighted sum of the expected counts of the separate lineup-specific MDPs. We then utilize these non-stationary MDPs in the creation of a basketball play simulator with uncertainty propagated via posterior samples of the model components. After calibration, we simulate seasons both on-policy and under altered policies and explore the net changes in efficiency and production under the alternate policies. Additionally, we discuss the game-theoretic ramifications of testing alternative decision policies.
Applications,Methodology
What problem does this paper attempt to address?
This paper mainly discusses the issue of shooting strategy in basketball games. The authors model offensive possessions in basketball games as Markov Decision Processes (MDP) with dynamic transition probabilities and consider the impact of a 24-second shot clock on decision making. They use a Bayesian hierarchical model to estimate transition probabilities for different players and time points, and propose a novel transition weighting scheme to combine specific lineup MDPs into a team average MDP. A basketball game simulator was developed in the paper, which allows simulating games under uncertainty in the posterior model components to study what happens when players change their shooting strategies. They not only focus on shooting decisions but also emphasize the flexibility of strategies, where players can choose when and where to shoot, which can be improved through training to rapidly enhance skill, rather than just increasing shooting accuracy. The authors utilize high-resolution optical tracking data to analyze NBA games from the 2015-2016 season, constructing MDP state spaces based on players, positions, and defensive pressure, and adopting a binary action space (shoot or not shoot). Through simulations and exploration of different policies, they study the effects of changing shooting strategies on efficiency, production, and the game theory aspect. In summary, this paper aims to test and compare shooting strategies in basketball games through a non-static MDP framework, in order to gain a better understanding and optimize game decisions.