A Stackelberg Game based on the Secretary Problem: Optimal Response is History Dependent

David Ramsey
2024-09-06
Abstract:This article considers a problem arising from a two-player game based on the classical secretary problem. First, Player 1 selects one object from a sequence as in the secretary problem. All of the other objects are then presented to Player 2 in the same order as in the original sequence. The goal of both players is to select the best object. The optimal response of Player 2 is adapted to the optimal strategy in the secretary problem. This means that when Player 2 observes an object that is the best seen so far, it can be inferred whether Player 1 selected one of the earlier objects in the original sequence. However, Player 2 cannot compare the current object with the one selected by Player 1. Hence, this game defines an auxiliary problem in which Player 2 has incomplete information on the relative rank of an object. It is shown that the optimal strategy of Player 2 is based on both the number of objects to have appeared and the probability that the current object is better than the object chosen by Player 1 (if Player 1 chose an earlier object in the sequence). However, this probability is dependent on the previously observed objects. A lower bound on the optimal expected reward in the auxiliary problem is defined by limiting the memory of Player 2. An upper bound is derived by giving Player 2 additional information at appropriate times. The methods used illustrate approaches that can be used to approximate the optimal reward in a stopping problem when there is incomplete information on the ranks of objects and/or the optimal strategy is history dependent, as in the Robbins' problem
Computer Science and Game Theory,Optimization and Control
What problem does this paper attempt to address?
### What problem does this paper attempt to solve? This paper aims to solve the optimal response problem in a two - player game based on the classical secretary problem. Specifically, the paper explores how the second player (Player 2) can make an optimal decision based on historical information given that the first player (Player 1) adopts an optimal strategy. #### Problem background 1. **Classical secretary problem**: In a classical secretary problem, a decision - maker (DM) needs to select the best one from a series of objects that appear in a random order. The optimal strategy is to reject the first \(n^ * - 1\) objects and then accept the first relatively optimal object (i.e., an object better than all previous ones). Here, \(n^ *\) is the threshold that maximizes the expected payoff. 2. **Two - player game model**: - **Player 1**: First, select an object from a series of objects, similar to the decision - maker in the secretary problem. - **Player 2**: Observe the remaining objects in the same order and try to select the optimal one. Player 2 cannot directly compare the current object with the object selected by Player 1, but can infer from historical information. #### Research objectives The main objectives of the paper are: - **Determine the optimal response strategy of Player 2**: When Player 1 adopts an optimal strategy, how Player 2 can make an optimal decision based on historical information. - **Analyze historical dependence**: Prove that the optimal strategy of Player 2 depends not only on whether the current object is relatively optimal, but also on the number of objects that have already appeared and the historical information of these objects. - **Calculate the upper and lower bounds of the optimal expected reward**: Obtain the lower bound of the optimal expected reward by limiting the memory ability of Player 2, and obtain the upper bound by providing additional information. #### Main contributions - **Proof of historical dependence**: Show that the optimal strategy of Player 2 is history - dependent, and this dependence does not disappear as the number of objects increases. - **Calculation of upper and lower bounds**: Calculate the upper and lower bounds of the optimal expected reward of Player 2 by different methods, which are similar to the methods used to estimate the optimal expected ranking in the Robbins problem. ### Summary This paper studies the optimal response problem of Player 2 when facing the optimal strategy of Player 1 by constructing a two - player game model based on the classical secretary problem. The research results show that the optimal strategy of Player 2 is history - dependent, and by calculating the upper and lower bounds, further reveals the complexity and challenges of this problem.