Leveraging Noisy Observations in Zero-Sum Games

Emmanouil M Athanasakos, Samir M Perlaza
2024-02-05
Abstract: This paper studies an instance of zero-sum games in which one player (the leader) commits to its opponent (the follower) to choose its actions by sampling a given probability measure (strategy). The actions of the leader are observed by the follower as the output of an arbitrary channel. In response to that, the follower chooses its action based on its current information, that is, the leader's commitment and the corresponding noisy observation of its action. Within this context, the equilibrium of the game with noisy action observability is shown to always exist and the necessary conditions for its uniqueness are identified. Interestingly, the noisy observations have an important impact on the cardinality of the follower's set of best responses. Under particular conditions, such a set of best responses is proved to be a singleton almost surely. The proposed model captures any channel noise with a density with respect to the Lebesgue measure. As an example, the case in which the channel is described by a Gaussian probability measure is investigated.
Computer Science and Game Theory, Information Theory, Machine Learning
What problem does this paper attempt to address?
This paper studies how to determine the equilibrium of a zero-sum game (ZSG) in which one player (the leader) commits to choosing its actions according to a given probability distribution, and the other player (the follower) observes these actions through a noisy channel. Specifically, the paper focuses on:

1. **Introducing a new game model**: In this model, the follower receives a noisy version of the leader's actions. The channel noise can be arbitrary, provided it admits a density with respect to the Lebesgue measure.
2. **Existence and uniqueness**: The paper proves that the equilibrium of the game with noisy observations always exists and identifies necessary conditions for its uniqueness.
3. **Characteristics of the best-response set**: The best-response set of the follower is studied, and it is proved that, under specific conditions, this set is a singleton almost surely (i.e., the follower has a unique optimal response for almost every observation).
4. **The influence of noise**: The influence of different types of noise on the game results is explored, especially the influence of noise on the cardinality of the follower's best-response set. In some cases, noise may be beneficial or insignificant.
5. **Specific examples**: Taking the Additive White Gaussian Noise (AWGN) channel as an example, the equilibrium under this particular noise model is analyzed in detail.
### Summary of mathematical formulas

- **Expected payoff function**:
  \[ v(P_{A_1|\tilde{A}_2}, P_{A_2})=\int_{A_2}\left(\int_{\tilde{A}_2}\left(\int_{A_1}u(a, b)\,dP_{A_1|\tilde{A}_2 = \tilde{b}}(a)\right)dP_{\tilde{A}_2|A_2 = b}(\tilde{b})\right)dP_{A_2}(b) \]
- **Best-response definition**:
  \[ BR_1(P, \tilde{b})=\arg\max_{Q\in\Delta(A_1)}\sum_{i = 1}^{m_1}Q(a_{1,i})\left(\sum_{j = 1}^{m_2}u_{i,j}P(a_{2,j})\frac{dP_{\tilde{A}_2|A_2 = a_{2,j}}}{dP_{\tilde{A}_2|A_2 = a_{2,1}}}(\tilde{b})\right) \]
- **Optimal commitment function**:
  \[ \hat{v}(P)=\max_{Q_{A_1|\tilde{A}_2}\in\Delta(A_1|\tilde{A}_2)}v(Q_{A_1|\tilde{A}_2}, P)\quad\text{s.t.}\quad\forall\tilde{b}\in\tilde{A}_2,\;Q_{A_1|\tilde{A}_2=\tilde{b}}\in BR_1(P, \tilde{b}) \]

### Main contributions

- A new game model is proposed, considering any type of noisy channel.
- It is proved that in the case of noisy observations, the equilibrium of the game always exists and is unique under specific conditions.
- The influence of noise on the follower's best-response set is studied, and it is proved that in some cases, noise can make the best-response set a singleton set.
- Through specific examples (such as the Additive White Gaussian Noise channel), it is shown how to calculate specific equilibrium solutions.

This paper provides a new perspective for understanding zero-sum games under noisy observation conditions and provides a theoretical basis for further research on adversarial learning and Generative Adversarial Networks (GANs).
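
As an illustration of the best-response formula in the summary above, the following Python sketch evaluates it for a small two-action game over a Gaussian channel. Several things here are assumptions made for the example rather than details taken from the paper: the payoff matrix `u`, the embedding of the leader's actions as the real numbers in `leader_points`, the noise level `sigma`, and the helper names `gaussian_likelihood_ratios` and `best_response_support`. The sketch follows the arg max convention as written in the summary. Because the objective is linear in Q, the best-response set consists of all mixtures over the pure actions with the highest score, so it is a singleton exactly when a single action attains the maximum.

```python
import numpy as np


def gaussian_likelihood_ratios(b_tilde, leader_points, sigma):
    """Ratio of channel densities f(b_tilde | a_{2,j}) / f(b_tilde | a_{2,1}).

    Assumes an additive Gaussian channel: b_tilde = a_{2,j} + N(0, sigma^2).
    """
    # Log-density up to an additive constant shared by every j.
    log_dens = -((b_tilde - leader_points) ** 2) / (2.0 * sigma ** 2)
    # Dividing by the j = 1 density (index 0 here) mirrors the summary's formula.
    return np.exp(log_dens - log_dens[0])


def best_response_support(u, P, b_tilde, leader_points, sigma, tol=1e-12):
    """Pure follower actions that maximise the best-response objective.

    u has shape (m1, m2); P is the leader's commitment over its m2 actions.
    Returns the indices of the maximising actions and the score of each action.
    """
    ratios = gaussian_likelihood_ratios(b_tilde, leader_points, sigma)
    scores = u @ (P * ratios)  # score_i = sum_j u_ij * P(a_2j) * ratio_j
    support = np.flatnonzero(scores >= scores.max() - tol)
    return support, scores


# Hypothetical instance: matching-pennies payoffs and a uniform commitment.
u = np.array([[1.0, -1.0], [-1.0, 1.0]])
P = np.array([0.5, 0.5])
leader_points = np.array([0.0, 1.0])
sigma = 1.0

# Simulate the channel and count how often the best response is unique.
rng = np.random.default_rng(0)
actions = rng.choice(len(P), size=10_000, p=P)
observations = leader_points[actions] + sigma * rng.normal(size=actions.size)
unique = [
    best_response_support(u, P, b, leader_points, sigma)[0].size == 1
    for b in observations
]
print("fraction of observations with a unique best response:", np.mean(unique))
```

In this toy instance the two scores tie only at the single observation \(\tilde{b} = 0.5\), a set of Lebesgue measure zero, which is consistent with the claim that the best-response set is a singleton almost surely.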