Abstract:This article studies rational and persistent deception among intelligent robots to enhance security and operational efficiency. We present an N-player K-stage game with an asymmetric information structure where each robot's private information is modeled as a random variable or its type. The deception is persistent as each robot's private type remains unknown to other robots for all stages. The deception is rational as robots aim to achieve their deception goals at minimum cost. Each robot forms a dynamic belief of others' types based on intrinsic or extrinsic information. Perfect Bayesian Nash equilibrium (PBNE) is a natural solution concept for dynamic games of incomplete information. Due to its requirements of sequential rationality and belief consistency, PBNE provides a reliable prediction of players' actions, beliefs, and expected cumulative costs over the entire K stages. The contribution of this work is fourfold. First, we identify the PBNE computation as a nonlinear stochastic control problem and characterize the structures of players' actions and costs under PBNE. We further derive a set of extended Riccati equations with cognitive coupling under the linear-quadratic (LQ) setting and extrinsic belief dynamics. Second, we develop a receding-horizon algorithm with low temporal and spatial complexity to compute PBNE under intrinsic belief dynamics. Third, we investigate a deceptive pursuit-evasion game as a case study and use numerical experiments to corroborate the results. Finally, we propose metrics, such as deceivability, reachability, and the price of deception (PoD), to evaluate the strategy design and the system performance under deception.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is how to achieve rational and persistent deceptive behaviors among intelligent robots by establishing a dynamic game framework in multi - agent systems, so as to improve the security and operational efficiency of the system. Specifically, the paper focuses on the N - player K - stage game with an asymmetric information structure, where each robot's private information is modeled as a random variable or its type, and these types remain unknown to other robots throughout the game. This deceptive behavior is rational because the robot aims to achieve its deception goal at the minimum cost; at the same time, the deceptive behavior is persistent because each robot's private type remains unknown to other robots in all stages. The main contributions of the paper include: 1. **PBNE Computation**: Identify PBNE computation as a non - linear stochastic control problem, and describe the structure of player actions and costs under PBNE. Further derive a set of extended Riccati equations with cognitive coupling under the linear - quadratic setting and extrinsic belief dynamics. 2. **Low - Complexity Algorithm**: Develop a receding - horizon algorithm with low time and space complexity for computing PBNE under intrinsic belief dynamics. 3. **Case Study**: Study a deceptive pursuit - evasion game as a case study and use numerical experiments to verify the results. 4. **Evaluation Metrics**: Propose metrics such as deceptiveness, reachability, and deception cost to evaluate the strategy design and the performance of the system under deception. Through these contributions, the paper not only provides a theoretical basis for understanding robot deceptive behaviors but also provides methodological support for designing cost - effective deception countermeasures. These results have broad application prospects in fields such as cooperative robots, pursuit - evasion tasks, and human - machine collaboration.

A Dynamic Game Framework for Rational and Persistent Robot Deception With an Application to Deceptive Pursuit-Evasion

Large Scale Pursuit-Evasion under Collision Avoidance Using Deep Reinforcement Learning.

Bearing Angle Measurement Based Cooperative Pursuit-Evasion Game in Non-Convex Environments

A Game-Theoretic Foundation of Deception: Knowledge Acquisition and Fundamental Limits

Deception in Nash Equilibrium Seeking

Dynamic Bayesian Games for Adversarial and Defensive Cyber Deception

An Improved Approach Towards Multi-Agent Pursuit–Evasion Game Decision-Making Using Deep Reinforcement Learning

Bounded-Rational Pursuit-Evasion Games

Integrated Resource Allocation and Strategy Synthesis in Safety Games on Graphs with Deception

Game of Travesty: Decoy-based Psychological Cyber Deception for Proactive Human Agents

A Differentially Private Game Theoretic Approach for Deceiving Cyber Adversaries

Deception Maze: A Stackelberg Game-Theoretic Defense Mechanism for Intranet Threats

Monte Carlo Neural Fictitious Self-Play: Achieve Approximate Nash equilibrium of Imperfect-Information Games.

Rationalizing Irrational Beliefs

Imitative Follower Deception in Stackelberg Games

Deception Game: Closing the Safety-Learning Loop in Interactive Robot Autonomy

Deception by Design: Evidence-Based Signaling Games for Network Defense

Reward-Based Deception with Cognitive Bias

A Dynamic Games Approach to Proactive Defense Strategies against Advanced Persistent Threats in Cyber-Physical Systems

Stochastic Dynamic Games in Belief Space

Decentralized optimal large scale multi-player pursuit-evasion strategies: A mean field game approach with reinforcement learning