Abstract: As a transformative general-purpose technology, AI has empowered various industries and will continue to shape our lives through ubiquitous applications. Despite the enormous benefits of widespread AI deployment, it is crucial to address the associated downside risks and thereby ensure that AI advances are safe, fair, responsible, and aligned with human values. Doing so requires effective AI governance. In this work, we show that the strategic interaction between regulatory agencies and AI firms has an intrinsic structure reminiscent of a Stackelberg game, which motivates us to propose a game-theoretic modeling framework for AI governance. In particular, we formulate this interaction as a Stackelberg game composed of a leader and a follower, which captures the underlying game structure better than its simultaneous-play counterparts. Furthermore, the choice of the leader naturally gives rise to two settings. We demonstrate that our proposed model can serve as a unified AI governance framework in two respects: first, one setting can be mapped to the AI governance of civil domains and the other to the safety-critical and military domains; second, the choice between the two settings can be made contingent on the capability of the intelligent systems. To the best of our knowledge, this work is the first to use game theory for analyzing and structuring AI governance. We also discuss promising directions and hope this can help stimulate research interest in this interdisciplinary area. At a high level, we hope this work will contribute to developing a new paradigm for technology policy: quantitative and AI-driven methods for the technology policy field, which hold significant promise for overcoming many shortcomings of existing qualitative approaches.
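
For orientation, the leader-follower structure described in the abstract can be written in the textbook bilevel form of a Stackelberg game. This is a generic sketch, not the paper's own formulation: the symbols $x$, $y$, $X$, $Y$, $u_L$, and $u_F$ are assumptions introduced here for illustration, and the abstract does not specify the players' action sets or payoffs.

```latex
% Generic Stackelberg (leader-follower) game; all symbols are illustrative assumptions.
% x \in X : action of the leader,   u_L : leader's payoff
% y \in Y : action of the follower, u_F : follower's payoff
\begin{aligned}
  \max_{x \in X} \quad & u_L\bigl(x,\, y^{*}(x)\bigr) \\
  \text{s.t.} \quad    & y^{*}(x) \in \arg\max_{y \in Y} \; u_F(x, y)
\end{aligned}
```

The leader commits to $x$ first, anticipating the follower's best response $y^{*}(x)$. The two settings mentioned in the abstract would correspond to which party, the regulatory agency or the AI firm, takes the leader role. In a simultaneous-play game, by contrast, neither player observes the other's action before choosing.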

Exploring the Constraints on Artificial General Intelligence: A Game-Theoretic No-Go Theorem

Appraising Regulatory Framework Towards Artificial General Intelligence (AGI) under Digital Humanism

Impossibility Results in AI: A Survey

On the Controllability of Artificial Intelligence: An Analysis of Limitations

Asymptotically Unambitious Artificial General Intelligence

Close the Gates: How we can keep the future human by choosing not to develop superhuman general-purpose artificial intelligence

On Controllability of AI

A Game-Theoretic Framework for AI Governance

The AI Race: Why Current Neural Network-based Architectures are a Poor Basis for Artificial General Intelligence

Superhuman Artificial Intelligence Can Improve Human Decision Making by Increasing Novelty

Games, AI and Systems

Rise of artificial general intelligence: risks and opportunities

Scenarios and branch points to future machine intelligence

On the link between conscious function and general intelligence in humans and machines

To Be, Or Not To Be?: Regulating Impossible AI in the United States

Does AlphaGo actually play Go? Concerning the State Space of Artificial Intelligence

Building Safer AGI by introducing Artificial Stupidity

Artificial General Intelligence, Existential Risk, and Human Risk Perception

The Threat of a Reward-Driven Adversarial Artificial General Intelligence

Don't Fear the Reaper: Refuting Bostrom's Superintelligence Argument

Possibilities and Implications of the Multi-AI Competition