Abstract: We show how the looming threat of bad actors using AI/GPT to generate harms across social media can be addressed at scale by exploiting the intrinsic dynamics of the social media multiverse. We combine a uniquely detailed description of the current bad-actor-mainstream battlefield with a mathematical description of its behavior to show what bad-actor-AI activity will likely dominate, where, and when. A dynamical Red Queen analysis predicts an escalation to daily bad-actor-AI activity by early 2024, just ahead of U.S. and other global elections. We provide a Policy Matrix that quantifies outcomes and trade-offs mathematically for the policy options of containment vs. removal. We give explicit plug-and-play formulae for risk measures.

What problem does this paper attempt to address?

This paper addresses how to predict and control the behavior of bad actors who use AI to generate harmful content across large-scale online battlefields. Specifically, it focuses on the following core questions:

1. **What types of bad-actor-AI activity are most likely to occur?** The paper compares basic forms of GPT (such as GPT-2) with advanced forms (such as GPT-3 and GPT-4) and argues that basic GPT may become the main source of threat because of its availability and ease of deployment.

2. **Where will these activities occur?** The paper maps the current social media battlefield as a dynamic network graph, highlighting the extreme anti-X communities (bad-actor communities) across 13 platforms and their links with mainstream communities. The analysis shows that smaller platforms play a crucial role despite their size, because of their high link activity.

3. **When will these activities occur?** The paper uses the Red Queen hypothesis and a random walk model to predict the timing of bad-actor-AI activity. Based on existing data, it predicts that bad-actor-AI attacks will occur almost daily by early 2024, just ahead of U.S. and other global elections.

4. **How can the impact of these activities be mitigated and their outcomes predicted?** The paper proposes a Policy Matrix that uses mathematical descriptions to quantify the outcomes and trade-offs of different policy options (such as containment and removal). It also provides explicit formulas for risk measures, for example: \[ n_B(s) = C s^{-\alpha} e^{-\beta s} \] where \(n_B(s)\) is the number of bad-actor-AI clusters with intensity \(s\), \(C\) is a normalization constant, and \(\alpha\) and \(\beta\) are parameters.

5. **How can these activities be controlled at scale?** The paper proposes a mathematical model based on community cluster dynamics to describe and control the bad-actor-AI system. This model considers two key equations, one of which is: \[ T = \frac{2S_B}{S_A - S_B}\ln\left(\frac{S_B(S_A - S_B)}{S_A}\right) \] These equations help predict the time and resource requirements under different strategies and show that even a slight advantage can significantly reduce the intensity distribution of bad-actor-AI clusters.

By addressing these questions, the paper aims to give policymakers a scientific basis for dealing with increasing AI-driven online harms.
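As a minimal sketch, the two formulas quoted above can be evaluated directly. The function names and all numeric parameter values below are illustrative assumptions, not values from the paper, and the expression for \(T\) is transcribed exactly as it appears in this summary without independent verification.

```python
import math


def cluster_count(s, C=1.0, alpha=2.5, beta=0.01):
    """Evaluate n_B(s) = C * s**(-alpha) * exp(-beta * s).

    C, alpha and beta are placeholder values chosen for illustration only.
    """
    if s <= 0:
        raise ValueError("intensity s must be positive")
    return C * s ** (-alpha) * math.exp(-beta * s)


def time_T(S_A, S_B):
    """Evaluate T = 2*S_B/(S_A - S_B) * ln(S_B*(S_A - S_B)/S_A).

    Transcribed from the summary text as given. Requires S_A > S_B > 0
    so that the denominator and the logarithm's argument are positive.
    """
    if S_B <= 0 or S_A <= S_B:
        raise ValueError("requires S_A > S_B > 0")
    ratio = S_B * (S_A - S_B) / S_A
    return 2.0 * S_B / (S_A - S_B) * math.log(ratio)


if __name__ == "__main__":
    # Hypothetical inputs, for illustration only.
    for s in (1, 10, 100):
        print(f"n_B({s}) = {cluster_count(s):.6g}")
    print(f"T(S_A=10, S_B=5) = {time_T(10, 5):.4f}")
```

With these placeholder parameters, `cluster_count` falls as `s` grows, reflecting the power-law decay with an exponential cutoff in the quoted formula. Note that `time_T` as transcribed can return a negative value when the logarithm's argument is below 1, so its output should be read with that caveat.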

Controlling bad-actor-AI activity at scale across online battlefields