Abstract: We introduce the Proportional Payoff Allocation Game (PPA-Game) to model how agents, akin to content creators on platforms like YouTube and TikTok, compete for divisible resources and consumers' attention. Payoffs are allocated to agents based on heterogeneous weights, reflecting the diversity in content quality among creators. Our analysis reveals that although a pure Nash equilibrium (PNE) is not guaranteed in every scenario, it is commonly observed, with its absence being rare in our simulations. Beyond analyzing static payoffs, we further discuss the agents' online learning about resource payoffs by integrating a multi-player multi-armed bandit framework. We propose an online algorithm facilitating each agent's maximization of cumulative payoffs over $T$ rounds. Theoretically, we establish that the regret of any agent is bounded by $O(\log^{1 + \eta} T)$ for any $\eta > 0$. Empirical results further validate the effectiveness of our approach.

What problem does this paper attempt to address?

The paper asks how to understand and optimize the behavior of online content creators in recommendation systems by modeling the competitive dynamics among them. Specifically, it introduces the Proportional Payoff Allocation Game (PPA-Game) to describe how content creators compete for limited resources (such as audience attention), and it studies whether a Nash equilibrium exists in this competitive setting.

### Main problems:

1. **Modeling of competitive dynamics**: The paper aims to build a model that describes how online content creators compete for limited resources such as audience attention. This helps in understanding the competitive relationships among creators as well as the stability and fairness of the platform.
2. **Existence of Nash equilibrium**: The paper studies whether a pure Nash equilibrium (PNE) exists in the PPA-Game. A PNE is not guaranteed in every scenario, but the authors' simulations show that its absence is rare.
3. **Online learning and resource allocation**: Because resource payoffs fluctuate in practice and players do not know them in advance, the paper introduces a Multi-Player Multi-Armed Bandit (MPMAB) framework to model how players make decisions from historical observations and maximize their cumulative payoffs. The theoretical analysis shows that each player's regret is bounded by $O(\log^{1 + \eta} T)$ for any $\eta > 0$.

### Solutions:

- **PPA-Game model**: The game consists of multiple players and resources. Each player selects a resource, and the payoff of each resource is divided among the players who selected it in proportion to their weights. The paper analyzes the existence of a PNE under different conditions, including long-tailed distributions and homogeneous players and resources.
- **MPMAB framework**: To deal with the uncertainty of resource payoffs, the paper proposes an online learning algorithm that helps players gradually learn and optimize their strategies without knowing the resource payoffs or their own weights. The algorithm alternates between exploration and exploitation phases and eventually converges to the optimal PNE.

### Conclusion:

By introducing the PPA-Game model and the MPMAB framework, the paper provides a new way to understand and optimize the competitive dynamics among online content creators. The results show that although a PNE may not exist in some cases, it is commonly observed in the simulated scenarios. In addition, the proposed online learning algorithm helps players maximize their payoffs in an uncertain environment.
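The proportional allocation rule and the notion of a PNE described above can be illustrated with a minimal sketch. It assumes a specific payoff form that the summary does not spell out: a player with weight $w_j$ who selects a resource with payoff $\mu_k$ receives $\mu_k \cdot w_j / \sum_i w_i$, where the sum runs over all players on that resource. The function names (`payoffs`, `is_pne`, `find_pnes`) and the brute-force search are illustrative choices, not the paper's algorithm, and the search is only practical for small games.

```python
from itertools import product


def payoffs(choice, weights, resource_payoffs):
    """Payoff of each player under the assumed proportional rule.

    choice[p] is the index of the resource selected by player p.
    Weights are assumed to be strictly positive.
    """
    # Total weight of the players sharing each resource.
    load = [0.0] * len(resource_payoffs)
    for p, k in enumerate(choice):
        load[k] += weights[p]
    # Each player gets a share of the resource proportional to its weight.
    return [resource_payoffs[k] * weights[p] / load[k]
            for p, k in enumerate(choice)]


def is_pne(choice, weights, resource_payoffs, tol=1e-12):
    """True if no player can strictly gain by switching resources alone."""
    base = payoffs(choice, weights, resource_payoffs)
    for p in range(len(choice)):
        for k in range(len(resource_payoffs)):
            if k == choice[p]:
                continue
            deviated = list(choice)
            deviated[p] = k
            if payoffs(deviated, weights, resource_payoffs)[p] > base[p] + tol:
                return False
    return True


def find_pnes(weights, resource_payoffs):
    """Enumerate all pure strategy profiles and keep the equilibria."""
    n, m = len(weights), len(resource_payoffs)
    return [c for c in product(range(m), repeat=n)
            if is_pne(c, weights, resource_payoffs)]


if __name__ == "__main__":
    # Two equal-weight players: sharing the better resource (0.5 each)
    # beats moving alone to the weaker one (0.4).
    print(find_pnes([1.0, 1.0], [1.0, 0.4]))  # [(0, 0)]
    # With a stronger second resource (0.6 > 0.5) the players split up.
    print(find_pnes([1.0, 1.0], [1.0, 0.6]))  # [(0, 1), (1, 0)]
```

An empty list returned by `find_pnes` would correspond to an instance with no PNE, the case the paper reports as rare in its simulations.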

PPA-Game: Characterizing and Learning Competitive Dynamics Among Online Content Creators

A competitive analysis of online failure-aware assignment.

Fans Economy and All-Pay Auctions with Proportional Allocations.

Regulating Competition in Age of Information under Network Externalities

Paying to Do Better: Games with Payments between Learning Agents

General Performance Evaluation for Competitive Resource Allocation Games via Unseen Payoff Estimation

Modeling Paying Behavior in Game Social Networks

Equilibrium Analysis of Customer Attraction Games

Differential Games of Competition in Online Content Diffusion

Reward Optimization for Content Providers with Mobile Data Subsidization: A Hierarchical Game Approach

Dynamic generation and attribution of revenues in a video platform

Incentivizing High-Quality Content in Online Recommender Systems

Competing for Shareable Arms in Multi-Player Multi-Armed Bandits

Modeling Content Creator Incentives on Algorithm-Curated Platforms

Non-zero-sum Stackelberg Budget Allocation Game for Computational Advertising

A k-hop Collaborate Game Model: Adaptive Strategy to Maximize Total Revenue

Neural Payoff Machines: Predicting Fair and Stable Payoff Allocations Among Team Members

MACRO: Incentivizing Multi-Leader Game-Based Pareto-Efficient Crowdsourcing for Video Analytics

Game-Theoretic Analysis of Live-Streaming E-Commerce on TikTok

How Bad is Top-$K$ Recommendation under Competing Content Creators?

A Game Theoretic Analysis of Incentives in Content Production and Sharing Over Peer-to-Peer Networks