Abstract: Frontier models are currently developed and distributed primarily through two channels: centralized proprietary APIs or open-sourcing of pre-trained weights. We identify a third paradigm - Protocol Learning - where models are trained across decentralized networks of incentivized participants. This approach has the potential to aggregate orders of magnitude more computational resources than any single centralized entity, enabling unprecedented model scales and capabilities. However, it also introduces novel challenges: heterogeneous and unreliable nodes, malicious participants, the need for unextractable models to preserve incentives, and complex governance dynamics. To date, no systematic analysis has been conducted to assess the feasibility of Protocol Learning or the associated risks, particularly the 'No-Off Problem' arising from the inability to unilaterally halt a collectively trained model. We survey recent technical advances that suggest decentralized training may be feasible - covering emerging communication-efficient strategies and fault-tolerant methods - while highlighting critical open problems that remain. Contrary to the notion that decentralization inherently amplifies frontier risks, we argue that Protocol Learning's transparency, distributed governance, and democratized access ultimately reduce these risks compared to today's centralized regimes.

What problem does this paper attempt to address?

This paper explores an emerging paradigm for developing and distributing deep-learning models: **Protocol Learning**. The authors point out the limitations of the two dominant channels today, centralized proprietary APIs and open-sourced pre-trained weights, and propose decentralized collaborative training as a way to overcome them and to train models at larger scale. Specifically, the paper addresses the following key problems:

1. **Scalability of computing resources**
   - Centralized model development today depends on computing resources held by a few large companies or national institutions, which creates barriers to entry and limits what other participants can do.
   - By incentivizing dispersed network nodes to participate, Protocol Learning could aggregate orders of magnitude more compute than any single centralized entity, supporting model training at unprecedented scale.
2. **Technical feasibility of decentralized training**
   - Decentralized training must handle communication efficiency, heterogeneous and unreliable nodes, and Byzantine fault tolerance.
   - The paper surveys existing techniques, including communication-efficient Distributed Data Parallel (DDP), pipeline parallelism, and Byzantine-fault-tolerant training methods, and examines their applicability and directions for improvement in a large-scale decentralized setting.
3. **Incentive mechanisms and governance structures**
   - How can incentive mechanisms be designed to attract enough participants to contribute computing resources?
   - Protocol Learning proposes rewarding contributors with partial ownership of the model, while keeping the training process transparent and fair and preventing malicious actors from disrupting the system.
4. **Decentralized Frontier Risk (DFR)**
   - Decentralized training introduces new risks, such as the "No-Off Problem": the inability to unilaterally halt a collectively trained model.
   - The authors argue that, despite these new challenges, the transparency, distributed governance, and democratized access of Protocol Learning may ultimately reduce overall frontier risk.
5. **Unextractability of the model**
   - The trained model must not be extractable from the protocol. Otherwise, someone could replace distributed inference with a more efficient centralized inference service, weakening participants' incentive to contribute.
   - This property is achieved through redundant sharding and verification mechanisms, so that inference on the model can run only within the protocol.

In summary, the paper explores a new decentralized training paradigm, aims to advance large-scale model training by addressing the problems above, and seeks to redefine the risk and governance framework for AI development.
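
To make the Byzantine-fault-tolerance point concrete, here is a minimal sketch of one well-known robust aggregation rule, the coordinate-wise median, compared with plain averaging. It is an illustration only and is not claimed to be the method the paper proposes. The function names and the gradient values are hypothetical.

```python
from statistics import mean, median


def aggregate_mean(gradients):
    """Plain averaging: a single bad node can shift every coordinate arbitrarily."""
    return [mean(coord) for coord in zip(*gradients)]


def aggregate_median(gradients):
    """Coordinate-wise median: unaffected by a minority of arbitrary updates."""
    return [median(coord) for coord in zip(*gradients)]


# Hypothetical 2-dimensional gradients: four honest nodes, one malicious node.
honest = [[0.9, -2.1], [1.0, -2.0], [1.1, -1.9], [1.0, -2.0]]
malicious = [[1000.0, 1000.0]]
updates = honest + malicious

naive = aggregate_mean(updates)     # dominated by the malicious update
robust = aggregate_median(updates)  # stays close to the honest gradients

print("mean:  ", naive)
print("median:", robust)
```

With these made-up numbers the averaged update is pulled far from the honest gradients, while the median stays at `[1.0, -2.0]`. A median-based rule only helps while malicious nodes are a minority in each coordinate, which is one reason the abstract lists malicious participants among the novel challenges of decentralized training.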

Protocol Learning, Decentralized Frontier Risk and the No-Off Problem
