Abstract:We present a practical asynchronous data fusion model for networked agents to perform distributed Bayesian learning without sharing raw data. Our algorithm uses a gossip-based approach where pairs of randomly selected agents employ unadjusted Langevin dynamics for parameter sampling. We also introduce an event-triggered mechanism to further reduce communication between gossiping agents. These mechanisms drastically reduce communication overhead and help avoid bottlenecks commonly experienced with distributed algorithms. In addition, the reduced link utilization by the algorithm is expected to increase resiliency to occasional link failure. We establish mathematical guarantees for our algorithm and demonstrate its effectiveness via numerical experiments.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is how to enable multiple computing nodes (or agents) to effectively perform Bayesian learning without sharing the original data in a distributed network environment. Specifically, the researchers proposed an asynchronous Bayesian learning model, aiming to overcome problems in traditional synchronous update algorithms such as high communication overhead, easy occurrence of bottlenecks, and sensitivity to link failures. ### Core Problems of the Paper 1. **Requirements for Distributed Bayesian Learning** - In many application scenarios, such as sensor networks and multi - agent systems, the data required to train machine - learning models are usually distributed across multiple computing nodes. - Due to logistical limitations (such as memory, efficient data - sharing means) or data sensitivity (such as confidentiality requirements), this data cannot be concentrated in one server location. - Therefore, a method is needed to enable all nodes to collaboratively train the same model without sharing the original data. 2. **Limitations of Existing Methods** - Most existing distributed Bayesian learning methods rely on synchronous updates, that is, all nodes perform parameter updates at the same time point. - This synchronous method has many problems in practical applications. For example, link failures may cause the algorithm to fail, slower nodes will slow down the progress of the entire system, and faster nodes will be idle waiting. 3. **The Proposed New Method** - The paper introduced a Bayesian learning algorithm based on the asynchronous gossip protocol and combined with an event - triggered mechanism. - Asynchronous updates allow any two randomly selected nodes to exchange information at any time, thereby reducing communication overhead and improving the robustness of the system. - The event - triggered mechanism further reduces the unnecessary communication frequency, and information is exchanged only when certain specific events occur. ### Formula Representation To ensure the correctness and readability of the formulas, the following are the key formulas involved in the paper: - **Energy Function** \[ E(w, X)=\sum_{i = 1}^{n}E_i(w, X_i)+C \] where \(E_i(w)=-\log p(X_i|w)-\frac{1}{n}\log p(w)\) - **Centralized Sampling** \[ w(k + 1)=w(k)-\alpha\nabla E(w(k), X)+\sqrt{2\alpha}v(k) \] where \(v(k)\sim N(0, I_{dw})\) - **Distributed Sampling** \[ w_i(k + 1)=w_i(k)-\beta_k\sum_{j\in N_i}(w_i(k)-w_j(k))-\alpha_kn\nabla E_i(w_i(k), X_i)+\sqrt{2\alpha_k}v_i(k) \] - **Asynchronous Gossip Protocol Update** \[ w_i(\tau_i(k)+ 1)=w_i(\tau_i(k))-\beta\sum_{j\in A_k}(w_i(\tau_i(k))-w_j(\tau_j(k)))-\frac{n\alpha}{2p_i}\nabla E_i(w_i(\tau_i(k)), X_i)+\sqrt{2\alpha}v_i(\tau_i(k)) \] - **Event - Triggered Condition** \[ \|e_i(k)\|_2^2=\|w_i(k)-\hat{w}_i(k)\|_2^2>\epsilon_i(k) \] Through these improvements, the paper provides a more practical and efficient distributed Bayesian learning framework, which is suitable for large - scale data sets and complex network environments.

Asynchronous Bayesian Learning over a Network

Asynchronous Local Computations in Distributed Bayesian Learning

A Communication-Efficient Algorithm for Exponentially Fast Non-Bayesian Learning in Networks

Network Fault-tolerant and Byzantine-resilient Social Learning via Collaborative Hierarchical Non-Bayesian Learning

Distributed Robust Bayesian Filtering for State Estimation.

Distributed Learning from Interactions in Social Networks

Distributed Bayesian Inference Over Sensor Networks

Asynchronous Byzantine-Robust Stochastic Aggregation with Variance Reduction for Distributed Learning

Scale-Robust Timely Asynchronous Decentralized Learning

Interaction-Based Distributed Learning in Cyber-Physical and Social Networks

Wireless Localization and Formation Control with Asynchronous Agents

Aggregative Efficiency of Bayesian Learning in Networks

Dynamics of Social Networks: Multi-agent Information Fusion, Anticipatory Decision Making and Polling

Distributed Learning of Predictive Structures from Multiple Tasks over Networks

Asynchronous Diffusion Learning with Agent Subsampling and Local Updates

On Non-Bayesian Social Learning

Asynchronous Networked Aggregative Games

On Generalized Bayesian Data Fusion with Complex Models in Large Scale Networks

Policy evaluation for reinforcement learning over asynchronous multi-agent networks

Asynchronous Adaptation and Learning over Networks --- Part I: Modeling and Stability Analysis

Frequentist Guarantees of Distributed (Non)-Bayesian Inference