Abstract:Blockchain-based federated learning has gained significant interest over the last few years with the increasing concern for data privacy, advances in machine learning, and blockchain innovation. However, gaps in security and scalability hinder the development of real-world applications. In this study, we propose ScaleSFL, which is a scalable blockchain-based sharding solution for federated learning. ScaleSFL supports interoperability by separating the off-chain federated learning component in order to verify model updates instead of controlling the entire federated learning flow. We implemented ScaleSFL as a proof-of-concept prototype system using Hyperledger Fabric to demonstrate the feasibility of the solution. We present a performance evaluation of results collected through Hyperledger Caliper benchmarking tools conducted on model creation. Our evaluation results show that sharding can improve validation performance linearly while remaining efficient and secure.

What problem does this paper attempt to address?

The problems that this paper attempts to solve mainly focus on the security and scalability issues in Blockchain - based Federated Learning (BFL). Specifically: 1. **Security issues**: Although the traditional federated learning framework can protect data privacy, there are two main security risks: - **Data leakage risk**: Model updates may indirectly leak the local data used to train these models, especially through statistical analysis and data mining techniques, and users may be re - anonymized. - **Single - point failure**: The centralized model aggregation method creates a single - point failure, relying on a trusted central server to aggregate model updates, which is an obvious weakness in a decentralized environment. 2. **Scalability issues**: As the number of participating nodes in the network increases, the computational complexity and communication overhead of traditional federated learning methods in verifying and aggregating model updates increase significantly, leading to a decline in system performance. Especially in large - scale distributed systems, this performance bottleneck is more obvious. To solve the above problems, the author proposes a sharding solution named ScaleSFL. The main features of ScaleSFL include: - **Sharding mechanism**: By dividing the network into multiple shards, each shard independently verifies and partially aggregates model updates, thereby reducing the global computational amount and communication overhead. - **Committee consensus**: A committee is elected in each shard, which is responsible for verifying model updates and generating new blocks. Committee members ensure the validity and security of model updates through a local consensus mechanism. - **Flexible security strategy**: ScaleSFL supports pluggable poisoning mitigation and detection strategies to deal with false or harmful model updates submitted by malicious clients. Through these designs, ScaleSFL aims to improve the security and scalability of the blockchain - based federated learning system, enabling it to operate more effectively in practical applications.

ScaleSFL: A Sharding Solution for Blockchain-Based Federated Learning

Decentralized Iot Data Sharing: A Blockchain-Based Federated Learning Approach with Joint Optimizations for Efficiency and Privacy

GFL: A Decentralized Federated Learning Framework Based On Blockchain.

Towards a Secure and Reliable Federated Learning using Blockchain

Enhancing Scalability and Reliability in Semi-Decentralized Federated Learning With Blockchain: Trust Penalization and Asynchronous Functionality

Secure and Efficient Decentralized Federated Learning with Data Representation Protection

BlockDFL: A Blockchain-based Fully Decentralized Peer-to-Peer Federated Learning Framework

A Federated Learning Method Based on Blockchain and Cluster Training

GFL: A Decentralized Federated Learning Framework Based On Blockchain

When Federated Learning Meets Blockchain: A New Distributed Learning Paradigm

Fairness, Integrity, and Privacy in a Scalable Blockchain-based Federated Learning System

Towards blockchain-enabled decentralized and secure federated learning

DAG-Based Blockchain Sharding for Secure Federated Learning with Non-IID Data

A privacy-preserving federated learning framework for blockchain networks

Colorimetric characterisation of flatbed scanners for rock/sediment imaging

A Blockchain System for Clustered Federated Learning with Peer-to-Peer Knowledge Transfer.

A Research and Analysis of Blockchain Federated Learning

Quantifying Bytes: Understanding Practical Value of Data Assets in Federated Learning

A verifiable and privacy-preserving blockchain-based federated learning approach

DRL-based Adaptive Sharding for Blockchain-based Federated Learning

SCALE: Self-regulated Clustered federAted LEarning in a Homogeneous Environment