Abstract:In recent years, the burgeoning data market has witnessed a surge in data exchange, playing a pivotal role in augmenting the predictive and decision-making capabilities of machine learning . Despite these advancements, persistent concerns surrounding data privacy have resulted in stringent limitations on data sharing and trading. Consequently, the data market is undergoing a transformative shift from pricing individual datasets to pricing the models themselves. The challenge of training high-performance machine learning models with restricted data from a single client is substantial. Federated learning has emerged as a popular solution, allowing collaborative model training without the need to transfer client data beyond local environments. However, training federated models within a data market introduces several challenges, including the effective selection of clients for model training and the optimization of utility through model pricing. In response to these challenges, we propose the Tiered Federated Learning Client Selection Algorithm (TiFLCS-MAR), employing a multi-attribute reverse auction approach. Integrated into the federated learning framework, TiFLCS-MAR excels at the comprehensive evaluation of client attributes, employing a tiered strategy to mitigate issues arising from client heterogeneity. Additionally, we introduce the TiFLCS-MAR Pricing Framework (TiFLCS-MARP), leveraging Nash equilibrium principles to maximize profitability for both clients and servers. Our framework accommodates the heterogeneity of diverse clients, efficiently selecting suitable candidates from a large pool, thereby boosting training efficiency and curbing model pricing costs. Empirical evidence showcases the efficacy of federated training with TiFLCS-MAR, demonstrating nearly double the convergence speed and a 5-10 percentage point improvement in accuracy across real and synthetic datasets. Furthermore, when compared to three baseline algorithms, TiFLCS-MARP substantially increases central server revenue by factors of 1.99, 27.05, and 1.78, highlighting its superior performance in the data market context.

Billion-scale Federated Learning on Mobile Clients

Secure Federated Submodel Learning

Decentralized Personalized Online Federated Learning

Practical and Light-weight Secure Aggregation for Federated Submodel Learning

DistFL: Distribution-aware Federated Learning for Mobile Scenarios

Towards Efficient Federated Learning Framework Via Selective Aggregation of Models

Safely Learning with Private Data: A Federated Learning Framework for Large Language Model

Prioritized Multi-Criteria Federated Learning

Communication Efficient Federated Learning With Heterogeneous Structured Client Models

From distributed machine learning to federated learning: In the view of data privacy and security

FAIR: Quality-Aware Federated Learning with Precise User Incentive and Model Aggregation.

Auction Based Clustered Federated Learning in Mobile Edge Computing System

Exploring System-Heterogeneous Federated Learning with Dynamic Model Selection

A Secure and Fair Federated Learning Framework Based on Consensus Incentive Mechanism

Evaluation Framework For Large-scale Federated Learning

Efficient-FedRec: Efficient Federated Learning Framework for Privacy-Preserving News Recommendation

Toward Resource-Efficient Federated Learning in Mobile Edge Computing

An Online Learning Approach for Client Selection in Federated Edge Learning under Budget Constraint

Multicore Federated Learning for Mobile-Edge Computing Platforms

TiFLCS-MARP: Client selection and model pricing for federated learning in data markets

Privacy-Preserving and Robust Federated Deep Metric Learning