Abstract:Federated learning (FL) enables distribution of machine learning workloads from the cloud to resource-limited edge devices. Unfortunately, current deep networks remain not only too compute-heavy for inference and training on edge devices, but also too large for communicating updates over bandwidth-constrained networks. In this paper, we develop, implement, and experimentally validate a novel FL framework termed Federated Dynamic Sparse Training (FedDST) by which complex neural networks can be deployed and trained with substantially improved efficiency in both on-device computation and in-network communication. At the core of FedDST is a dynamic process that extracts and trains sparse sub-networks from the target full network. With this scheme, "two birds are killed with one stone:" instead of full models, each client performs efficient training of its own sparse networks, and only sparse networks are transmitted between devices and the cloud. Furthermore, our results reveal that the dynamic sparsity during FL training more flexibly accommodates local heterogeneity in FL agents than the fixed, shared sparse masks. Moreover, dynamic sparsity naturally introduces an "in-time self-ensembling effect" into the training dynamics and improves the FL performance even over dense training. In a realistic and challenging non i.i.d. FL setting, FedDST consistently outperforms competing algorithms in our experiments: for instance, at any fixed upload data cap on non-iid CIFAR-10, it gains an impressive accuracy advantage of 10% over FedAvgM when given the same upload data cap; the accuracy gap remains 3% even when FedAvgM is given 2x the upload data cap, further demonstrating efficacy of FedDST. Code is available at: <a class="link-external link-https" href="https://github.com/bibikar/feddst" rel="external noopener nofollow">this https URL</a>.

Distributed Training for Conditional Random Fields.

Joint CRF and Locality-Consistent Dictionary Learning for Semantic Segmentation.

CRF with Locality-Consistent Dictionary Learning for Semantic Segmentation

Parameter Estimation of Conditional Random Fields Model Based on Cloud Computing

Approach to Parallel Computing Conditional Random Field with Dataflow Processors

Efficient robust conditional random fields

Hadoop Recognition of Biomedical Named Entity Using Conditional Random Fields

Analyzing Sequence Data Based on Conditional Random Fields with Co-Training

Simplifying Distributed Neural Network Training on Massive Graphs: Randomized Partitions Improve Model Aggregation

Conditional Random Fields with Decode-based Learning: Simpler and Faster

Distributed Training Optimization for DCU

Efficient Sdp Inference For Fully-Connected Crfs Based On Low-Rank Decomposition

Cloudless-Training: A Framework to Improve Efficiency of Geo-Distributed ML Training

A Cluster-Driven Adaptive Training Approach for Federated Learning

Flexible Clustered Federated Learning for Client-Level Data Distribution Shift

Continuous Conditional Random Field Convolution for Point Cloud Segmentation

Tunnel condition assessment via cloud model‐based random forests and self‐training approach

Distributed Denial of Service Attacks Detection Method Based on Conditional Random Fields.

Domain Adaptation for Conditional Random Fields

Federated Dynamic Sparse Training: Computing Less, Communicating Less, Yet Learning Better

Chinese keyword extraction model with distributed computing