BASGD: Buffered Asynchronous SGD for Byzantine Learning

Yi-Rui Yang, Wu-Jun Li
DOI: https://doi.org/10.48550/arxiv.2003.00937
2021-01-01
Abstract: Distributed learning has become a hot research topic due to its wide application in cluster-based large-scale learning, federated learning, edge computing and so on. Most traditional distributed learning methods typically assume no failure or attack. However, many unexpected cases, such as communication failure and even malicious attack, may happen in real applications. Hence, Byzantine learning (BL), which refers to distributed learning with failure or attack, has recently attracted much attention. Most existing BL methods are synchronous, which are impractical in some applications due to heterogeneous or offline workers. In these cases, asynchronous BL (ABL) is usually preferred. In this paper, we propose a novel method, called buffered asynchronous stochastic gradient descent (BASGD), for ABL. To the best of our knowledge, BASGD is the first ABL method that can resist non-omniscient attacks without storing any instances on server. Furthermore, we also propose an improved variant of BASGD, called BASGD with momentum (BASGDm), by introducing momentum into BASGD. BASGDm can resist both non-omniscient and omniscient attacks. Compared with those methods which need to store instances on server, BASGD and BASGDm have a wider scope of application. Both BASGD and BASGDm are compatible with various aggregation rules. Moreover, both BASGD and BASGDm are proved to be convergent and be able to resist failure or attack. Empirical results show that our methods significantly outperform existing ABL baselines when there exists failure or attack on workers.