Communication Compression Techniques in Distributed Deep Learning: A Survey

Zeqin Wang,Ming Wen,Yuedong Xu,Yipeng Zhou,Jessie Hui Wang,Liang Zhang
DOI: https://doi.org/10.1016/j.sysarc.2023.102927
IF: 5.836
2023-01-01
Journal of Systems Architecture
Abstract:Nowadays, the training data and neural network models are getting increasingly large. The training time of deep learning will become unbearably long on a single machine. To reduce the computation and storage burdens, distributed deep learning has been put forward to collaboratively train a large neural network model with multiple computing nodes in parallel. The unbalanced development of computation and communication capabilities has led to training time being dominated by communication time, making the communication overhead a major challenge toward efficient distributed deep learning. Communication compression is an effective method to alleviate communication overhead, and it has evolved from simple random sparsification or quantization to versatile strategies or data structures. In this survey, existing communication compression techniques are reviewed and classified to provide a bird's eye view. The main properties of each class of compression methods are analyzed, and their applications or theoretical convergence are described if necessary. This survey is potentially helpful for researchers and engineers to understand the up-to-date achievements on the communication compression techniques that accelerate the training of large deep learning models.
What problem does this paper attempt to address?