Research on CNN Parallel Computing and Learning Architecture Based on Real-Time Streaming Architecture.

Yuting Zhu, Liang Qian, Chuyan Wang, Lianghui Ding, Feng Yang, Hao Wang
DOI: https://doi.org/10.1007/978-3-030-05366-6_12
2019-01-01
Abstract: A convolutional neural network (CNN) is a deep feed-forward artificial neural network that is widely used in image recognition. However, CNN training suffers from long training times and insufficient memory. Traditional acceleration methods are mostly limited to optimizing one specific algorithm. In this paper, we propose CNN-S, a method built on Storm that improves training efficiency and resource cost and is suitable for any algorithm. The model divides the data into several subsets and processes them flexibly in parallel on several machines. Experimental results show that, to reach a recognition accuracy of 95%, the single serial model needs about 913 s of training time, whereas the CNN-S model needs only 248 s, an acceleration ratio of 3.681. This shows that the CNN-S parallel model outperforms the single serial model in training efficiency and system resource cost.
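The two quantitative ideas in the abstract, splitting the data into subsets and the reported acceleration ratio, can be illustrated with a minimal Python sketch. The abstract does not state how the data are partitioned or how many machines were used, so `split_into_subsets` and its `num_workers` parameter are hypothetical helpers chosen for illustration (contiguous, nearly equal chunks); they are not the paper's implementation and do not use any Storm API. `speedup` simply divides the serial training time by the parallel training time, using the approximate timings quoted in the abstract.

```python
def speedup(serial_seconds, parallel_seconds):
    """Acceleration ratio: serial training time divided by parallel training time."""
    if parallel_seconds <= 0:
        raise ValueError("parallel_seconds must be positive")
    return serial_seconds / parallel_seconds


def split_into_subsets(samples, num_workers):
    """Divide samples into num_workers contiguous, nearly equal subsets.

    Hypothetical partitioning scheme (an assumption, not taken from the paper):
    the first `extra` subsets receive one additional sample each.
    """
    if num_workers <= 0:
        raise ValueError("num_workers must be positive")
    base, extra = divmod(len(samples), num_workers)
    subsets, start = [], 0
    for i in range(num_workers):
        size = base + (1 if i < extra else 0)
        subsets.append(samples[start:start + size])
        start += size
    return subsets


# Timings quoted in the abstract (both approximate): 913 s serial, 248 s with CNN-S.
ratio = speedup(913, 248)
print(f"acceleration ratio: {ratio:.3f}")

# Ten toy samples spread over three hypothetical workers.
print(split_into_subsets(list(range(10)), 3))
```

Dividing 913 by 248 gives roughly 3.681, which matches the acceleration ratio reported in the abstract. In a real deployment each subset would be handed to a separate worker; here the subsets are only returned as lists.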