Reconfigurable neural network acceleration method and architecture

Yin Shouyi,Tu Fengbin,Yan Jiale,Ouyang Peng,Tang Shibin,Liu Leibo,Wei Shaojun
2018-01-01
Abstract:The invention provides a reconfigurable neural network acceleration method and architecture. In the invention, the modes of input data multiplexing, output data multiplexing and weighted data multiplexing are adopted respectively through the architecture of an input buffer unit, a weight buffer unit, a convolution calculation kernel unit and an output buffer unit, the read input data and convolution kernel are subjected to convolution operation through the convolution calculation kernel unit, and the output data are generated. According to the application, a strategy of layer-by-layer acceleration is adopted to deal with a neural network with a variety of layers, a method of cyclic transformation is used to optimize the acceleration of the neural network, the number of times of access to the Buffer and the DRAM is reduced, the problem of a waste of power consumption due to a large number of times of access to the memory in the prior art is solved, and the beneficial effects of reducingthe energy consumption and maximizing the utilization rate of hardware resources of a PE array are gained.
What problem does this paper attempt to address?