An Asynchronous Multi-core Accelerator for SNN inference

Zhuo Chen,De Ma,Xiaofei Jin,Qinghui Xing,Ouwen Jin,Xin Du,Shuibing He,Gang Pan
2024-07-31
Abstract:Spiking Neural Networks (SNNs) are extensively utilized in brain-inspired computing and neuroscience research. To enhance the speed and energy efficiency of SNNs, several many-core accelerators have been developed. However, maintaining the accuracy of SNNs often necessitates frequent explicit synchronization among all cores, which presents a challenge to overall efficiency. In this paper, we propose an asynchronous architecture for Spiking Neural Networks (SNNs) that eliminates the need for inter-core synchronization, thus enhancing speed and energy efficiency. This approach leverages the pre-determined dependencies of neuromorphic cores established during compilation. Each core is equipped with a scheduler that monitors the status of its dependencies, allowing it to safely advance to the next timestep without waiting for other cores. This eliminates the necessity for global synchronization and minimizes core waiting time despite inherent workload imbalances. Comprehensive evaluations using five different SNN workloads show that our architecture achieves a 1.86x speedup and a 1.55x increase in energy efficiency compared to state-of-the-art synchronization architectures.
Neural and Evolutionary Computing
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to eliminate the inefficiency caused by multi - core synchronization in the process of achieving efficient and low - energy - consumption SNN (Spiking Neural Networks) inference in SNN accelerators. Specifically, existing multi - core SNN accelerators need to perform explicit synchronization on all cores at each time step in order to maintain the accuracy of SNN. Although this synchronization mechanism ensures the accuracy of the results, it also leads to a large amount of waiting time and reduces the overall efficiency. Therefore, the paper proposes an asynchronous architecture (DepAsync), aiming to improve the speed and energy efficiency of SNN inference by eliminating the need for synchronization between cores while maintaining the precision at each step (i.e., temporal accuracy). The core of this method lies in using the core dependencies determined at compile time to design a scheduling mechanism to dynamically track the states of these dependencies, so that each core can safely move on to the next time step without waiting for other cores to complete their tasks. This not only reduces the redundant waiting time caused by inter - core and temporal imbalances but also improves the flexibility and performance of the system.