Abstract:Organizations managing high-performance computing systems face a multitude of challenges, including overarching concerns such as overall energy consumption, microprocessor clock frequency limitations, and the escalating costs associated with chip production. Evidently, processor speeds have plateaued over the last decade, persisting within the range of 2 GHz to 5 GHz. Scholars assert that brain-inspired computing holds substantial promise for mitigating these challenges. The spiking neural network (SNN) particularly stands out for its commendable power efficiency when juxtaposed with conventional design paradigms. Nevertheless, our scrutiny has brought to light several pivotal challenges impeding the seamless implementation of large-scale neural networks (NNs) on silicon. These challenges encompass the absence of automated tools, the need for multifaceted domain expertise, and the inadequacy of existing algorithms to efficiently partition and place extensive SNN computations onto hardware infrastructure. In this paper, we posit the development of an automated tool flow capable of transmuting any NN into an SNN. This undertaking involves the creation of a novel graph-partitioning algorithm designed to strategically place SNNs on a network-on-chip (NoC), thereby paving the way for future energy-efficient and high-performance computing paradigms. The presented methodology showcases its effectiveness by successfully transforming ANN architectures into SNNs with a marginal average error penalty of merely 2.65%. The proposed graph-partitioning algorithm enables a 14.22% decrease in inter-synaptic communication and an 87.58% reduction in intra-synaptic communication, on average, underscoring the effectiveness of the proposed algorithm in optimizing NN communication pathways. Compared to a baseline graph-partitioning algorithm, the proposed approach exhibits an average decrease of 79.74% in latency and a 14.67% reduction in energy consumption. Using existing NoC tools, the energy-latency product of SNN architectures is, on average, 82.71% lower than that of the baseline architectures.

Cerebron: A Reconfigurable Architecture for Spatiotemporal Sparse Spiking Neural Networks

A Scatter-and-Gather Spiking Convolutional Neural Network on a Reconfigurable Neuromorphic Hardware

A Sparsity-Adapted Hardware Implementation of SNN for Cortical Spike Trains Decoding

A Reconfigurable FPGA-based Spiking Neural Network Accelerator

Mapping Very Large Scale Spiking Neuron Network to Neuromorphic Hardware.

An Energy-Efficient Spiking Neural Network Accelerator Based on Spatio-Temporal Redundancy Reduction

A A 22nm 0.43pJ/SOP Sparsity-Aware In-Memory Neuromorphic Computing System with Hybrid Spiking and Artificial Neural Network and Configurable Topology

Memory-Efficient Reversible Spiking Neural Networks

Hardware-Software Co-optimised Fast and Accurate Deep Reconfigurable Spiking Inference Accelerator Architecture Design Methodology

A Convolutional Spiking Neural Network Accelerator with the Sparsity-Aware Memory and Compressed Weights

30.2 A 22nm 0.26nW/Synapse Spike-Driven Spiking Neural Network Processing Unit Using Time-Step-First Dataflow and Sparsity-Adaptive In-Memory Computing

Compiling Spiking Neural Networks to Neuromorphic Hardware

SpikeSim: An end-to-end Compute-in-Memory Hardware Evaluation Tool for Benchmarking Spiking Neural Networks

ESSA: Design of a Programmable Efficient Sparse Spiking Neural Network Accelerator

Multi-core ARM-based Hardware-Accelerated Computation for Spiking Neural Networks

An Asynchronous Multi-core Accelerator for SNN inference

An Event-driven Spiking Neural Network Accelerator with On-chip Sparse Weight

Benchmarking Artificial Neural Network Architectures for High-Performance Spiking Neural Networks

A Cost-Efficient High-Speed VLSI Architecture for Spiking Convolutional Neural Network Inference Using Time-Step Binary Spike Maps

Spiking Deep Convolutional Neural Networks for Energy-Efficient Object Recognition

Efficient GCN Deployment with Spiking Property on Spatial-Temporal Neuromorphic Chips.