Toward Large-scale Spiking Neural Networks: A Comprehensive Survey and Future Directions

Yangfan Hu,Qian Zheng,Guoqi Li,Huajin Tang,Gang Pan
2024-08-19
Abstract:Deep learning has revolutionized artificial intelligence (AI), achieving remarkable progress in fields such as computer vision, speech recognition, and natural language processing. Moreover, the recent success of large language models (LLMs) has fueled a surge in research on large-scale neural networks. However, the escalating demand for computing resources and energy consumption has prompted the search for energy-efficient alternatives. Inspired by the human brain, spiking neural networks (SNNs) promise energy-efficient computation with event-driven spikes. To provide future directions toward building energy-efficient large SNN models, we present a survey of existing methods for developing deep spiking neural networks, with a focus on emerging Spiking Transformers. Our main contributions are as follows: (1) an overview of learning methods for deep spiking neural networks, categorized by ANN-to-SNN conversion and direct training with surrogate gradients; (2) an overview of network architectures for deep spiking neural networks, categorized by deep convolutional neural networks (DCNNs) and Transformer architecture; and (3) a comprehensive comparison of state-of-the-art deep SNNs with a focus on emerging Spiking Transformers. We then further discuss and outline future directions toward large-scale SNNs.
Machine Learning
What problem does this paper attempt to address?
The paper is primarily dedicated to addressing the challenges faced by large-scale Spiking Neural Networks (SNNs) in practical applications, particularly in terms of energy efficiency. Specifically: 1. **Background and Motivation**: With the development of deep learning technology, especially the success of large language models (LLMs), the demand for computational resources has increased dramatically, leading to issues of energy consumption and carbon footprint. The human brain can perform complex tasks with a power budget of about 20 watts, whereas current deep neural networks (DNNs) like GPT-3 require a significant amount of electrical power. Inspired by the human brain, researchers have proposed Spiking Neural Networks (SNNs) to achieve high-efficiency computing. 2. **Core Issues**: This paper aims to provide a comprehensive survey of existing deep spiking neural network development methods and to highlight the emerging Spiking Transformers architecture, offering future directions for building efficient large-scale SNN models. The main content includes: - An overview of learning methods for deep spiking neural networks, divided into methods for converting from artificial neural networks (ANNs) to SNNs and methods for training directly using surrogate gradients; - A review of network architectures, divided into deep convolutional neural networks (DCNNs) and transformer-based architectures; - A comprehensive comparison of the latest deep SNNs, especially the emerging Spiking Transformers. 3. **Contributions**: The main contribution of the paper is to provide a systematic review of the learning methods, network architectures, and performance of deep SNNs, and to explore future directions for building large-scale SNNs. In short, the goal of this paper is to promote the development of efficient and large-scale Spiking Neural Networks through in-depth analysis of existing technologies and the proposal of new ideas.