What are guanosine triphosphate-binding proteins doing in mitochondria?

M. Thomson

DOI: https://doi.org/10.1016/S0167-4889(98)00069-X

1998-07-24

Biochimica et Biophysica Acta

Abstract:

What problem does this paper attempt to address?

Towards Efficient Deployment of Hybrid SNNs on Neuromorphic and Edge AI Hardware

James Seekings,Peyton Chandarana,Mahsa Ardakani,MohammadReza Mohammadi,Ramtin Zand

2024-07-12

Abstract:This paper explores the synergistic potential of neuromorphic and edge computing to create a versatile machine learning (ML) system tailored for processing data captured by dynamic vision sensors. We construct and train hybrid models, blending spiking neural networks (SNNs) and artificial neural networks (ANNs) using PyTorch and Lava frameworks. Our hybrid architecture integrates an SNN for temporal feature extraction and an ANN for classification. We delve into the challenges of deploying such hybrid structures on hardware. Specifically, we deploy individual components on Intel's Neuromorphic Processor Loihi (for SNN) and Jetson Nano (for ANN). We also propose an accumulator circuit to transfer data from the spiking to the non-spiking domain. Furthermore, we conduct comprehensive performance analyses of hybrid SNN-ANN models on a heterogeneous system of neuromorphic and edge AI hardware, evaluating accuracy, latency, power, and energy consumption. Our findings demonstrate that the hybrid spiking networks surpass the baseline ANN model across all metrics and outperform the baseline SNN model in accuracy and latency.

Neural and Evolutionary Computing,Artificial Intelligence,Hardware Architecture,Computer Vision and Pattern Recognition,Machine Learning
Multi-grained system integration for hybrid-paradigm brain-inspired computing

Jing Pei,Lei Deng,Cheng Ma,Xue Liu,Luping Shi

DOI: https://doi.org/10.1007/s11432-021-3510-6

2023-03-15

Science China Information Sciences

Abstract:Hybrid neuromorphic computing supporting the prevailing artificial neural networks and neuroscience-inspired models/algorithms offers substantial flexibility for cross-paradigm model integration. It is one of the most promising technologies for accelerating intelligence development, ultimately contributing to artificial general intelligence development. Recently, an increasing number of hybrid neuromorphic computing chips have been reported, but such research focuses on chip design without demonstrating systems for large-scale workloads. To this end, we construct a multi-grained system based on many Tianjic chips, presenting a large-scale system for hybrid-paradigm brain-inspired computing. With different numbers of chips and different connection topologies, we develop a Tianjic card and a Tianjic board as the infrastructure for building embedded systems and cloud servers, respectively. Extensive measurements of the communication latency, computational latency, and power consumption evidence the superior potential of Tianjic systems for exploring brain-inspired computing for artificial general intelligence.

computer science, information systems,engineering, electrical & electronic
NEUTRAMS: Neural Network Transformation and Co-Design under Neuromorphic Hardware Constraints

Yu Ji,YouHui Zhang,ShuangChen Li,Ping Chi,CiHang Jiang,Peng Qu,Yuan Xie,WenGuang Chen

DOI: https://doi.org/10.1109/micro.2016.7783724

2016-01-01

Abstract:With the recent reincarnations of neuromorphic computing comes the promise of a new computing paradigm, with a focus on the design and fabrication of neuromorphic chips. A key challenge in design, however, is that programming such chips is difficult. This paper proposes a systematic methodology with a set of tools to address this challenge. The proposed toolset is called NEUTRAMS (Neural network Transformation, Mapping and Simulation), and includes three key components: a neural network (NN) transformation algorithm, a configurable clock-driven simulator of neuromorphic chips and an optimized runtime tool that maps NNs onto the target hardware for better resource utilization. To address the challenges of hardware constraints on implementing NN models (such as the maximum fan-in/fan-out of a single neuron, limited precision, and various neuron models), the transformation algorithm divides an existing NN into a set of simple network units and retrains each unit iteratively, to transform the original one into its counterpart under such constraints. It can support both spiking neural networks (SNNs) and traditional artificial neural networks (ANNs), including convolutional neural networks (CNNs) and multilayer perceptrons (MLPs) and recurrent neural networks (RNNs). With the combination of these tools, we have explored the hardware/software co-design space of the correlation between network error-rates and hardware constraints and consumptions. Doing so provides insights which can support the design of future neuromorphic architectures. The usefulness of such a toolset has been demonstrated with two different designs: a real Complementary Metal-Oxide-Semiconductor (CMOS) neuromorphic chip for both SNNs and ANNs and a processing-in-memory architecture design for ANNs.
Tianjic: A Unified and Scalable Chip Bridging Spike-Based and Continuous Neural Computation

Lei Deng,Guanrui Wang,Guoqi Li,Shuangchen Li,Ling Liang,Maohua Zhu,Yujie Wu,Zheyu Yang,Zhe Zou,Jing Pei,Zhenzhi Wu,Xing Hu,Yufei Ding,Wei He,Yuan Xie,Luping Shi

DOI: https://doi.org/10.1109/jssc.2020.2970709

IF: 5.4

2020-01-01

IEEE Journal of Solid-State Circuits

Abstract:Toward the long-standing dream of artificial intelligence, two successful solution paths have been paved: 1) neuromorphic computing and 2) deep learning. Recently, they tend to interact for simultaneously achieving biological plausibility and powerful accuracy. However, models from these two domains have to run on distinct substrates, i.e., neuromorphic platforms and deep learning accelerators, respectively. This architectural incompatibility greatly compromises the modeling flexibility and hinders promising interdisciplinary research. To address this issue, we build a unified model description framework and a unified processing architecture (Tianjic), which covers the full stack from software to hardware. By implementing a set of integration and transformation operations, Tianjic is able to support spiking neural networks, biological dynamic neural networks, multilayered perceptron, convolutional neural networks, recurrent neural networks, and so on. A compatible routing infrastructure enables homogeneous and heterogeneous scalability on a decentralized many-core network. Several optimization methods are incorporated, such as resource and data sharing, near-memory processing, compute/access skipping, and intra-/inter-core pipeline, to improve performance and efficiency. We further design streaming mapping schemes for efficient network deployment with a flexible tradeoff between execution throughput and resource overhead. A 28-nm prototype chip is fabricated with >610-GB/s internal memory bandwidth. A variety of benchmarks are evaluated and compared with GPUs and several existing specialized platforms. In summary, the fully unfolded mapping can achieve significantly higher throughput and power efficiency; the semi-folded mapping can save 30x resources while still presenting comparable performance on average. Finally, two hybrid-paradigm examples, a multimodal unmanned bicycle and a hybrid neural network, are demonstrated to show the potential of our unified architecture. This article paves a new way to explore neural computing.
Modular Building Blocks for Mapping Spiking Neural Networks Onto a Programmable Neuromorphic Processor

Chenglong Zou,Xiaoxin Cui,Guang Chen,Shuo Feng,Kefei Liu,Xinan Wang,Yuan Wang

DOI: https://doi.org/10.1016/j.mejo.2022.105612

IF: 1.992

2022-01-01

Microelectronics Journal

Abstract:In the past few years, brain-inspired artificial intelligence (AI) system with energy-efficient neuromorphic computing has gathered lots of interests. However, how to build an efficient algorithm-to-hardware development tool for spiking neural networks (SNNs) remains a big challenge. In this paper, we present a novel design of neural controller which can flexibly generate the store-and-release signals in the designed SNNs. Further, we propose a mapping methodology with modular building blocks to deploy SNN models onto a many-core pro-grammable neuromorphic processor. Experimental results show that the presented mapping method can be adaptive to various SNN layers of different sizes and quantization precisions. Besides, our demonstrated system on chip can achieve about 181 and 26 images per second runtime inference speed on MNIST and CIFAR-10 dataset respectively and show comparable accuracies and significantly better power performance with their artificial neural network (ANN) counterparts running on traditional GPU.
A Hybrid Heterogeneous Neural Network Accelerator Based on Systolic Array

Zilin Wang,Yi Zhong,Guang Chen,Shuo Feng,Youming Yang,Xiaoxin Cui,Yuan Wang

DOI: https://doi.org/10.1109/aicas59952.2024.10595910

2024-01-01

Abstract:Spiking neural networks (SNNs) and artificial neural networks (ANNs) are two methods to achieve artificial intelligence. Realizing the long-term and ambitious ideal of a universal artificial intelligence platform requires hardware that supports both SNN and ANN models. ANNs, such as convolutional neural networks (CNNs), are adept at extracting features and have achieved outstanding achievements in many fields. Due to the event-driven processing paradigm, SNNs can achieve competitive accuracy with minimal power usage. The weight accumulation of SNN and ANN is similar, but the activation function is significantly different. How to integrate SNN and ANN on one hardware platform is a challenge. In this paper, we propose a hybrid heterogeneous neural network accelerator that can support both SNN and ANN models. It also has good compatibility with the binary neural network (BNN) models. Our design is implemented on Xilinx XCKU115 FPGA, which can achieve the peak performance of 48.51GSOP/s (SNN) and 26.38GOP/s (ANN). The energy efficiency is 25.8GSOP/W (SNN) and 12.05GOP/W (ANN). Moreover, it supports the SNN-ANN fusion mode, where each row of processing elements (PEs) can independently perform SNN or ANN computation, allowing the design to fully leverage the respective strengths of SNN and ANN.
Bridging the Gap Between Neural Networks and Neuromorphic Hardware with A Neural Network Compiler

Yu Ji,YouHui Zhang,WenGuang Chen,Yuan Xie

DOI: https://doi.org/10.48550/arXiv.1801.00746

2017-11-15

Neural and Evolutionary Computing

Abstract:Different from developing neural networks (NNs) for general-purpose processors, the development for NN chips usually faces with some hardware-specific restrictions, such as limited precision of network signals and parameters, constrained computation scale, and limited types of non-linear functions. This paper proposes a general methodology to address the challenges. We decouple the NN applications from the target hardware by introducing a compiler that can transform an existing trained, unrestricted NN into an equivalent network that meets the given hardware's constraints. We propose multiple techniques to make the transformation adaptable to different kinds of NN chips, and reliable for restrict hardware constraints. We have built such a software tool that supports both spiking neural networks (SNNs) and traditional artificial neural networks (ANNs). We have demonstrated its effectiveness with a fabricated neuromorphic chip and a processing-in-memory (PIM) design. Tests show that the inference error caused by this solution is insignificant and the transformation time is much shorter than the retraining time. Also, we have studied the parameter-sensitivity evaluations to explore the tradeoffs between network error and resource utilization for different transformation strategies, which could provide insights for co-design optimization of neuromorphic hardware and software.
A High Energy-Efficiency Multi-core Neuromorphic Architecture for Deep SNN Training

Mingjing Li,Huihui Zhou,Xiaofeng Xu,Zhiwei Zhong,Puli Quan,Xueke Zhu,Yanyu Lin,Wenjie Lin,Hongyu Guo,Junchao Zhang,Yunhao Ma,Wei Wang,Zhengyu Ma,Guoqi Li,Xiaoxin Cui,Yonghong Tian

2024-11-26

Abstract:There is a growing necessity for edge training to adapt to dynamically changing environment. Neuromorphic computing represents a significant pathway for high-efficiency intelligent computation in energy-constrained edges, but existing neuromorphic architectures lack the ability of directly training spiking neural networks (SNNs) based on backpropagation. We develop a multi-core neuromorphic architecture with Feedforward-Propagation, Back-Propagation, and Weight-Gradient engines in each core, supporting high efficient parallel computing at both the engine and core levels. It combines various data flows and sparse computation optimization by fully leveraging the sparsity in SNN training, obtaining a high energy efficiency of 1.05TFLOPS/W@ FP16 @ 28nm, 55 ~ 85% reduction of DRAM access compared to A100 GPU in SNN trainings, and a 20-core deep SNN training and a 5-worker federated learning on FPGAs. Our study develops the first multi-core neuromorphic architecture supporting the direct SNN training, facilitating the neuromorphic computing in edge-learnable applications.

Hardware Architecture,Distributed, Parallel, and Cluster Computing,Machine Learning
Compiling Spiking Neural Networks to Neuromorphic Hardware

Shihao Song,Adarsha Balaji,Anup Das,Nagarajan Kandasamy,James Shackleford

DOI: https://doi.org/10.1145/3372799.3394364

2020-05-12

Abstract:Machine learning applications that are implemented with spike-based computation model, e.g., Spiking Neural Network (SNN), have a great potential to lower the energy consumption when they are executed on a neuromorphic hardware. However, compiling and mapping an SNN to the hardware is challenging, especially when compute and storage resources of the hardware (viz. crossbar) need to be shared among the neurons and synapses of the SNN. We propose an approach to analyze and compile SNNs on a resource-constrained neuromorphic hardware, providing guarantee on key performance metrics such as execution time and throughput. Our approach makes the following three key contributions. First, we propose a greedy technique to partition an SNN into clusters of neurons and synapses such that each cluster can fit on to the resources of a crossbar. Second, we exploit the rich semantics and expressiveness of Synchronous Dataflow Graphs (SDFGs) to represent a clustered SNN and analyze its performance using Max-Plus Algebra, considering the available compute and storage capacities, buffer sizes, and communication bandwidth. Third, we propose a self-timed execution-based fast technique to compile and admit SNN-based applications to a neuromorphic hardware at run-time, adapting dynamically to the available resources on the hardware. We evaluate our approach with standard SNN-based applications and demonstrate a significant performance improvement compared to current practices.

Distributed, Parallel, and Cluster Computing,Hardware Architecture,Neural and Evolutionary Computing
Bridge the Gap Between Neural Networks and Neuromorphic Hardware with a Neural Network Compiler

Yu Ji,YouHui Zhang,WenGuang Chen,Yuan Xie

DOI: https://doi.org/10.1109/pact.2017.59

2018-01-01

Abstract:Different from developing neural networks (NNs) for general-purpose processors, the development for NN chips usually faces with some hardware-specific restrictions, such as limited precision of network signals and parameters, constrained computation scale, and limited types of non-linear functions. This paper proposes a general methodology to address the challenges. We decouple the NN applications from the target hardware by introducing a compiler that can transform an existing trained, unrestricted NN into an equivalent network that meets the given hardware's constraints. We propose multiple techniques to make the transformation adaptable to different kinds of NN chips, and reliable for restrict hardware constraints. We have built such a software tool that supports both spiking neural networks (SNNs) and traditional artificial neural networks (ANNs). We have demonstrated its effectiveness with a fabricated neuromorphic chip and a processing-in-memory (PIM) design. Tests show that the inference error caused by this solution is insignificant and the transformation time is much shorter than the retraining time. Also, we have studied the parameter-sensitivity evaluations to explore the tradeoffs between network error and resource utilization for different transformation strategies, which could provide insights for co-design optimization of neuromorphic hardware and software.
Sparsity-Aware In-Memory Neuromorphic Computing Unit With Configurable Topology of Hybrid Spiking and Artificial Neural Network

Ying Liu,Zhiyuan Chen,Wentao Zhao,Tianhao Zhao,Tianyu Jia,Zhixuan Wang,Ru Huang,Le Ye,Yufei Ma

DOI: https://doi.org/10.1109/tcsi.2024.3377700

2024-01-01

Abstract:Spiking neural networks (SNNs) have shown great potential in achieving high energy efficiency and low power consumption compared to artificial neural networks (ANNs). However, there remains a significant accuracy gap between SNNs and ANNs. To address this issue, we present an in-memory neuromorphic computing (IMNC) chip that supports hybrid spiking/artificial neural networks (S/ANNs) and sparsity-aware data flows. With the IMNC chip, we aim to improve inference accuracy while simultaneously achieving high energy efficiency through optimization at the algorithm, architecture, and circuit levels. First, at the algorithm level, we note that SNNs extract temporal features from input spikes using time-domain convolution operations. Based on this insight, we efficiently utilize leaky integrate (LI) neurons to hybridize SNNs and ANNs, thereby improving accuracy while maintaining highly sparse operations. Second, at the architecture level, we design a sparsity-aware architecture that supports a hybrid S/ANN topology with varying sparsity. Finally, at the circuit level, we propose a ring-based in-memory computing (IMC) macro, whose energy consumption is inversely proportional to the input sparsity, making it ideal for performing energy-efficient multiplication and accumulation (MAC) operations in both SNNs and ANNs. We evaluate the proposed hybrid S/ANNs on various classification tasks and demonstrate their stronger classification and generalization ability compared with pure SNNs. Notably, our IMNC chip, fabricated using 22 nm CMOS technology, achieves impressive measured accuracy rates of over 95% for voice activity detection (VAD) and ECG anomaly detection. Additionally, our IMNC chip demonstrates superior dynamic energy efficiency of 0.43 pJ per synaptic operation, outperforming related works.

engineering, electrical & electronic
Mapping Very Large Scale Spiking Neuron Network to Neuromorphic Hardware.

Ouwen Jin,Qinghui Xing,Ying Li,Shuiguang Deng,Shuibing He,Gang Pan

DOI: https://doi.org/10.1145/3582016.3582038

2023-01-01

Abstract:Neuromorphic hardware is a multi-core computer system specifically designed to run Spiking Neuron Network (SNN) applications. As the scale of neuromorphic hardware increases, it becomes very challenging to efficiently map a large SNN to hardware. In this paper, we proposed an efficient approach to map very large scale SNN applications to neuromorphic hardware, aiming to reduce energy consumption, spike latency, and on-chip network communication congestion. The approach consists of two steps. Firstly, it solves the initial placement using the Hilbert curve, a space-filling curve with unique properties that are particularly suitable for mapping SNNs. Secondly, the Force Directed (FD) algorithm is developed to optimize the initial placement. The FD algorithm formulates the connections of clusters as tension forces, thus converts the local optimization of placement as a force analysis problem. The proposed approach is evaluated with the scale of 4 billion neurons, which is more than 200 times larger than previous research. The results show that our approach achieves state-of-the-art performance, significantly exceeding existing approaches.
A Mapping Model of SNNs to Neuromorphic Hardware

Xiuping Cui,Xiaochen Hao,Yun Liang,Guangyu Sun,Xiaoxin Cui,Yuan Wang,Ru Huang

DOI: https://doi.org/10.1109/aicas54282.2022.9869998

2022-01-01

Abstract:Spiking neural networks (SNNs) can achieve lower power consumption than traditional artificial neural networks. To take full advantage of spiking neural networks, a large number of neuromorphic hardware has emerged. However, it is nontrivial to map SNNs onto neuromorphic hardware due to the hardware constraints and complex networks-on-chip (NoCs). In this paper, we propose a mapping model to bridge the gap between SNNs and neuromorphic hardware. The mapping model is general to neuromorphic hardware with various on-chip networks. The core of the model is a loop-based representation, which can model computation and connection in software and hardware space simultaneously. We further propose transformation primitives to transform networks from software space to hardware space. We evaluate the mapping model using realistic spiking neural networks and three on-chip network topologies. Experiments show that compared to the simulated annealing algorithm without using the model, the energy consumption of computation and communication can be reduced by 19.4% and 27.4% on average, respectively.
Heterogeneous Systems with Reconfigurable Neuromorphic Computing Accelerators

Sicheng Li,Xiaoxiao Liu,Menglie Mao,Hai (Helen) Li,Yiran Chen,Boxun Li,Yu Wang

DOI: https://doi.org/10.1109/iscas.2016.7527186

2016-01-01

Abstract:Developing heterogeneous system with hardware accelerator is a promising solution to implement high performance applications where explicitly programmed, rule-based algorithms are either infeasible or inefficient. However, mapping a neural network model to a hardware representation is a complex process, where balancing computation resources and memory accesses is crucial. In this work, we present a systematic approach o optimize the heterogeneous system with a FPGA-based neuromorphic computing accelerator (NCA). For any applications, the neural network topology and computation flow of the accelerator can be configured through a NCA-aware compiler. The FPGA-based NCA contains a generic multi-layer neural network composed of a set of parallel neural processing elements. Such a scheme imitates the human cognition process and follows the hierarchy of neocortex. At architectural level, we decrease the computing resource requirement to enhance computation efficiency. The hardware implementation primarily targets at reducing data communication load: a multi-thread computation engine is utilized to mask the long memory latency. Such a combined solution can well accommodate the ever increasing complexity and scalability of machine learning applications and improve the system performance and efficiency. Through the evaluation across eight representative benchmarks, we observed on average 12.1× speedup and 45.8× energy reduction, with marginal accuracy loss comparing with CPU-only computation.
A Framework for the General Design and Computation of Hybrid Neural Networks

Rong Zhao,Zheyu Yang,Hao Zheng,Yujie Wu,Faqiang Liu,Zhenzhi Wu,Lukai Li,Feng Chen,Seng Song,Jun Zhu,Wenli Zhang,Haoyu Huang,Mingkun Xu,Kaifeng Sheng,Qianbo Yin,Jing Pei,Guoqi Li,Youhui Zhang,Mingguo Zhao,Luping Shi

DOI: https://doi.org/10.1038/s41467-022-30964-7

IF: 16.6

2022-01-01

Nature Communications

Abstract:There is a growing trend to design hybrid neural networks (HNNs) by combining spiking neural networks and artificial neural networks to leverage the strengths of both. Here, we propose a framework for general design and computation of HNNs by introducing hybrid units (HUs) as a linkage interface. The framework not only integrates key features of these computing paradigms but also decouples them to improve flexibility and efficiency. HUs are designable and learnable to promote transmission and modulation of hybrid information flows in HNNs. Through three cases, we demonstrate that the framework can facilitate hybrid model design. The hybrid sensing network implements multi-pathway sensing, achieving high tracking accuracy and energy efficiency. The hybrid modulation network implements hierarchical information abstraction, enabling meta-continual learning of multiple tasks. The hybrid reasoning network performs multimodal reasoning in an interpretable, robust and parallel manner. This study advances cross-paradigm modeling for a broad range of intelligent tasks.
A Scatter-and-Gather Spiking Convolutional Neural Network on a Reconfigurable Neuromorphic Hardware

Chenglong Zou,Xiaoxin Cui,Yisong Kuang,Kefei Liu,Yuan Wang,Xinan Wang,Ru Huang

DOI: https://doi.org/10.3389/fnins.2021.694170

IF: 4.3

2021-01-01

Frontiers in Neuroscience

Abstract:Artificial neural networks (ANNs), like convolutional neural networks (CNNs), have achieved the state-of-the-art results for many machine learning tasks. However, inference with large-scale full-precision CNNs must cause substantial energy consumption and memory occupation, which seriously hinders their deployment on mobile and embedded systems. Highly inspired from biological brain, spiking neural networks (SNNs) are emerging as new solutions because of natural superiority in brain-like learning and great energy efficiency with event-driven communication and computation. Nevertheless, training a deep SNN remains a main challenge and there is usually a big accuracy gap between ANNs and SNNs. In this paper, we introduce a hardware-friendly conversion algorithm called “scatter-and-gather” to convert quantized ANNs to lossless SNNs, where neurons are connected with ternary {−1,0,1} synaptic weights. Each spiking neuron is stateless and more like original McCulloch and Pitts model, because it fires at most one spike and need be reset at each time step. Furthermore, we develop an incremental mapping framework to demonstrate efficient network deployments on a reconfigurable neuromorphic chip. Experimental results show our spiking LeNet on MNIST and VGG-Net on CIFAR-10 datasetobtain 99.37% and 91.91% classification accuracy, respectively. Besides, the presented mapping algorithm manages network deployment on our neuromorphic chip with maximum resource efficiency and excellent flexibility. Our four-spike LeNet and VGG-Net on chip can achieve respective real-time inference speed of 0.38 ms/image, 3.24 ms/image, and an average power consumption of 0.28 mJ/image and 2.3 mJ/image at 0.9 V, 252 MHz, which is nearly two orders of magnitude more efficient than traditional GPUs.
Stabilization of Inverted Pendulum by Fractional Order PD Controller with Experimental Validation: D-decomposition Approach

P. Mandic,M. Lazarevic,T. Šekara

DOI: https://doi.org/10.1007/978-3-319-49058-8_4

2016-06-30

Abstract:
A New Hybrid Neural System Interfacing Neurons and Silicon Hardware for Fast Signal Recognition

ZH Liu,ZH Wang

DOI: https://doi.org/10.1109/ijcnn.2005.1556446

2006-01-01

Abstract:Built on the biological neural network (BNN) theories, artificial neural network (ANN) has exhibited many significant advantages as of now. But yet, the high complexity of live beings' nervous system leads to quite limited knowledge on the working principles of learning, thinking and cognition at molecular level today, i.e. in a sense, the development of ANN has to be confined by the understanding of BNN. On the other hand, the huge memory space in an ANN chip for storing all connection weights is also a serious problem. In this paper, a novel mixed neural system interfacing biological neurons and semiconductor chip on a shared silicon wafer substrate for fast signal recognition is proposed, where three blocks are designed and interconnected. Recorded simulations with a 5 /spl times/ 5 microelectrode-array covered by a 100 /spl times/ 100 BNN show that combining the individual advantages of large-scale integrated circuits and BNN, this system has faster and more intelligent capabilities for fuzzy control, speech or pattern recognition as compared with common ways. At the same time, it can resolve the problems of huge memory space in ANN chips and the high complexity for algorithms, with an average 90.3% degree reduced efficiently between 5 trials.
A heterogeneous computing system with memristor-based neuromorphic accelerators

Xiaoxiao Liu,Mengjie Mao,Hai Li,Yiran Chen,Hao Jiang,J. Joshua Yang,Qing Wu,Mark Barnell

DOI: https://doi.org/10.1109/HPEC.2014.7040986

2014-01-01

Abstract:As technology scales, on-chip heterogeneous architecture emerges as a promising solution to combat the power wall of microprocessors. In this work, we propose a heterogeneous computing system with memristor-based neuromorphic computing accelerators (NCAs). In the proposed system, NCA is designed to speed up the artificial neural network (ANN) executions in many high-performance applications by leveraging the extremely efficient mixed-signal computation capability of nanoscale memristor-based crossbar (MBC) arrays. The hierarchical MBC arrays of the NCA can be flexibly configured to different ANN topologies through the help of an analog Network-on-Chip (A-NoC). A general approach which translates the target codes within a program to the corresponding NCA instructions is also developed to facilitate the utilization of the NCA. Our simulation results show that compared to the baseline general purpose processor, the proposed system can achieve on average 18.2X performance speedup and 20.1X energy reduction over nine representative applications. The computation accuracy degradation is constrained within an acceptable range (e.g., 11%), by considering the limited data precision, realistic device variations and analog signal fluctuations.
An End-to-End SoC for Brain-Inspired CNN-SNN Hybrid Applications

Zhaotong Zhang,Yi Zhong,Yingying Cui,Yawei Ding,Yukun Xue,Qibin Lie,Ruining Yang,Jian Cao,Yuan Wang

DOI: https://doi.org/10.1109/iscas58744.2024.10558308

2024-01-01

Abstract:Inspired by the brain, Spiking Neural Network (SNN) applies temporally sparse spiking communication to gain more bio-mimetic and highly energy efficient computing. The current mainstream platforms for SNN applications are typically the combination of Host+FPGA+Chip Array, which requires an efficient host to preprocess and encode data. It’s not suitable for end-to-end tasks in edge due to its high system power consumption of host and non-negligible high latency of protocol conversion on FPGA. In addition, Convolutional Neural Network (CNN), exhibits strong feature extraction capabilities. Like the brain's visual system, a hierarchical CNN-SNN hybrid network, in which SNN can make use of CNN’s feature extraction capabilities during encoding, can achieve better performance. In this study, we design a 64Neural-Core Array and integrate it with a CNN encoder and a low-power RISC-V CPU within a System-on-Chip (SoC) to enable comprehensive end-to-end hybrid network application support. The proposed heterogeneous SoC is implemented on a Virtex UltraScale+ XCVU9P FPGA, featuring 32.8K neurons, 37.7M synapses and 578GOPS/s peak performance. It processes MNIST classification with a peak throughput of 2022 images per second at frequency of 250MHz. This design gains a balance between high throughput and recognition accuracy simultaneously.

What are guanosine triphosphate-binding proteins doing in mitochondria?

Towards Efficient Deployment of Hybrid SNNs on Neuromorphic and Edge AI Hardware

Multi-grained system integration for hybrid-paradigm brain-inspired computing

NEUTRAMS: Neural Network Transformation and Co-Design under Neuromorphic Hardware Constraints

Tianjic: A Unified and Scalable Chip Bridging Spike-Based and Continuous Neural Computation

Modular Building Blocks for Mapping Spiking Neural Networks Onto a Programmable Neuromorphic Processor

A Hybrid Heterogeneous Neural Network Accelerator Based on Systolic Array

Bridging the Gap Between Neural Networks and Neuromorphic Hardware with A Neural Network Compiler

A High Energy-Efficiency Multi-core Neuromorphic Architecture for Deep SNN Training

Compiling Spiking Neural Networks to Neuromorphic Hardware

Bridge the Gap Between Neural Networks and Neuromorphic Hardware with a Neural Network Compiler

Sparsity-Aware In-Memory Neuromorphic Computing Unit With Configurable Topology of Hybrid Spiking and Artificial Neural Network

Mapping Very Large Scale Spiking Neuron Network to Neuromorphic Hardware.

A Mapping Model of SNNs to Neuromorphic Hardware

Heterogeneous Systems with Reconfigurable Neuromorphic Computing Accelerators

A Framework for the General Design and Computation of Hybrid Neural Networks

A Scatter-and-Gather Spiking Convolutional Neural Network on a Reconfigurable Neuromorphic Hardware

Stabilization of Inverted Pendulum by Fractional Order PD Controller with Experimental Validation: D-decomposition Approach

A New Hybrid Neural System Interfacing Neurons and Silicon Hardware for Fast Signal Recognition

A heterogeneous computing system with memristor-based neuromorphic accelerators

An End-to-End SoC for Brain-Inspired CNN-SNN Hybrid Applications