Abstract: As the Internet of Things (IoT) increasingly incorporates AI technology, there is a trend toward deploying neural network algorithms at the edge to make IoT devices more intelligent. Moreover, IoT devices based on energy-harvesting technology offer the advantages of a green, low-carbon economy, convenient maintenance, and a theoretically infinite lifetime. However, the harvested energy is often unstable, and performance suffers because a fixed load cannot fully utilize it. To address this problem, recent works on ReRAM-based convolutional neural network (CNN) accelerators powered by harvested energy have proposed hardware/software optimizations. However, those works have overlooked the mismatch between the power requirements of different CNN layers and the variation of harvested power. Motivated by this observation, this paper proposes a novel strategy, called REC, that retimes the convolutional layers of CNN inferences to improve the performance and energy efficiency of energy-harvesting ReRAM-based accelerators. Specifically, at the offline stage, REC defines different power levels to fit the power requirements of different convolutional layers. At runtime, instead of sequentially executing the convolutional layers of an inference one by one, REC retimes the execution timeframe of different convolutional layers so as to match different CNN layers to the changing power input. In addition, REC provides a parallel strategy to fully utilize very high power inputs. A case study further shows that REC improves the real-time completion of periodic critical inferences, because it gives critical inferences an opportunity to preempt the processing window with a high power supply. Experimental results show that REC achieves an average performance improvement of 6.1× (up to 16.5×) compared to the traditional strategy without REC. The case study results show that REC significantly improves the success rate of real-time completion for periodic critical inferences.
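
The contrast between sequential execution and retimed execution can be illustrated with a toy model. The sketch below is not the REC algorithm from the paper; it is a minimal illustration under assumptions made up for this example: time is divided into equal slots, each layer takes exactly one slot, power is given in arbitrary units, and several inferences are kept in flight so that a layer matching the current power level is more likely to be ready. The function names and the `inflight` parameter are hypothetical.

```python
from typing import List


def sequential_schedule(layer_power: List[int], power_trace: List[int]) -> int:
    """Baseline: run layers strictly in order; stall when power is too low.

    Returns the number of layer executions completed over the trace.
    """
    done = 0
    nxt = 0  # index of the next layer of the single running inference
    for available in power_trace:
        if layer_power[nxt] <= available:
            done += 1
            nxt = (nxt + 1) % len(layer_power)  # wrap to the next inference
    return done


def retimed_schedule(layer_power: List[int], power_trace: List[int],
                     inflight: int = 2) -> int:
    """Toy retiming (assumption, not the paper's method): keep `inflight`
    inferences open and, in each slot, run the ready layer with the largest
    power requirement that still fits the available power.
    """
    next_layer = [0] * inflight  # next layer index for each open inference
    done = 0
    for available in power_trace:
        # Collect (required power, inference id) for every layer that fits.
        candidates = [
            (layer_power[next_layer[i]], i)
            for i in range(inflight)
            if layer_power[next_layer[i]] <= available
        ]
        if not candidates:
            continue  # no ready layer fits this slot's power
        _, chosen = max(candidates)
        next_layer[chosen] = (next_layer[chosen] + 1) % len(layer_power)
        done += 1
    return done


if __name__ == "__main__":
    layers = [3, 1, 2]          # hypothetical per-layer power requirements
    trace = [3, 3, 1, 1, 2, 2]  # hypothetical harvested power per slot
    print("sequential:", sequential_schedule(layers, trace))
    print("retimed:   ", retimed_schedule(layers, trace))
```

On this hand-picked trace the baseline completes 3 layer executions and the retimed variant completes 6, because the baseline spends a high-power slot on a low-power layer and then stalls when power drops. The numbers are specific to this toy input and say nothing about the gains reported in the paper.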

Low power driven loop tiling for RRAM crossbar-based CNN.

A Peripheral Circuit Reuse Structure Integrated with a Retimed Data Flow for Low Power RRAM Crossbar-Based CNN.

DaDianNao: A Machine-Learning Supercomputer

Switched by input: power efficient structure for RRAM-based convolutional neural network.

Low Bit-Width Convolutional Neural Network on RRAM

Training Low Bitwidth Convolutional Neural Network on RRAM

Binary Convolutional Neural Network on RRAM.

Low power Convolutional Neural Networks on a chip

Mixed Size Crossbar Based RRAM CNN Accelerator with Overlapped Mapping Method

High Area/Energy Efficiency RRAM CNN Accelerator with Pattern-Pruning-Based Weight Mapping Scheme

RRAM Based Buffer Design for Energy Efficient CNN Accelerator.

A Configurable Multi-Precision CNN Computing Framework Based on Single Bit RRAM

Straightforward Data Transfer in a Blockwise Dataflow for an Analog RRAM-based CIM System

Efficient Implementation of Multi-Channel Convolution in Monolithic 3D ReRAM Crossbar

A RRAM Based Max-Pooling Scheme for Convolutional Neural Network

A Low-Power Hardware Architecture for Real-Time CNN Computing

AEPE: an Area and Power Efficient RRAM Crossbar-Based Accelerator for Deep CNNs

An Energy-Efficient Mixed-Bit CNN Accelerator With Column Parallel Readout for ReRAM-Based In-Memory Computing

Reducing SRAM Reading Power with Column Data Segment and Weights Correlation Enhancement for CNN Processing.

Design And Optimization Of FeFET-Based Crossbars For Binary Convolution Neural Networks

REC: REtime Convolutional layers to fully exploit harvested energy for ReRAM-based CNN accelerators