Abstract: This paper proposes a background-noise-aware keyword-spotting (KWS) processor with high power-performance-area efficiency, based on an optimized binarized weight network (BWN). To reduce power consumption while maintaining recognition accuracy across different background-noise levels, the KWS processor uses an SNR prediction module to switch adaptively between two computing modes (a standard computing mode and an approximate computing mode), achieving high recognition accuracy under high background noise and ultra-low power consumption under low background noise. The mel-scale frequency cepstral coefficients (MFCC) module is optimized with approximate computing techniques, which can reduce the power consumption by up to $3.1\times$ and $5.7\times$ under high and low background noise, respectively. Based on an architecture design space exploration, an ultra-low-power BWN accelerator is proposed that features low voltage, area, and leakage power and uses precision self-adaptive approximate computing units. Evaluated in a 22 nm process technology, this work supports real-time recognition of up to 10 keywords with a power consumption of $15.1~\mu\text{W}$ under high background noise and $10.8~\mu\text{W}$ under low background noise. Compared with state-of-the-art KWS architectures, our work achieves ultra-low power consumption (a reduction of about $1.7\times$) while maintaining high system capability and adaptability.
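
The dual-mode idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not say how the SNR prediction module works or where the switching point lies, so the function names, the power-ratio SNR estimate, and the 10 dB threshold are all hypothetical. The sketch also assumes that the standard mode corresponds to the high-noise figure and the approximate mode to the low-noise figure. Only the two power values come from the abstract.

```python
import math
from enum import Enum


class Mode(Enum):
    STANDARD = "standard"        # assumed: used under high background noise
    APPROXIMATE = "approximate"  # assumed: used under low background noise


# Power figures quoted in the abstract (microwatts), paired with modes
# under the assumption stated above.
POWER_UW = {Mode.STANDARD: 15.1, Mode.APPROXIMATE: 10.8}


def estimate_snr_db(signal_power: float, noise_power: float) -> float:
    """Textbook SNR in dB from average signal and noise power.

    Stand-in for the paper's SNR prediction module, whose actual
    algorithm is not described in the abstract.
    """
    if signal_power <= 0 or noise_power <= 0:
        raise ValueError("powers must be positive")
    return 10.0 * math.log10(signal_power / noise_power)


def select_mode(snr_db: float, threshold_db: float = 10.0) -> Mode:
    """Pick a computing mode; threshold_db is a hypothetical value."""
    return Mode.APPROXIMATE if snr_db >= threshold_db else Mode.STANDARD


def average_power_uw(fraction_low_noise: float) -> float:
    """Time-weighted average power for a given share of low-noise time."""
    if not 0.0 <= fraction_low_noise <= 1.0:
        raise ValueError("fraction must be in [0, 1]")
    return (fraction_low_noise * POWER_UW[Mode.APPROXIMATE]
            + (1.0 - fraction_low_noise) * POWER_UW[Mode.STANDARD])


if __name__ == "__main__":
    snr = estimate_snr_db(signal_power=100.0, noise_power=1.0)
    print(select_mode(snr), average_power_uw(0.5))
```

Under these assumptions, a device that spends half of its time in quiet conditions would average a value midway between the two reported power figures. The real trade-off depends on how the processor actually predicts SNR and on the overhead of switching modes, neither of which the abstract specifies.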

AAD-KWS: A Sub-μW Keyword Spotting Chip with a Zero-Cost, Acoustic Activity Detector from a 170nW MFCC Feature Extractor in 28nm CMOS

AAD-KWS: A Sub-μW Keyword Spotting Chip with an Acoustic Activity Detector Embedded in MFCC and a Tunable Detection Window in 28-nm CMOS

A 510-nW Wake-Up Keyword-Spotting Chip Using Serial-FFT-Based MFCC and Binarized Depthwise Separable CNN in 28-nm CMOS

14.1 A 510nW 0.41V Low-Memory Low-Computation Keyword-Spotting Chip Using Serial FFT-Based MFCC and Binarized Depthwise Separable Convolutional Neural Network in 28nm CMOS

A 0.61-μW Fully Integrated Keyword-Spotting ASIC with Real-Point Serial FFT-Based MFCC and Temporal Depthwise Separable CNN

A 2.81 μW, Energy Efficient MFCC Feature Extractor for Keyword-Spotting in 65nm CMOS

A 608nW Near-Microphone Keyword-Spotting Chip Using Real-Point Serial FFT-Based MFCC and Temporal Depthwise Separable CNN in 28nm CMOS

A 110nW Always-on Keyword Spotting Chip Using Spiking CNN in 40nm CMOS

A Background-Noise and Process-Variation-Tolerant 109nW Acoustic Feature Extractor Based on Spike-Domain Divisive-Energy Normalization for an Always-On Keyword Spotting Device

An Ultra-low Power Keyword-Spotting Accelerator Using Circuit-Architecture-System Co-design and Self-adaptive Approximate Computing Based BWN

A 22nm, 10.8 μW/15.1 μW Dual Computing Modes High Power-Performance-Area Efficiency Domained Background Noise Aware Keyword-Spotting Processor

9.1 μW keyword spotting processor based on optimized MFCC and small‐footprint TENet in 28‐nm CMOS

Sub-mW Keyword Spotting on an MCU: Analog Binary Feature Extraction and Binary Neural Networks

NS-KWS: joint optimization of near-sensor processing architecture and low-precision GRU for always-on keyword spotting

NS-FDN: Near-Sensor Processing Architecture of Feature-Configurable Distributed Network for Beyond-Real-Time Always-on Keyword Spotting

Design of an Ultra-Low Power MFCC Feature Extraction Circuit with Embedded Speech Activity Detector

An Ultra-Low Power Always-On Keyword Spotting Accelerator Using Quantized Convolutional Neural Network and Voltage-Domain Analog Switching Network-Based Approximate Computing

A Low-Power Keyword Spotting System with High-Order Passive Switched-Capacitor Bandpass Filters for Analog-MFCC Feature Extraction

VoAD: A Sub-μW Multiscene Voice Activity Detector Deploying Analog-Frontend Digital-Backend Circuits