Abstract:Researchers are exploring novel computational paradigms such as sparse coding and neuromorphic computing to bridge the efficiency gap between the human brain and conventional computers in complex tasks. A key area of focus is neuromorphic audio processing. While the Locally Competitive Algorithm has emerged as a promising solution for sparse coding, offering potential for real-time and low-power processing on neuromorphic hardware, its applications in neuromorphic speech classification have not been thoroughly studied. The Adaptive Locally Competitive Algorithm builds upon the Locally Competitive Algorithm by dynamically adjusting the modulation parameters of the filter bank to fine-tune the filters' sensitivity. This adaptability enhances lateral inhibition, improving reconstruction quality, sparsity, and convergence time, which is crucial for real-time applications. This paper demonstrates the potential of the Locally Competitive Algorithm and its adaptive variant as robust feature extractors for neuromorphic speech classification. Results show that the Locally Competitive Algorithm achieves better speech classification accuracy at the expense of higher power consumption compared to the LAUSCHER cochlea model used for benchmarking. On the other hand, the Adaptive Locally Competitive Algorithm mitigates this power consumption issue without compromising the accuracy. The dynamic power consumption is reduced to a range of 4 to 13 milliwatts on neuromorphic hardware, three orders of magnitude less than setups using Graphics Processing Units. These findings position the Adaptive Locally Competitive Algorithm as a compelling solution for efficient speech classification systems, promising substantial advancements in balancing speech classification accuracy and power efficiency.

Optimization and evaluation of energy-efficient mixed-signal MFCC feature extraction architecture

Energy-efficient MFCC Extraction Architecture in Mixed-Signal Domain for Automatic Speech Recognition

MSP-MFCC: Energy-Efficient MFCC Feature Extraction Method With Mixed-Signal Processing Architecture for Wearable Speech Recognition Applications

Design and Implementation of End-Point Detection Accelerator for Speech Recognition

Energy Efficiency Optimization for Beamspace Massive MIMO Systems with Low-Resolution ADCs.

An Energy-Efficient Binarized Neural Network Using Analog-Intensive Feature Extraction for Keyword and Speaker Verification Wakeup.

Performance Optimization of Energy Efficient Semantic Communications over Wireless Networks

NS-FDN: Near-Sensor Processing Architecture of Feature-Configurable Distributed Network for Beyond-Real-Time Always-on Keyword Spotting

Nanowatt Acoustic Inference Sensing Exploiting Nonlinear Analog Feature Extraction

An Ultra-Low Power Binarized Convolutional Neural Network-Based Speech Recognition Processor with On-Chip Self-Learning.

Efficient Binary Weight Convolutional Network Accelerator for Speech Recognition

A 11.6μ W Computing-on-Memory-Boundary Keyword Spotting Processor with Joint MFCC-CNN Ternary Quantization

More is Less: Domain-Specific Speech Recognition Microprocessor Using One-Dimensional Convolutional Recurrent Neural Network

MFCC based real-time speech reproduction and recognition using distributed acoustic sensing technology

Dual-Stage Low-Complexity Reconfigurable Speech Enhancement

A Fully Integrated 1.7mw Attention-Based Automatic Speech Recognition Processor

A Low-Power Keyword Spotting System with High-Order Passive Switched-Capacitor Bandpass Filters for Analog-MFCC Feature Extraction

Efficient Sparse Coding with the Adaptive Locally Competitive Algorithm for Speech Classification

Real-Time Implementation of an Efficient Speech Enhancement Algorithm for Digital Hearing Aids

Sub-mW Keyword Spotting on an MCU: Analog Binary Feature Extraction and Binary Neural Networks

Improved speech recognition algorithm based on MFCC feature