Abstract: Researchers are exploring novel computational paradigms such as sparse coding and neuromorphic computing to bridge the efficiency gap between the human brain and conventional computers in complex tasks. A key area of focus is neuromorphic audio processing. While the Locally Competitive Algorithm has emerged as a promising solution for sparse coding, offering potential for real-time and low-power processing on neuromorphic hardware, its applications in neuromorphic speech classification have not been thoroughly studied. The Adaptive Locally Competitive Algorithm builds upon the Locally Competitive Algorithm by dynamically adjusting the modulation parameters of the filter bank to fine-tune the filters' sensitivity. This adaptability enhances lateral inhibition, improving reconstruction quality, sparsity, and convergence time, all of which are crucial for real-time applications. This paper demonstrates the potential of the Locally Competitive Algorithm and its adaptive variant as robust feature extractors for neuromorphic speech classification. Results show that the Locally Competitive Algorithm achieves better speech classification accuracy than the LAUSCHER cochlea model used for benchmarking, at the expense of higher power consumption. The Adaptive Locally Competitive Algorithm, in contrast, mitigates this power consumption issue without compromising accuracy. Dynamic power consumption is reduced to a range of 4 to 13 milliwatts on neuromorphic hardware, three orders of magnitude less than setups using Graphics Processing Units. These findings position the Adaptive Locally Competitive Algorithm as a compelling solution for efficient speech classification systems, promising substantial advancements in balancing speech classification accuracy and power efficiency.
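
For readers unfamiliar with the Locally Competitive Algorithm, the following is a minimal sketch of the textbook dynamics, not the paper's implementation. It assumes unit-norm dictionary atoms, a soft-threshold activation, and plain Euler integration. The names `lca`, `soft_threshold`, `lam`, `tau`, and `n_steps` and the toy 2-D dictionary are illustrative assumptions. The adaptive variant's filter bank parameter updates are not shown.

```python
import math

def soft_threshold(u, lam):
    """Activation: shrink the membrane potential u toward zero by lam."""
    return math.copysign(max(abs(u) - lam, 0.0), u)

def lca(x, atoms, lam=0.1, tau=10.0, n_steps=300):
    """Sparse-code x over unit-norm atoms with basic LCA dynamics.

    Euler-integrates  tau * du/dt = b - u - (G - I) a,  with  a = T_lam(u),
    where b holds atom/input correlations and G is the Gram matrix.
    """
    n = len(atoms)
    dot = lambda p, q: sum(pi * qi for pi, qi in zip(p, q))
    b = [dot(atom, x) for atom in atoms]           # feed-forward drive
    gram = [[dot(atoms[i], atoms[j]) for j in range(n)] for i in range(n)]
    u = [0.0] * n                                  # membrane potentials
    for _ in range(n_steps):
        a = [soft_threshold(ui, lam) for ui in u]  # current activations
        u = [
            u[i] + (b[i] - u[i]
                    # lateral inhibition from the other active atoms
                    - sum(gram[i][j] * a[j] for j in range(n) if j != i)) / tau
            for i in range(n)
        ]
    return [soft_threshold(ui, lam) for ui in u]

# Toy example (hypothetical data): three atoms in 2-D, one of them oblique.
s = 1.0 / math.sqrt(2.0)
atoms = [(1.0, 0.0), (0.0, 1.0), (s, s)]
x = (1.0, 0.0)
a = lca(x, atoms)
reconstruction = [sum(a[k] * atoms[k][d] for k in range(3)) for d in range(2)]
```

In this toy case the oblique atom initially responds to the input but is then suppressed by the better-matching first atom. This is the lateral inhibition the abstract refers to, and only one coefficient should remain active once the dynamics settle.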

EfficientLEAF: A Faster LEarnable Audio Frontend of Questionable Use

LEAF: A Learnable Frontend for Audio Classification

What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy Conditions

Learnable Frontends that do not Learn: Quantifying Sensitivity to Filterbank Initialisation

Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks

FastAudio: A Learnable Audio Front-End for Spoof Speech Detection

Biomimetic Frontend for Differentiable Audio Processing

Efficient Sparse Coding with the Adaptive Locally Competitive Algorithm for Speech Classification

Fitting Auditory Filterbanks with Multiresolution Neural Networks

Learning neural audio features without supervision

How to Optimize the Gain Filter of LD-CELP

Low-Complexity Audio Embedding Extractors

Learnable Acoustic Frontends in Bird Activity Detection

Audio Enhancement for Computer Audition -- An Iterative Training Paradigm Using Sample Importance

Studying Performance for the Gain Filters of 8 kbit/s LD-ACELP

DeepSpectrumLite: A Power-Efficient Transfer Learning Framework for Embedded Speech and Audio Processing From Decentralized Data

Deep Feature Learning for Medical Acoustics

Audio-Visual Efficient Conformer for Robust Speech Recognition

Optimizing Audio Augmentations for Contrastive Learning of Health-Related Acoustic Signals

AudioRepInceptionNeXt: A lightweight single-stream architecture for efficient audio recognition