Abstract: Environmental sound recognition (ESR) can determine the fitness of individuals by helping them avoid dangers or pursue opportunities when critical sound events occur. The fundamental principles by which biological systems achieve such a remarkable ability remain a mystery. The practical importance of ESR has also attracted increasing research attention, but the chaotic and nonstationary nature of environmental sounds continues to make it a challenging task. In this article, we propose a spike-based framework for ESR from a more brain-like perspective. Our framework is a unified system that consistently integrates three major functional parts: sparse encoding, efficient learning, and robust readout. We first introduce a simple sparse encoding, in which key points are used for feature representation, and demonstrate that it generalizes to both spike-based and nonspike-based systems. We then evaluate the properties of different learning rules in detail and contribute improvements to them. Our results highlight the advantages of multispike learning and provide a selection reference for various spike-based developments. Finally, we combine the multispike readout with the other parts to form a complete system for ESR. Experimental results show that our framework outperforms the baseline approaches. In addition, we show that our spike-based framework has several advantageous characteristics, including early decision making, the need for only a small dataset, and ongoing dynamic processing. Our framework is the first attempt to apply the multispike characteristic of biological neurons to ESR. The strong performance of our approach could help draw more research effort toward pushing the boundaries of the spike-based paradigm.
