Abstract:Address event representation (AER) sensors, recording frameless event data, have recently attracted more attention due to the advantages of sparsed spatiotemporal representation. Spiking neural network (SNN) is a representative biologically plausible model, which is inherently suitable for event-driven AER data processing. However, AER object classification using SNN is still challenging due to the lack of robustness for free moving object recognition. This paper proposes a novel event-driven hierarchical recognition model using an activated connected domain (ACD) location method and an SNN classifier with fusion mechanism. The proposed model extracts bio-inspired cortex-like features by Gabor filters with multiple orientations and scales. Meanwhile, the ACD mechanism coordinates with feature extraction to obtain stable features under random movement of objects. Finally, the features are discriminated by the Tempotron classifier with feature fusion to reduce computing consumption while maintaining comparable performance. Comprehensive experiments conducted on several AER datasets have shown superior performance of the proposed system. Besides, we extend the MNIST-DVS dataset to simulate random moving objects by adding a random continuous spatial offset to the event streams of its samples. Ablation experiments demonstrate that the ACD mechanism enriches the model recognizing capability for free moving objects, especially for training samples only with fixed trajectory movement, which reflects the applicability and robustness. This model equips a high potential for innovation and development in the recognition of moving objects in natural scenes.

Unsupervised Temporal Feature Learning Based On Sparse Coding Embedded Boaw For Acoustic Event Recognition

Audio Sentiment Analysis by Heterogeneous Signal Features Learned from Utterance-Based Parallel Neural Network.

Temporal Coding of Local Spectrogram Features for Robust Sound Recognition

Subspace Pooling Based Temporal Features Extraction for Audio Event Recognition

Semantic Feature Extraction Based on Subspace Learning with Temporal Constraints for Acoustic Event Recognition

An Event-based Feature Representation Method for Event Stream Classification Using Deep Spiking Neural Networks

Using Deep Belief Network to Capture Temporal Information for Audio Event Classification.

Task-driven Common Subspace Learning Based Semantic Feature Extraction for Acoustic Event Recognition

Effective AER Object Classification Using Segmented Probability-Maximization Learning in Spiking Neural Networks

A Novel Codebook Representation Method and Encoding Strategy for Bag-of-words Based Acoustic Event Classification.

Real-world acoustic event detection

Spike-based Encoding and Learning of Spectrum Features for Robust Sound Recognition.

Multi-dimensional Edge-based Audio Event Relational Graph Representation Learning for Acoustic Scene Classification

An Event-Driven Object Recognition Model Using Activated Connected Domain Detection

Sparse coding for sound event classification

Audio Event-Relational Graph Representation Learning for Acoustic Scene Classification

Pyramidal Temporal Pooling with Discriminative Mapping for Audio Classification

Direction-of-Arrival Estimation Method Based on Neural Network with Temporal Structure for Underwater Acoustic Vector Sensor Array

Balanced Deep CCA for Bird Vocalization Detection

A Joint Framework for Audio Tagging and Weakly Supervised Acoustic Event Detection Using DenseNet with Global Average Pooling.

Transferring Voice Knowledge for Acoustic Event Detection: An Empirical Study