Abstract: In recent years, drones have brought many conveniences to work and daily life thanks to their low cost and ease of use. However, they have also introduced significant hidden threats to public safety and personal privacy. Detecting drones effectively and promptly is therefore crucial for ensuring public safety and protecting individual privacy. This paper proposes a method that combines a beamforming algorithm with a deep learning neural network to detect drone acoustic events using microphone array technology, with the aim of maximizing detection coverage and accuracy. The proposed approach uses the beamforming algorithm to perform directional audio capture of the drone sound signal acquired by the microphone array. It then extracts features such as the Log-Mel spectrogram and Mel-Frequency Cepstral Coefficients from the audio signal and feeds them to a Convolutional Neural Network for classification, which yields the final detection result. Experiments assess the impact of different front-end processing algorithms, dataset compositions, and feature selections on detection performance. To give a more specific and pronounced indication of how well the drone sound event detection task is accomplished, a novel evaluation criterion, the Machine-Human Ultimate Distance Ratio, is introduced. The results demonstrate that the detection range and accuracy of the drone sound event detection system based on deep learning and a microphone array surpass those of a single-microphone sound event detection method. The proposed approach achieves effective detection at ranges of up to 135 m in the surrounding environment.
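
The directional audio capture step described in the abstract can be illustrated with a minimal sketch. The abstract does not say which beamforming algorithm is used, so the example below assumes a plain far-field delay-and-sum beamformer implemented in the frequency domain with NumPy. The function name `delay_and_sum`, its parameters, and the speed of sound of 343 m/s are all illustrative assumptions and are not taken from the paper.

```python
import numpy as np


def delay_and_sum(signals, mic_positions, direction, fs, c=343.0):
    """Steer a microphone array towards `direction` (hypothetical helper).

    signals:       (n_mics, n_samples) array of time-domain channels
    mic_positions: (n_mics, 3) microphone coordinates in metres
    direction:     vector pointing from the array towards the source
    fs:            sampling rate in Hz
    c:             assumed speed of sound in m/s
    """
    signals = np.asarray(signals, dtype=float)
    mic_positions = np.asarray(mic_positions, dtype=float)
    direction = np.asarray(direction, dtype=float)
    direction = direction / np.linalg.norm(direction)
    n_samples = signals.shape[1]

    # Far-field model: microphones closer to the source hear the wavefront
    # earlier. Arrival time relative to the array origin is -(p_m . u) / c.
    delays = -(mic_positions @ direction) / c

    # Undo each channel's delay with a phase ramp, then average the channels.
    # The FFT makes this a circular shift, so in practice it is applied per
    # short frame rather than to a whole recording.
    freqs = np.fft.rfftfreq(n_samples, d=1.0 / fs)
    spectra = np.fft.rfft(signals, axis=1)
    phase = np.exp(2j * np.pi * freqs[None, :] * delays[:, None])
    aligned = spectra * phase
    return np.fft.irfft(aligned.mean(axis=0), n=n_samples)
```

Signals arriving from the steered direction add coherently, while sound from other directions is attenuated by phase cancellation. In a pipeline like the one the abstract outlines, the beamformed output would then be passed to the feature extraction stage.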

Multi-task deep learning approach for sound event recognition and tracking

Implementation of Abnormal Sound Detection in Intelligent Surveillance Front-end System

Robust Audio Sensing with Multi-Sound Classification.

Training environmental sound classification models for real-world deployment in edge devices

A hybrid parametric-deep learning approach for sound event localization and detection

Deep Convolutional Neural Network for Roadway Incident Surveillance Using Audio Data

Real-Time Vehicle Sound Detection System Based on Depthwise Separable Convolution Neural Network and Spectrogram Augmentation

Robust sound event classification using deep neural networks

Active Object Discovery and Localization Using Sound-Induced Attention

Mobile Microphone Array Speech Detection and Localization in Diverse Everyday Environments

Audio Enhancement and Intelligent Classification of Household Sound Events Using a Sparsely Deployed Array

A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds

Auto++: Detecting Cars Using Embedded Microphones in Real-Time.

Deep Learning Applied to Dereverberation and Sound Event Classification in Reverberant Environments

Multi-label Sound Event Retrieval Using a Deep Learning-based Siamese Structure with a Pairwise Presence Matrix

Speech Activity Detection and Speaker Localization Based on Distributed Microphones.

Multi-microphone fusion for detection of speech and acoustic events in smart spaces

An Automatic Classification System for Environmental Sound in Smart Cities

Deep Learning-based drone acoustic event detection system for microphone arrays

Sound-Based Construction Activity Monitoring with Deep Learning

Multi-mode Study of Deep Learning Applications in Acoustic Signal Processing