Multi-task deep learning approach for sound event recognition and tracking

Tzung-Shi Chen,Ming-Ju Chen,Tzung-Cheng Chen
DOI: https://doi.org/10.1504/ijahuc.2024.138747
2024-05-30
International Journal of Ad Hoc and Ubiquitous Computing
Abstract:In smart cities, it is important to detect abnormal activities through cameras. However, cameras have limitations such as blind spots and blocked areas that can result in detection failures. Sound, on the other hand, is less likely to be obstructed. This paper proposes using microphone arrays to identify sound events, predict their locations, and track their trajectories using multi-task deep learning approaches. Experimental results show high predictive accuracy. Finally, the proposed models are also converted to quantised versions and deployed on embedded devices in vehicles to analyse memory footprint and execution time.
computer science, information systems,telecommunications
What problem does this paper attempt to address?