A Time-domain Unsupervised Learning Based Sound Source Localization Method

Yankun Huang,Xihong Wu,Tianshu Qu
DOI: https://doi.org/10.1109/icicsp50920.2020.9232117
2020-01-01
Abstract:In recent years, deep neural networks have been applied in many fields. In this paper, a time-domain unsupervised learning based sound source localization method is proposed, where auto-encoder neural networks are adopted so that some operation like time-delay compensation can be removed and there is no need to prepare training data with precise alignment labels. In order to improve its performance, a training strategy based on the multi-task learning and acoustic transfer function is proposed as well, called joint training of alternating and splitting. Experiments show that the proposed method can learn the transmission characteristics, including the change of time delay and intensity. What's more, the proposed method also has better performance compared with SRP-PHAT, MUSIC and two other neural networks based methods.
What problem does this paper attempt to address?