Model-based distributed node clustering and multi-speaker speech presence probability estimation in wireless acoustic sensor networks

Yingke Zhao,Jesper Kjaer Nielsen,Jingdong Chen,Mads Graesboll Christensen
DOI: https://doi.org/10.1121/10.0001449
2020-01-01
Abstract:The knowledge of speech presence probability (SPP) plays an essential role in noise estimation and speech enhancement. Single channel SPP estimation and centralized multi-channel SPP estimation have been well studied. However, how to estimate SPP in wireless acoustic sensor networks (WASNs) remains a great challenge and few efforts can be found in this topic, particularly for WASN applications with multiple speakers. Accordingly, this paper is devoted to the problem of SPP estimation in WASNs and it presents a distributed model-based SPP estimation method for multi-speaker detection, which does not need any fusion center. A distributedk-means clustering method is first used to cluster the nodes into subnetworks, which detect different speakers. For each node in the subnetwork, the speech and noise power spectral densities are estimated locally by using a model-based method, then a distributed SPP estimator is developed and applied in every subnetwork. A distributed consensus method is used to obtain the distributed clustering and the distributed SPP estimation. Simulation results show that the proposed distributed clustering method can assign nodes into subnetworks based on their noisy observations. Moreover, the proposed distributed SPP estimator achieves robust speech detection performance under different noise conditions.
What problem does this paper attempt to address?