Binaural Cues Estimates Based on Interaural Matching Filter for Sound Source Localization.

Ling Chen,Jie Zhang,Guodong Chen,Meng Zhang,Hong Liu
DOI: https://doi.org/10.1109/robio.2015.7418879
2015-01-01
Abstract:Robot audition is a natural and convenient way for robot to interact with outside. Binaural sound source localization for robot audition plays an important role in human-robot interaction, speech capturing, enhancement and communication, etc. For the issue, interaural time difference (ITD) and interaural level difference (ILD) are two essential binaural (interaural) cues, yet they are still difficult to extract in complex environments. In this context, this paper proposes a new scenario to simultaneously estimate the ITD and ILD based on Interaural Matching Filter (IMF). The IMF is an optimal filter in view of eliminating the disparity of binaural signals so that it implies some information of binaural cues. Firstly, the IMF is decomposed into a minimum phase component and an all-pass component by homomorphic filtering. Subsequently, ITD is evaluated from the phase response of all-pass component, and ILD is yielded by the amplitude response of minimum phase component. Experiments verify the effectiveness of our method to preserve binaural cues, and it is robust for sound localization.
What problem does this paper attempt to address?