Predicting reflection patterns from binaural activity maps using deep neural networks

Jeramey Tyler, Mei Si, Jonas Braasch
DOI: https://doi.org/10.1121/10.0011124
2022-04-01
The Journal of the Acoustical Society of America
Abstract: A new model architecture is presented to predict room acoustical parameters from a running binaural signal. For this purpose, a deep neural network architecture is combined with a precedence effect model to extract the spatial and temporal locations of the direct signal and early reflections. The precedence effect model builds on the modified BICAM algorithm [Braasch, J. Acoust. Soc. Am. 140, EL143], in which the first-layer auto-/cross-correlation functions are replaced with a cepstrum method. The cepstrum method allows a better separation of features relating to the source signal's early reflections from those relating to its harmonic structure. The precedence effect model is used to create binaural activity maps, which the neural network then analyzes for pattern recognition. To test the model, anechoic orchestral recordings were reverberated by adding four early reflections and late reverberation. Head-related transfer functions were used to spatialize the direct sound and early reflections. The model can identify the main reflection characteristics of a room, offering applications in numerous fields, including room acoustical assessment, acoustical analysis for virtual-reality applications, and modeling of human perception. [Work supported by the National Science Foundation under Grant No. IIS-1909229.]
acoustics, audiology & speech-language pathology
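
The abstract's point that a cepstrum method can expose reflection delays can be illustrated with a minimal sketch. This is not the authors' BICAM-based model: it assumes a single-channel white-noise source (rather than binaural orchestral recordings), one synthetic reflection, and the textbook real cepstrum. The names `real_cepstrum`, `estimate_reflection_delay`, and all parameter values are hypothetical and chosen only for illustration.

```python
import numpy as np


def real_cepstrum(x):
    """Real cepstrum: inverse FFT of the log magnitude spectrum."""
    spectrum = np.fft.fft(x)
    # Small offset avoids log(0) for bins with zero magnitude.
    log_magnitude = np.log(np.abs(spectrum) + 1e-12)
    return np.fft.ifft(log_magnitude).real


def estimate_reflection_delay(x, min_lag, max_lag):
    """Return the lag (in samples) of the largest cepstral peak in [min_lag, max_lag)."""
    cepstrum = real_cepstrum(x)
    return int(min_lag + np.argmax(cepstrum[min_lag:max_lag]))


# Assumed toy signal: white noise plus one delayed, attenuated copy.
rng = np.random.default_rng(0)
source = rng.standard_normal(4096)
true_delay = 200   # reflection delay in samples (illustrative value)
gain = 0.6         # reflection amplitude relative to the direct sound

# Zero-padded buffer so the delayed copy is not truncated.
received = np.zeros(len(source) + true_delay)
received[:len(source)] += source          # direct sound
received[true_delay:] += gain * source    # single early reflection

estimated_delay = estimate_reflection_delay(received, min_lag=50, max_lag=1000)
print("estimated reflection delay (samples):", estimated_delay)
```

In this toy case the reflection multiplies the source spectrum by a periodic ripple, which the logarithm turns into an additive term, so it appears as an isolated cepstral peak at the reflection delay. The model described in the abstract works on binaural signals and passes the resulting activity maps to a neural network, which this sketch does not attempt.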