Sound Field Synthesis with Acoustic Waves

Mohamed F. Mansour
2024-07-13
Abstract:We propose a practical framework to synthesize the broadband sound-field on a small rigid surface based on the physics of sound propagation. The sound-field is generated as a composite map of two components: the room component and the device component, with acoustic plane waves as the core tool for the generation. This decoupling of room and device components significantly reduces the problem complexity and provides accurate rendering of the sound-field. We describe in detail the theoretical foundations, and efficient procedures of the implementation. The effectiveness of the proposed framework is established through rigorous validation under different environment setups.
Sound,Audio and Speech Processing
What problem does this paper attempt to address?
The problems that this paper attempts to solve are as follows: Synthesize a broadband sound field at the microphone array in the room, generate high - quality synthetic audio data in a low - cost and efficient manner, thereby replacing actual data collection. Specifically, this method aims to solve the following problems: 1. **Limitations of traditional methods**: - Although the actual measurement method is accurate, it is costly, time - consuming, and error - prone. - The brute - force simulation method requires solving complex sound wave equations, and for broadband spectra and tests of different device form factors, the calculation time is very long. - The image source method has a poor effect in the low - frequency band and cannot model small surfaces (such as furniture) and rough surfaces (such as curtains). 2. **The proposed new framework**: - By decomposing the sound field into two independent components - the room component and the device component, the complexity of the problem is significantly reduced, and more accurate sound field rendering is provided. - The room component captures the acoustic characteristics in the room through a large - scale microphone array and represents them as the superposition of multiple plane waves. - The device component calculates the influence (fingerprint) of each plane wave on the device surface through simulation or anechoic chamber measurement and combines it into the room component. 3. **Specific objectives**: - Provide an efficient sound field synthesis framework that can reproduce the real sound field under different room and device configurations. - The generated synthetic data can be used to evaluate the performance indicators of voice/audio communication devices, such as the False Rejection Rate (FRR), Word Error Rate (WER), etc. - Reduce the need for actual hardware devices, and only the CAD model of the device is required for performance evaluation in the early design stage. In summary, the main purpose of this paper is to develop an efficient, low - cost, and accurate sound field synthesis method to replace traditional actual data collection and brute - force simulation methods, thereby better supporting the research, development, and testing of voice and audio devices. ### Formula summary - **Plane - wave sound pressure formula**: \[ \psi(\omega, \theta, \phi, \mathbf{r}) = p_0(\omega) e^{-j \mathbf{k}^T \mathbf{r}} \] where \( p_0(\omega) \) is a frequency - dependent scaling factor, and \(\mathbf{k}(\theta, \phi)\) is the wave vector. - **Total sound field formula**: \[ p(\omega, \mathbf{r}) = \sum_{l \in \Lambda} \alpha_l(\omega) \psi(\omega, \theta_l, \phi_l, \mathbf{r}) \] - **Optimization objective function**: \[ J = \int_\omega \left\| y(\omega) - \sum_{l \in \Lambda} \alpha_l(\omega) \bar{\beta}_l(\omega) \right\|^2 + \lambda \sum_{l \in \Lambda} |\alpha_l(\omega)| \] - **Multi - channel transfer function calculation**: \[ \hat{h}(\omega) = \frac{S_{xy}(\omega)}{S_{xx}(\omega)} \] where \[ S_{xx}(\omega) = E\{x^*(t, \omega) x(t, \omega)\} \] \[ S_{xy}(\omega) = E\{x^*(t, \omega) y(t, \omega)\} \] These formulas show how to synthesize the sound field through plane - wave decomposition and optimization methods and calculate the room impulse response (RIR) through system identification techniques.