Physics-Informed Machine Learning For Sound Field Estimation

Shoichi Koyama,Juliano G. C. Ribeiro,Tomohiko Nakamura,Natsuki Ueno,Mirco Pezzoli
2024-08-27
Abstract:The area of study concerning the estimation of spatial sound, i.e., the distribution of a physical quantity of sound such as acoustic pressure, is called sound field estimation, which is the basis for various applied technologies related to spatial audio processing. The sound field estimation problem is formulated as a function interpolation problem in machine learning in a simplified scenario. However, high estimation performance cannot be expected by simply applying general interpolation techniques that rely only on data. The physical properties of sound fields are useful a priori information, and it is considered extremely important to incorporate them into the estimation. In this article, we introduce the fundamentals of physics-informed machine learning (PIML) for sound field estimation and overview current PIML-based sound field estimation methods.
Sound,Audio and Speech Processing
What problem does this paper attempt to address?
The paper is primarily dedicated to addressing the problem of sound field estimation, which involves reconstructing a spatial sound field from a set of discrete microphone measurements. Specifically, sound field estimation techniques are crucial in various applications, including room acoustics analysis, sound field visualization/auditoryization, spatial audio reproduction using speaker arrays or headphones, and active noise cancellation in specific areas. In recent years, VR/AR audio has become a significant application of this technology, as it requires capturing the sound field over a large area using multiple microphones. The paper points out that traditional physics-based methods, while providing accurate estimates, have faced challenges with the rise of data-driven methods. Combining physical prior knowledge with machine learning algorithms to improve estimation accuracy has become a research hotspot. To this end, researchers have introduced Physics-Informed Machine Learning (PIML), particularly Physics-Informed Neural Networks (PINNs), which incorporate physical equations as part of the loss function to constrain the model. This approach prevents unnatural distortions in the estimated results that do not conform to the actual sound field, especially when observational data is sparse. In summary, the paper aims to introduce the basic principles of Physics-Informed Machine Learning and provide an overview of current PIML-based sound field estimation methods. It focuses on how to embed physical properties into different machine learning techniques, such as linear regression, kernel regression, and neural networks. The paper also summarizes the current PIML-based sound field estimation methods, their limitations, and future prospects.