HOMULA-RIR: A Room Impulse Response Dataset for Teleconferencing and Spatial Audio Applications Acquired Through Higher-Order Microphones and Uniform Linear Microphone Arrays

Federico Miotello,Paolo Ostan,Mirco Pezzoli,Luca Comanducci,Alberto Bernardini,Fabio Antonacci,Augusto Sarti

2024-02-22

Abstract:In this paper, we present HOMULA-RIR, a dataset of room impulse responses (RIRs) acquired using both higher-order microphones (HOMs) and a uniform linear array (ULA), in order to model a remote attendance teleconferencing scenario. Specifically, measurements were performed in a seminar room, where a 64-microphone ULA was used as a multichannel audio acquisition system in the proximity of the speakers, while HOMs were used to model 25 attendees actually present in the seminar room. The HOMs cover a wide area of the room, making the dataset suitable also for applications of virtual acoustics. Through the measurement of the reverberation time and clarity index, and sample applications such as source localization and separation, we demonstrate the effectiveness of the HOMULA-RIR dataset.

Audio and Speech Processing,Signal Processing

What problem does this paper attempt to address?

The main purpose of this paper is to introduce a dataset named HOMULA-RIR, which contains room impulse responses (RIRs) collected in real environments using higher-order microphones (HOMs) and uniform linear arrays (ULA). Specifically, the researchers deployed these devices in a seminar room to simulate remote conferencing scenarios. The paper describes the data collection methods, device configurations, and acoustic calibration process, and characterizes the room's acoustic properties through reverberation time measurements and clarity index analysis. Additionally, to validate the dataset's effectiveness, the researchers conducted two typical application tests: blind source separation and sound source localization. The results demonstrate the dataset's effectiveness in these two applications, thereby showcasing its potential application value in telecommunications, remote conferencing, and spatial audio fields.

HOMULA-RIR: A Room Impulse Response Dataset for Teleconferencing and Spatial Audio Applications Acquired Through Higher-Order Microphones and Uniform Linear Microphone Arrays

HARP: A Large-Scale Higher-Order Ambisonic Room Impulse Response Dataset

Dataset of directional room impulse responses for realistic speech data

SoundCam: A Dataset for Finding Humans Using Room Acoustics

Data-driven 3D Room Geometry Inference with a Linear Loudspeaker Array and a Single Microphone

Hearing Anything Anywhere

AV-RIR: Audio-Visual Room Impulse Response Estimation

Deep Sound Field Reconstruction in Real Rooms: Introducing the ISOBEL Sound Field Dataset

RevRIR: Joint Reverberant Speech and Room Impulse Response Embedding using Contrastive Learning with Application to Room Shape Classification

Few-Shot Audio-Visual Learning of Environment Acoustics

AIRCADE: an Anechoic and IR Convolution-based Auralization Data-compilation Ensemble

Online adaptive identification of multichannel systems for audio applications

Iterative Diagonal Unloading Beamforming for Multiple Acoustic Sources Localization Using Compact Sensor Arrays

Realistic multi-microphone data simulation for distant speech recognition

Performance of low-frequency sound zones with very fast room impulse response measurements

Modal smoothing for analysis of room reflections measured with spherical microphone and loudspeaker arrays

IR-GAN: Room Impulse Response Generator for Far-field Speech Recognition

Co-Immersion in Audio Augmented Virtuality: The Case Study of a Static and Approximated Late Reverberation Algorithm

LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization

Room Impulse Response Estimation using Optimal Transport: Simulation-Informed Inference

Deep Room Impulse Response Completion