Abstract:Spatial audio and 3-Dimensional sound rendering techniques play a pivotal and essential role in immersive audio experiences. Head-Related Transfer Functions (HRTFs) are acoustic filters which represent how sound interacts with an individual's unique head and ears anatomy. The use of HRTFs compliant to the subjects anatomical traits is crucial to ensure a personalized and unique spatial experience. This work proposes the implementation of an HRTF individualization method based on anthropometric features automatically extracted from ear images using a Convolutional Neural Network (CNN). Firstly, a CNN is implemented and tested to assess the performance of machine learning on positioning landmarks on ear images. The I-BUG dataset, containing ear images with corresponding 55 landmarks, was used to train and test the neural network. Subsequently, 12 relevant landmarks were selected to correspond to 7 specific anthropometric measurements established by the HUTUBS database. These landmarks serve as a reference for distance computation in pixels in order to retrieve the anthropometric measurements from the ear images. Once the 7 distances in pixels are extracted from the ear image, they are converted in centimetres using conversion factors, a best match method vector is implemented computing the Euclidean distance for each set in a database of 116 ears with their corresponding 7 anthropometric measurements provided by the HUTUBS database. The closest match of anthropometry can be identified and the corresponding set of HRTFs can be obtained for personnalized use. The method is evaluated in its validity instead of the accuracy of the results. The conceptual scope of each stage has been verified and substantiated to function correctly. The various steps and the available elements in the process are reviewed and challenged to define a greater algorithm entity designed for the desired task.

HRTF Interpolation using a Spherical Neural Process Meta-Learner

HRTF Interpolation using a Spherical Neural Process Meta-Learner

Magnitude-Corrected and Time-Aligned Interpolation of Head-Related Transfer Functions

Interpolation Method of Head-Related Transfer Functions Based on Common-Pole/zero Modeling

Head-Related Transfer Function Interpolation with a Spherical CNN

Modeling of Individual Head-Related Transfer Functions (HRTFs) Based on Spatiotemporal and Anthropometric Features Using Deep Neural Networks

Spatial Audio and Individualized HRTFs using a Convolutional Neural Network (CNN)

Modeling of Individual HRTFs Based on Spatial Principal Component Analysis.

Predicting Global Head-Related Transfer Functions From Scanned Head Geometry Using Deep Learning and Compact Representations

HRTF Field: Unifying Measured HRTF Magnitude Representation with Neural Fields

HRTF Estimation in the Wild

A Sparse Spherical Harmonic-Based Model in Subbands for Head-Related Transfer Functions.

Personalized Head-Related Transfer Function Prediction Based on Spatial Grouping

On HRTF Notch Frequency Prediction Using Anthropometric Features and Neural Networks

Sparsity-Constrained Weight Mapping for Head-Related Transfer Functions Individualization from Anthropometric Features

Denoising of photogrammetric dummy head ear point clouds for individual Head-Related Transfer Functions computation

Individual Distance-Dependent HRTFS Modeling Through A Few Anthropometric Measurements

HRTF Individualization: A Survey

Spatial Upsampling of Head-Related Transfer Functions Using a Physics-Informed Neural Network

HRTF upsampling with a generative adversarial network using a gnomonic equiangular projection

Head-Related Transfer Function Modeling Based on Finite-Impulse Response