Abstract:Most studies investigating neural representations of species-specific vocalizations in non-human primates and other species have involved studying neural responses to vocalization tokens. One limitation of such approaches is the difficulty in determining which acoustical features of vocalizations evoke neural responses. Traditionally used filtering techniques are often inadequate in manipulating features of complex vocalizations. Furthermore, the use of vocalization tokens cannot fully account for intrinsic stochastic variations of vocalizations that are crucial in understanding the neural codes for categorizing and discriminating vocalizations differing along multiple feature dimensions. In this work, we have taken a rigorous and novel approach to the study of species-specific vocalization processing by creating parametric "virtual vocalization" models of major call types produced by the common marmoset (Callithrix jacchus). The main findings are as follows. 1) Acoustical parameters were measured from a database of the four major call types of the common marmoset. This database was obtained from eight different individuals, and for each individual, we typically obtained hundreds of samples of each major call type. 2) These feature measurements were employed to parameterize models defining representative virtual vocalizations of each call type for each of the eight animals as well as an overall species-representative virtual vocalization averaged across individuals for each call type. 3) Using the same feature-measurement that was applied to the vocalization samples, we measured acoustical features of the virtual vocalizations, including features not explicitly modeled and found the virtual vocalizations to be statistically representative of the callers and call types. 4) The accuracy of the virtual vocalizations was further confirmed by comparing neural responses to real and synthetic virtual vocalizations recorded from awake marmoset auditory cortex. We found a strong agreement between the responses to token vocalizations and their synthetic counterparts. 5) We demonstrated how these virtual vocalization stimuli could be employed to precisely and quantitatively define the notion of vocalization "selectivity" by using stimuli with parameter values both within and outside the naturally occurring ranges. We also showed the potential of the virtual vocalization stimuli in studying issues related to vocalization categorizations by morphing between different call types and individual callers.

Automatic detection and classification of marmoset vocalizations using deep and recurrent neural networks.

Automated Call Detection for Acoustic Surveys with Structured Calls of Varying Length

Automatic Respiratory Sound Classification Via Multi-Branch Temporal Convolutional Network

Speech neuromuscular decoding based on spectrogram images using conformal predictors with Bi-LSTM.

Utilizing DeepSqueak for automatic detection and classification of mammalian vocalizations: a case study on primate vocalizations

Automatic Sound Event Detection and Classification of Great Ape Calls Using Neural Networks

A Transformer Model for Segmentation, Classification, and Caller Identification of Marmoset Vocalization

Virtual Vocalization Stimuli for Investigating Neural Representations of Species-Specific Vocalizations.

Automated detection of Bornean white-bearded gibbon (Hylobates albibarbis) vocalizations using an open-source framework for deep learning

Automated detection of Bornean white-bearded gibbon (Hylobates albibarbis) vocalisations using an open-source framework for deep learning

Enhancing the analysis of murine neonatal ultrasonic vocalizations: Development, evaluation, and application of different mathematical models

Utilizing synthetic training data for the supervised classification of rat ultrasonic vocalizations

Feature Representations for Automatic Meerkat Vocalization Classification

Acoustic Analysis of Vocal Development in a New World Primate, the Common Marmoset (callithrix Jacchus).

Representation of Conspecific Vocalizations in Amygdala of Awake Marmosets.

Representation of conspecific vocalizations in amygdala of awake marmoset

DCNN for Pig Vocalization and Non-Vocalization Classification: Evaluate Model Robustness with New Data

Marine Mammal Species Classification Using Convolutional Neural Networks and a Novel Acoustic Representation

On the Utility of Speech and Audio Foundation Models for Marmoset Call Analysis

Compensating class imbalance for acoustic chimpanzee detection with convolutional recurrent neural networks