Abstract:Introduction: The auditory system encodes the phonetic features of languages by processing spectro-temporal modulations in speech, which can be described at two time scales: relatively slow amplitude variations over time (AM, further distinguished into the slowest 8–16 Hz) for similar tasks. Methods: Using an observer-based psychophysical method, this study measured the ability of typical-hearing 6-month-olds, 10-month-olds, and adults to detect a change in the vowel or consonant features of consonant-vowel syllables when temporal modulations are selectively degraded. Two acoustically degraded conditions were designed, replacing FM cues with pure tones in 32 frequency bands, and then extracting AM cues in each frequency band with two different low-pass cut- off frequencies: (1) half the bandwidth (Fast AM condition), (2) <8 Hz (Slow AM condition). Results: In the Fast AM condition, results show that with reduced FM cues, 85% of 6-month-olds, 72.5% of 10-month-olds, and 100% of adults successfully categorize phonemes. Among participants who passed the Fast AM condition, 67% of 6-month-olds, 75% of 10-month-olds, and 95% of adults passed the Slow AM condition. Furthermore, across the three age groups, the proportion of participants able to detect phonetic category change did not differ between the vowel and consonant conditions. However, age-related differences were observed for vowel categorization: while the 6- and 10-month-old groups did not differ from one another, they both independently differed from adults. Moreover, for consonant categorization, 10-month-olds were more impacted by acoustic temporal degradation compared to 6-month-olds, and showed a greater decline in detection success rates between the Fast AM and Slow AM conditions. Discussion: The degradation of FM and faster AM cues (>8 Hz) appears to strongly affect consonant processing at 10 months of age. These findings suggest that between 6 and 10 months, infants show different developmental trajectories in the perceptual weight of speech temporal acoustic cues for vowel and consonant processing, possibly linked to phonological attunement.

Modeling early phonetic acquisition from child-centered audio data

A computational model of early language acquisition from audiovisual experiences of young infants

Evaluating computational models of infant phonetic learning across languages

A model of early word acquisition based on realistic-scale audiovisual naming events

The formation of perceptual space in early phonetic acquisition: a cross-linguistic modeling approach

A model of infant speech perception and learning

Predicting non-native speech perception using the Perceptual Assimilation Model and state-of-the-art acoustic models

Assessing language acquisition from parent-child interaction: An event-related potential study on perception of audio-visual cues in infancy

Multimodal Input Aids a Bayesian Model of Phonetic Learning

Phonetic learning as a pathway to language: new data and native language magnet theory expanded (NLM-e)

An open-source voice type classifier for child-centered daylong recordings

Analysing the Impact of Audio Quality on the Use of Naturalistic Long-Form Recordings for Infant-Directed Speech Research

A developmental model of audio-visual attention (MAVA) for bimodal language learning in infants and robots

Phonetic acquisition in cortical dynamics, a computational approach

Statistical learning beyond words in human neonates

Developmental Trends in Auditory Processing Can Provide Early Predictions of Language Acquisition in Young Infants.

The acquisition of speech categories: Beyond perceptual narrowing, beyond unsupervised learning and beyond infancy

Neural indicators of articulator-specific sensorimotor influences on infant speech perception

Examining speech-brain tracking during early bidirectional, free-flowing caregiver-infant interactions

An auditory perspective on phonological development in infancy

Perception Point: Identifying Critical Learning Periods in Speech for Bilingual Networks