Abstract: Computational models that successfully translate neural activity into speech are multiplying in the adult literature, with non-linear convolutional neural network (CNN) approaches joining the more frequently employed linear and mutual information (MI) models. Despite the promise of these methods for uncovering the neural basis of language acquisition by the human brain, similar studies with infants are rare. Existing infant studies rely on simpler cross-correlation and other linear techniques and aim only to establish neural tracking of the broadband speech envelope. Here, three novel computational models were applied to measure whether low-frequency speech envelope information was encoded in infant neural activity. Backward linear and CNN models were applied to estimate speech information from neural activity using linear versus nonlinear approaches, and an MI model measured how well the acoustic stimuli were encoded in infant neural responses. Fifty infants provided EEG recordings at 4, 7, and 11 months of age while listening passively to natural speech (sung nursery rhymes) presented by video with a female singer. Each model computed speech information for these nursery rhymes in two frequency bands, delta (1–4 Hz) and theta (4–8 Hz), which are thought to provide different types of linguistic information. All three models demonstrated significant levels of performance for delta-band and theta-band neural activity from 4 months of age. All models also demonstrated higher accuracy for the delta-band neural response in the infant brain. However, only the linear and MI models showed developmental (age-related) effects, and these developmental effects differed by model. Accordingly, the choice of algorithm used to decode speech envelope information from neural activity in the infant brain may determine the developmental conclusions that can be drawn. Better understanding of the strengths and weaknesses of each modelling approach will be fundamental to improving our understanding of how the human brain builds a language system.
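
To make the MI idea above concrete, the following is a minimal sketch of how mutual information between a speech envelope and a neural signal could be estimated. It is not the estimator used in the study, which the abstract does not specify. It swaps in a simple plug-in estimate over equal-count (rank) bins, and the sampling rate, noise level, bin count, shuffle-based null, and the function names `rank_bins` and `mutual_information` are all assumptions chosen for illustration on synthetic data.

```python
import math
import random
from collections import Counter


def rank_bins(values, n_bins):
    """Assign each sample to one of n_bins equal-count bins based on its rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    bins = [0] * len(values)
    for rank, idx in enumerate(order):
        bins[idx] = rank * n_bins // len(values)
    return bins


def mutual_information(x, y, n_bins=8):
    """Plug-in estimate of I(X;Y) in bits from two equal-length sequences.

    Illustrative histogram estimator only (an assumption of this sketch),
    not the MI method of the study.
    """
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")
    n = len(x)
    bx = rank_bins(x, n_bins)
    by = rank_bins(y, n_bins)
    joint = Counter(zip(bx, by))  # counts for each (x-bin, y-bin) pair
    count_x = Counter(bx)         # marginal counts for x bins
    count_y = Counter(by)         # marginal counts for y bins
    mi = 0.0
    for (i, j), c in joint.items():
        # p(x,y) * log2( p(x,y) / (p(x) p(y)) ), written in terms of counts
        mi += (c / n) * math.log2(c * n / (count_x[i] * count_y[j]))
    return mi


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the toy run is repeatable
    fs = 100                # hypothetical sampling rate in Hz
    n = 2000
    # Toy "envelope": a 2 Hz sinusoid, i.e. inside the 1-4 Hz delta band.
    envelope = [math.sin(2 * math.pi * 2 * t / fs) for t in range(n)]
    # Toy "neural response": the envelope plus Gaussian noise.
    response = [e + rng.gauss(0.0, 0.5) for e in envelope]
    # Null comparison: shuffling destroys the alignment with the envelope.
    shuffled = response[:]
    rng.shuffle(shuffled)
    print("aligned MI (bits):", mutual_information(envelope, response))
    print("shuffled MI (bits):", mutual_information(envelope, shuffled))
```

In this toy setting the aligned estimate should sit well above the shuffled one, which is the kind of contrast a significance test on encoding would rely on. Plug-in estimates are biased upward for small samples, so real analyses typically use bias-corrected or parametric estimators instead.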

Developmental Predictive Coding Model for Early Infancy Mono and Bilingual Vocal Continual Learning

A computational model of early language acquisition from audiovisual experiences of young infants

Modeling early phonetic acquisition from child-centered audio data

InfantNet: A Deep Neural Network for Analyzing Infant Vocalizations

Perception Point: Identifying Critical Learning Periods in Speech for Bilingual Networks

Evaluating computational models of infant phonetic learning across languages

A model of early word acquisition based on realistic-scale audiovisual naming events

A model of infant speech perception and learning

A Neural Network Model of Lexical Competition during Infant Spoken Word Recognition

Learning to Produce Syllabic Speech Sounds via Reward-Modulated Neural Plasticity

Like a Baby: Visually Situated Neural Language Acquisition

Functional reorganization of brain regions supporting non-adjacent dependency learning across the first half year of life

Automatic Assessment of Language Background in Toddlers Through Phonotactic and Pitch Pattern Modeling of Short Vocalizations

Simulating the Acquisition of Lexical Tones from Continuous Dynamic Input

Evolving Learning for Analysing Mood-Related Infant Vocalisation

The formation of perceptual space in early phonetic acquisition: a cross-linguistic modeling approach

A Computational Model of Early Word Learning from the Infant's Point of View

Computational and Robotic Models of Early Language Development: A Review

Decoding speech information from EEG data with 4, 7 and 11 month-old infants: Contrasting convolutional neural network, mutual information-based and backward linear models

Facilitating deep acoustic phenotyping: A basic coding scheme of infant vocalisations preluding computational analysis, machine learning and clinical reasoning

An open-source voice type classifier for child-centered daylong recordings