Abstract:Experimental evidence indicates that neurophysiological responses to well-known meaningful sensory items and symbols (such as familiar objects, faces, or words) differ from those to matched but novel and senseless materials (unknown objects, scrambled faces, and pseudowords). Spectral responses in the high beta- and gamma-band have been observed to be generally stronger to familiar stimuli than to unfamiliar ones. These differences have been hypothesized to be caused by the activation of distributed neuronal circuits or cell assemblies, which act as long-term memory traces for learned familiar items only. Here, we simulated word learning using a biologically constrained neurocomputational model of the left-hemispheric cortical areas known to be relevant for language and conceptual processing. The 12-area spiking neural-network architecture implemented replicates physiological and connectivity features of primary, secondary, and higher-association cortices in the frontal, temporal, and occipital lobes of the human brain. We simulated elementary aspects of word learning in it, focussing specifically on semantic grounding in action and perception. As a result of spike-driven Hebbian synaptic plasticity mechanisms, distributed, stimulus-specific cell-assembly (CA) circuits spontaneously emerged in the network. After training, presentation of one of the learned "word" forms to the model correlate of primary auditory cortex induced periodic bursts of activity within the corresponding CA, leading to oscillatory phenomena in the entire network and spontaneous across-area neural synchronization. Crucially, Morlet wavelet analysis of the network's responses recorded during presentation of learned meaningful "word" and novel, senseless "pseudoword" patterns revealed stronger induced spectral power in the gamma-band for the former than the latter, closely mirroring differences found in neurophysiological data. Furthermore, coherence analysis of the simulated responses uncovered dissociated category specific patterns of synchronous oscillations in distant cortical areas, including indirectly connected primary sensorimotor areas. Bridging the gap between cellular-level mechanisms, neuronal-population behavior, and cognitive function, the present model constitutes the first spiking, neurobiologically, and anatomically realistic model able to explain high-frequency oscillatory phenomena indexing language processing on the basis of dynamics and competitive interactions of distributed cell-assembly circuits which emerge in the brain as a result of Hebbian learning and sensorimotor experience.

A neural network model for encoding and perception of vowel sounds

Formation Of An Auditory Map For Invariant Perception Of Vowel Sounds: Listening To A Variety Of Speakers To Make Unified Vowel Representation

Hierarchical Perception Of Monosyllabic Sounds

A Spiking Neural Network Model for Sound Recognition.

Decomposition and integration of monosyllabic information for auditory perceptual process

Encoding of phonology in a recurrent neural model of grounded speech

A hierarchical sparse coding model predicts acoustic feature encoding in both auditory midbrain and cortex

Sound signal analysis in Japanese speech recognition based on deep learning algorithm

Reverberation Modeling for Source-Filter-Based Neural Vocoder.

The role of vowel and consonant onsets in neural tracking of natural speech

A Spiking Neurocomputational Model of High-Frequency Oscillatory Brain Responses to Words and Pseudowords

Research on the Model of Speech Recognition and Understanding by Using Hierarchical Information Feedback

Mandarin tone modeling using recurrent neural networks

A computational model of early language acquisition from audiovisual experiences of young infants

A Vector Quantized Variational Autoencoder (VQ-VAE) Autoregressive Neural $F_0$ Model for Statistical Parametric Speech Synthesis

Visualising Model Training via Vowel Space for Text-To-Speech Systems

A hierarchical neuronal model for generation and online recognition of birdsongs

A Task-Optimized Neural Network Replicates Human Auditory Behavior, Predicts Brain Responses, and Reveals a Cortical Processing Hierarchy

A model of infant speech perception and learning

Research on deep neural network's hidden layers in phoneme recognition

Neural manifolds carry reactivation of phonetic representations during semantic processing