Abstract:By about 12 months of age, infants show sensitivity to mispronunciations of familiar words when asked to identify a referent. These findings indicate that infants are able to access the phonological detail of words when engaged in lexical recognition. However, most of this work has focused on mispronunciations of consonants and vowels. Very little is known about the role that lexical tones play in constraining lexical access during the early stages of lexical development. In tonal languages (e.g., Chinese), over and above vowel and consonant variations, words are distinguished by lexical tone. Over half the world’s population speak a tonal language. The current study aims to answer the question: Do Chinese infants treat tones as phonological information in their lexical representations as early as 12 months old? Using the intermodal preferential looking paradigm with the mispronunciation task, the current study examined whether Chinese infants at 12 months were sensitive to mispronunciations of lexical tones in monosyllabic, familiar words. 12 infants were in group1, 8 infants were in group2. For group 1, the familiar words were pronounced correctly in block1, while mispronounced with the falling tone in Mandarin (Tone 4) in block 2. The block order was reversed for the infants in group 2. The proportion of target look (PTL) and the difference between infants' longest look at target and distracter images (LLD) before and after naming were calculated. Systematic increment in PTL or LLD across pre- and post-naming phases indicates infants' association of the target label and object. The results showed that both groups of infants could associate the target labels and objects when the labels were correctly pronounced (PTL:group 1, t(11)=1.78, p=0.103;group 2, t(7)=2.95, p= 0.021), while the associations were not found when the labels were mispronounced (PTL: group 1, t(11) =?0.79, p = 0.45; group 2, t(7) = ?0.41, p = 0.70). In other words, the mispronunciation effects were found for both groups. But infants’ sensitivity to tonal mispronunciations was not influenced by their receptive vocabulary size. In conclusion, the results indicate that sufficient phonological information of tones is encoded by 12-month-old Chinese infants.

Automatic Assessment of Language Background in Toddlers Through Phonotactic and Pitch Pattern Modeling of Short Vocalizations

Phonological Specificity of Lexical Tones in 12-Month-old Chinese-speaking Infants

A Novel Application System of Assessing the Pronunciation Differences Between Chinese Children and Adults

Facilitating deep acoustic phenotyping: A basic coding scheme of infant vocalisations preluding computational analysis, machine learning and clinical reasoning

Evaluating computational models of infant phonetic learning across languages

Enhancing Child Vocalization Classification with Phonetically-Tuned Embeddings for Assisting Autism Diagnosis

Automated Sex Classification of Children's Voices and Changes in Differentiating Factors with Age

Visualizations of Complex Sequences of Family-Infant Vocalizations Using Bag-of-Audio-Words Approach Based on Wav2vec 2.0 Features

An open-source voice type classifier for child-centered daylong recordings

Modeling early phonetic acquisition from child-centered audio data

Audio-visual child-adult speaker classification in dyadic interactions

Deep-Learning-Based Automated Classification of Chinese Speech Sound Disorders

A computational model of early language acquisition from audiovisual experiences of young infants

Developing an AI-Assisted Low-Resource Spoken Language Learning App for Children

Evaluating Language Environment Analysis System Performance for Chinese: A Pilot Study in Shanghai

Putting the child in the driver's seat: Insights into language development from children's interactions in preschool classrooms

Who Said What? An Automated Approach to Analyzing Speech in Preschool Classrooms

Research and Implementation of Children's Speech Signal Processing System.

Investigation of the Assessment of Infant Vocalizations by Laypersons

Automated Classification of Phonetic Segments in Child Speech Using Raw Ultrasound Imaging

The Predictability of Naturalistic Evaluation of All-Day Recordings for Speech and Language Development