Abstract: Language and Speech, Ahead of Print. Previous studies have suggested that linguistic information affects voice perception (e.g., the language-familiarity effect [LFE]). However, it remains unclear which specific type of information in speech (acoustic, phonological, lexical, or semantic) contributes to voice perception. It is also underexamined whether the roles of these different types of information are modulated by the experimental paradigm (speaker discrimination vs. speaker identification). In this study, we conducted two experiments to investigate these issues regarding LFEs. Experiment 1 examined the roles of acoustic and phonological information in speaker discrimination and identification with forward and time-reversed Mandarin and Indonesian sentences. Experiment 2 further identified the roles of phonological, lexical, and semantic information with forward, word-scrambled, and reconstructed (consisting of pseudo-Mandarin words) Mandarin and forward Indonesian sentences. For Mandarin-only participants, in Experiment 1, speaker discrimination was more accurate for forward than reversed sentences, but there was no LFE for either sentence type. Speaker identification was also more accurate for forward than reversed sentences and, in contrast to discrimination, there was an LFE for forward sentences. In Experiment 2, speaker discrimination was better for word-scrambled than reconstructed Mandarin sentences. Speaker identification was more accurate for forward and word-scrambled Mandarin sentences than for reconstructed Mandarin and forward Indonesian sentences. In general, the pattern of the results for Indonesian learners was the same as that for Mandarin-only participants. These results suggest that different kinds of information support speaker discrimination and identification in native and unfamiliar languages. The LFE in speaker identification depends on both phonological and lexical information.

Speech Length Threshold in Forensic Speaker Comparison by Using Long-Term Cumulative Formant (LTCF) Analysis

Study of Long-Term Formant Distributions in Forensic Phonetics

Forensic Speech Information Hiding Using Fractional Cosine-Cepstrum Transform

Forensic Speech Enhancement Based on Two-Dimensional Fractional Fourier Transform Domain

Vocal tract characteristic on long-term formant distribution

Simplified Deformation Compensation for Emotional Speaker Recognition

Fusing linguistic and acoustic information for automated forensic speaker comparison

The Distance Measure for Line Spectrum Pairs Applied to Speech Recognition.

Lombard Effect for Bilingual Speakers in Cantonese and English: importance of spectro-temporal features

Quantifying and Correlating Rhythm Formants in Speech

Effect of utterance duration and phonetic content on speaker identification using second-order statistical methods

The Duration Analysis of the Checked Tones in Cantonese Speech

Improving Speaker Verification Performance Against Long-Term Speaker Variability

An Empirical Study of the Effects of Pure Real-World Conditions on the Reliability of Forensic Phonetic Features

Effect of Temporal Fine Structure on Speech Intelligibility Modeling.

Latency Characteristics of Speech Evoked Auditory Brainstem Response

A Weighted Cepstral Distance Measure for Objective Quality Evaluation of Mandarin

Auditory Features with Vocal Track Length Normalization for Language Identification

Separated and reunified: An apparent time investigation of the voice quality differences between Hong Kong Cantonese and Guangzhou Cantonese

Classification of speech dysfluencies using LPC based parameterization techniques

How Different Types of Linguistic Information Impact Voice Perception: Evidence From the Language-Familiarity Effect