Abstract:Introduction: This study employed an explanatory sequential design to examine the impact of utilizing automatic speech recognition technology (ASR) with peer correction on the improvement of second language (L2) pronunciation and speaking skills among English as a Foreign Language (EFL) learners. The aim was to assess whether this approach could be an effective tool for enhancing L2 pronunciation and speaking abilities in comparison to traditional teacher-led feedback and instruction. Methods: A total of 61 intermediate-level Chinese EFL learners were randomly assigned to either a control group (CG) or an experimental group (EG). The CG received conventional teacher-led feedback and instruction, while the EG used ASR technology with peer correction. Data collection involved read-aloud tasks, spontaneous conversations, and IELTS speaking tests to evaluate L2 pronunciation and speaking skills. Additionally, semi-structured interviews were conducted with a subset of the participants to explore their perceptions of the ASR technology and its impact on their language learning experience. Results: The quantitative analysis of the collected data demonstrated that the EG outperformed the CG in all measures of L2 pronunciation, including accentedness and comprehensibility. Furthermore, the EG exhibited significant improvements in global speaking skill compared to the CG. The qualitative analysis of the interviews revealed that the majority of the participants in the EG found the ASR technology to be beneficial in enhancing their L2 pronunciation and speaking abilities. Discussion: The results of this study suggest that the utilization of ASR technology with peer correction can be a potent approach in enhancing L2 pronunciation and speaking skills among EFL learners. The improved performance of the EG compared to the CG in pronunciation and speaking tasks demonstrates the potential of incorporating ASR technology into language learning environments. Additionally, the positive feedback from the participants in the EG underscores the value of using ASR technology as a supportive tool in language learning classrooms.

Impact of ASR Performance on Free Speaking Language Assessment

Impact of ASR Performance on Spoken Grammatical Error Detection.

The Impact of ASR on the Automatic Analysis of Linguistic Complexity and Sophistication in Spontaneous L2 Speech

Use of Graphemic Lexicons for Spoken Language Assessment.

The impact of automatic speech recognition technology on second language pronunciation and speaking skills of EFL learners: a mixed methods investigation

Towards automatic assessment of spontaneous spoken English

Adapting an ASR Foundation Model for Spoken Language Assessment

Investigating the Sensitivity of Automatic Speech Recognition Systems to Phonetic Variation in L2 Englishes

Measuring the Accuracy of Automatic Speech Recognition Solutions

A study of college students' perceptions of utilizing automatic speech recognition technology to assist English oral proficiency

ASR-Free Pronunciation Assessment

Does accuracy matter? Methodological considerations when using automated speech-to-text for social science research

A Computer-Assisted Tool for Automatically Measuring Non-Native Japanese Oral Proficiency

Lost in Transcription: Identifying and Quantifying the Accuracy Biases of Automatic Speech Recognition Systems Against Disfluent Speech

Error-preserving Automatic Speech Recognition of Young English Learners' Language

The Impact of ASR on Speech-to-Speech Translation Performance.

Sequence Teacher-Student Training of Acoustic Models for Automatic Free Speaking Language Assessment.

ASTER: Automatic Speech Recognition System Accessibility Testing for Stutterers

An ASR-free Fluency Scoring Approach with Self-Supervised Learning

ASR Benchmarking: Need for a More Representative Conversational Dataset

Automated Speech Scoring System Under The Lens: Evaluating and interpreting the linguistic cues for language proficiency