Abstract:Good speech intelligibility is the primary goal of acoustic environment control in lecture rooms and auditoria. The speech transmission index (STI) is the primary objective evaluation metric for speech intelligibility. Therefore, accurate prediction of the STI during the design stage is important for acoustic environment control in lecture rooms and auditoria. Two types of STI prediction methods based on simulated impulse responses and statistical are recommended by the IEC standard. In this study, the prediction accuracy and influencing factors of these two types of prediction methods were systematically analysed using STI measurements collected at 25 receiver positions in six rooms. The results reveal the following. (1) When the influence of the signal-to-noise ratio (SNR) is not considered, for the STI prediction method based on simulated impulse responses, the prediction accuracy is in an acceptable range with the average predicted STI discrepancy of the 25 receiver positions being −0.004 and the maximum discrepancy being 0.046. For the STI prediction method based on statistical, the average predicted STI discrepancy of the 25 receiver positions was −0.015, and the maximum discrepancy was −0.091. STI prediction based on statistical exhibited a larger prediction error at the receiver positions close to the sound source and smaller prediction error at the receiver positions farther away from the sound source. (2) When the influence of the SNR was considered, the errors of both prediction methods increased significantly. For the STI prediction method based on simulated impulse responses, the average predicted STI discrepancy of the 25 receiver positions was −0.035 and the maximum discrepancy was −0.088. For the STI prediction method based on statistical, the average predicted STI discrepancy of the 25 receiver positions was −0.030 and the maximum discrepancy was −0.112. (3) Using a sound source that is different from the onsite measurement sound source can induce noticeable errors in both methods. (4) For the STI prediction method based on statistical using the actual Q value of the sound source, the prediction accuracy of receiver positions close to the sound source can be significantly improved and the effect on receiver positions farther away from the sound source is relatively small. (5) Both methods require significant technical expertise from users, rendering it difficult to obtain accurate prediction results during the design stage.

A Time-Weighted Method for Predicting the Intelligibility of Speech in the Presence of Interfering Sounds

A Time Weighting Algorithm of Sound Pressure Level Based on DSP

Effect of Temporal Fine Structure on Speech Intelligibility Modeling.

A Computational Model for Assessment of Speech Intelligibility in Informational Masking

A Data-Driven Speech Intelligibility Assessment Method Using Sum-Sorted Spectrogram Feature

An unprecedented memory of macromolecular helicity induced in an achiral polyisocyanide in water.

Experimental comparisons of speech transmission index prediction methods

Title Non-intrusive intelligibility prediction for Mandarin speech innoise

Non-intrusive intelligibility prediction for Mandarin speech in noise

An instrumental intelligibility metric based on information theory

Spectral-change Enhancement with Prior SNR for the Hearing Impaired

Non-Intrusive Binaural Speech Intelligibility Prediction from Discrete Latent Representations

A Novel Method for Intelligibility Assessment of Nonlinearly Processed Speech in Spaces Characterized by Long Reverberation Times

Speech intelligibility prediction based on a physiological model of the human ear and a hierarchical spiking neural network

The speech intelligibility and applicability of the speech transmission index in large spaces

Exploiting Hidden Representations from a DNN-based Speech Recogniser for Speech Intelligibility Prediction in Hearing-impaired Listeners

Adaptive Beamforming Based on Interference-Plus-Noise Covariance Matrix Reconstruction for Speech Separation

Assessing Level-Dependent Segmental Contribution to the Intelligibility of Speech Processed by Single-Channel Noise-Suppression Algorithms

Nonintrusive objective measurement of speech intelligibility: A review of methodology

An Evaluation of Intrusive Instrumental Intelligibility Metrics

Prediction of speech intelligibility with DNN-based performance measures