Post-processing speech recordings during MRI

Juha Kuortti,Jarmo Malinen,Antti Ojalammi
DOI: https://doi.org/10.48550/arXiv.1509.05254
2016-06-21
Abstract:We discuss post-processing of speech that has been recorded during Magnetic Resonance Imaging (MRI) of the vocal tract. Such speech recordings are contaminated by high levels of acoustic noise from the MRI scanner. Also, the frequency response of the sound signal path is not flat as a result of severe restrictions on recording instrumentation due to MRI technology. The post-processing algorithm for noise reduction is based on adaptive spectral filtering. The speech material consists of samples of prolonged vowel productions that are used for validation of the post-processing algorithm. The comparison data is recorded in anechoic chamber from the same test subject. Formant analysis is carried out for the post-processed speech and the comparison data. Artificially noise-contaminated vowel samples are used for validation experiments to determine performance of the algorithm where using true data would be difficult. The properties of recording instrumentation or the post-processing algorithm do not explain the consistent frequency dependent discrepancy between formant data from experiments during MRI and in anechoic chamber. It is shown that the discrepancy is statistically significant, in particular, where it is largest at 1 kHz and 2 kHz. The reflecting surfaces of the MRI head and neck coil are suspected to change the speech acoustics which results in "external formants" at these frequencies. However, the role of test subject adaptation to noise and constrained space acoustics during an MRI examination cannot be ruled out.
Sound,Biological Physics
What problem does this paper attempt to address?