Abstract:Dynamic images can be modeled as points on a smooth nonlinear manifold embedded in a high dimensional ambient space. The manifold regularization exploits the neighborhood relations of points on this manifold. Image frames sharing similar vocal tract postures are mapped as neighbors on the manifold even if they occur at different times (see the red and green squares). This paper shows the prospective utility of manifold regularization in improving dynamic speech MRI of repetitive and fluent speech tasks at 3 T. This work develops and evaluates a self‐navigated variable density spiral (VDS)‐based manifold regularization scheme to prospectively improve dynamic speech magnetic resonance imaging (MRI) at 3 T. Short readout duration spirals (1.3‐ms long) were used to minimize sensitivity to off‐resonance. A custom 16‐channel speech coil was used for improved parallel imaging of vocal tract structures. The manifold model leveraged similarities between frames sharing similar vocal tract postures without explicit motion binning. The self‐navigating capability of VDS was leveraged to learn the Laplacian structure of the manifold. Reconstruction was posed as a sensitivity‐encoding–based nonlocal soft‐weighted temporal regularization scheme. Our approach was compared with view‐sharing, low‐rank, temporal finite difference, extra dimension‐based sparsity reconstruction constraints. Undersampling experiments were conducted on five volunteers performing repetitive and arbitrary speaking tasks at different speaking rates. Quantitative evaluation in terms of mean square error over moving edges was performed in a retrospective undersampling experiment on one volunteer. For prospective undersampling, blinded image quality evaluation in the categories of alias artifacts, spatial blurring, and temporal blurring was performed by three experts in voice research. Region of interest analysis at articulator boundaries was performed in both experiments to assess articulatory motion. Improved performance with manifold reconstruction constraints was observed over existing constraints. With prospective undersampling, a spatial resolution of 2.4 × 2.4 mm2/pixel and a temporal resolution of 17.4 ms/frame for single‐slice imaging, and 52.2 ms/frame for concurrent three‐slice imaging, were achieved. We demonstrated implicit motion binning by analyzing the mechanics of the Laplacian matrix. Manifold regularization demonstrated superior image quality scores in reducing spatial and temporal blurring compared with all other reconstruction constraints. While it exhibited faint (nonsignificant) alias artifacts that were similar to temporal finite difference, it provided statistically significant improvements compared with the other constraints. In conclusion, the self‐navigated manifold regularized scheme enabled robust high spatiotemporal resolution dynamic speech MRI at 3 T.

Post-processing speech recordings during MRI

Assessment of velopharyngeal function with dual-planar high-resolution real-time spiral dynamic MRI.

A Comparison of Denoising Approaches for Spoken Word Production Related Artefacts in Continuous Multiband fMRI Data

MRI acoustic noise: sound pressure and frequency analysis

Measurement of acoustic and anatomic changes in oral and maxillofacial surgery patients

Automatic segmentation of vocal tract articulators in real-time magnetic resonance imaging

Reconstructing Speech from Real-Time Articulatory MRI Using Neural Vocoders

Extraction of overt verbal response from the acoustic noise in a functional magnetic resonance imaging scan by use of segmented active noise cancellation.

Improving Patient Comfort in MRI with Predictive Acoustic Noise Cancelling

Acoustic Noise of MRI Scans of the Internal Auditory Canal and Potential for Intracochlear Physiological Changes

Self-navigated subspace reconstruction for real-time MR imaging of the vocal tract

4D magnetic resonance imaging atlas construction using temporally aligned audio waveforms in speech

Speaker dependent articulatory-to-acoustic mapping using real-time MRI of the vocal tract

Lowering The Acoustic Noise Burden in MRI with Predictive Noise Canceling

Spatio-Temporal Resolution Enhancement of Vocal Tract MRI Sequences—A Comparison Among Wiener Filter Based Methods

Enhancing linguistic research through 2-mm isotropic 3D dynamic speech MRI optimized by sparse temporal sampling and low-rank reconstruction

MRI gradient coil cylinder sound field simulation and measurement

Deep Speech Synthesis from MRI-Based Articulatory Representations

Prospectively accelerated dynamic speech magnetic resonance imaging at 3 T using a self‐navigated spiral‐based manifold regularized scheme

A multispeaker dataset of raw and reconstructed speech production real-time MRI video and 3D volumetric images

Silent Speech and Emotion Recognition from Vocal Tract Shape Dynamics in Real-Time MRI