An Improved Glottal Flow Model Based on Seq2Seq LSTM for Simulation of Vocal Fold Vibration

Yang Zhang,Tianmei Pu,Chunhua Zhou,Hongming Cai
DOI: https://doi.org/10.1016/j.jvoice.2022.03.029
Abstract:Objectives: An improved data-driven glottal flow model for fluid-structure interaction (FSI) simulation of the vocal fold vibration is proposed in this paper. This model aims to improve the prediction performance of the previously developed deep neural network (DNN) based empirical flow model (EFM)1 on accuracy and efficiency. Methods: A Seq2Seq long short-term memory (LSTM) network is employed in the present model to infer the flow rate and pressure distribution from the subglottal pressure and cross-section area distribution of the glottis. The training data is collected from the generalized glottal shape library generated in Zhang et al.1 RESULTS AND CONCLUSIONS: Compared to the EFM, the present model not only discards the time-consuming optimization process, but also drastically reduces the errors, therefore the prediction performance can be greatly improved. The present model is evaluated by coupling with a solid dynamics solver for FSI simulation, and the results demonstrate a great improvement on accuracy and efficiency.
What problem does this paper attempt to address?