Abstract: In recent years, as the research objects of phonetics have expanded to accented and colloquial natural speech, the construction of dialect-accented Mandarin speech databases has become an important research direction in computer technology. Speech segmentation is a time-consuming and laborious step in building such a database, and artificial intelligence technology can improve the efficiency of that work. This article therefore studies the application of artificial intelligence algorithms to the automatic segmentation of dialect-accented Mandarin. It constructs a speech corpus of dialect-accented Mandarin and describes the construction process in detail. It then proposes a new automatic speech segmentation method that combines artificial intelligence algorithms with the HMM (hidden Markov model) and the Viterbi algorithm. For the segmentation model, general parameters are extracted from the training data in the Mandarin corpus and used for HMM training, and the accuracy of the method is verified on the speech of the test set. The experimental results show that, on speech data from 60 speakers, the error in the time boundaries of each sentence is below 5 ms in 79.16% of cases, below 10 ms in 82.96%, below 20 ms in 83.14%, and below 50 ms in 86.92%. These results indicate that the proposed algorithm can meet the needs of practical automatic speech segmentation.
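As an illustration of the HMM and Viterbi combination named in the abstract, the following is a minimal sketch of Viterbi forced alignment over a strictly left-to-right state sequence, together with a tolerance-based boundary score of the kind reported above. It is not the paper's implementation: the function names, the per-frame log-likelihood input, the no-skip topology, and the 10 ms frame shift are all assumptions made for the example.

```python
import math


def forced_align(loglik):
    """Viterbi alignment of T frames to S left-to-right states (no skips).

    loglik[t][s] is the log-likelihood of frame t under state s
    (hypothetical input; a trained HMM would supply these values).
    Returns the state index assigned to each frame.
    """
    T, S = len(loglik), len(loglik[0])
    if T < S:
        raise ValueError("need at least one frame per state")
    neg = -math.inf
    score = [[neg] * S for _ in range(T)]
    back = [[0] * S for _ in range(T)]
    score[0][0] = loglik[0][0]  # the path must start in the first state
    for t in range(1, T):
        for s in range(S):
            stay = score[t - 1][s]
            move = score[t - 1][s - 1] if s > 0 else neg
            if move > stay:
                score[t][s], back[t][s] = move + loglik[t][s], s - 1
            else:
                score[t][s], back[t][s] = stay + loglik[t][s], s
    # The path must end in the last state; follow back-pointers to frame 0.
    path = [S - 1]
    for t in range(T - 1, 0, -1):
        path.append(back[t][path[-1]])
    path.reverse()
    return path


def boundaries_ms(path, frame_shift_ms=10.0):
    """Start time in ms of each state along the path (frame shift assumed)."""
    return [t * frame_shift_ms for t in range(len(path))
            if t == 0 or path[t] != path[t - 1]]


def within_tolerance(predicted, reference, tol_ms):
    """Fraction of boundaries whose absolute error is below tol_ms."""
    hits = sum(abs(p - r) < tol_ms for p, r in zip(predicted, reference))
    return hits / len(reference)


# Toy example: 6 frames, 3 states, each state clearly preferred for 2 frames.
toy = [[0, -5, -5], [0, -5, -5], [-5, 0, -5],
       [-5, 0, -5], [-5, -5, 0], [-5, -5, 0]]
toy_path = forced_align(toy)
toy_bounds = boundaries_ms(toy_path)
```

Evaluating `within_tolerance` at several thresholds (for example 5, 10, 20, and 50 ms) against hand-labelled boundaries would give figures comparable in form to the percentages in the abstract.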

Research on acoustic Model of Putian Dialect Speech Recognition Based on Deep Learning

An Acoustic Model for English Speech Recognition Based on Deep Learning

Phonotactic language recognition based on DNN-HMM acoustic model

Effective Acoustic Modeling for Pronunciation Quality Scoring of Strongly Accented Mandarin Speech

Investigation of Deep Neural Network Acoustic Modelling Approaches for Low Resource Accented Mandarin Speech Recognition

Chinese Dialect Speech Recognition Based on End-to-end Machine Learning

Chinese dialect speech recognition: a comprehensive survey

A comparative study on selecting acoustic modeling units in deep neural networks based large vocabulary Chinese speech recognition

Deep neural networks for syllable based acoustic modeling in Chinese speech recognition.

A reweighting method for speech recognition with imbalanced data of Mandarin and sub-dialects

Toward a Better Understanding of Deep Neural Network Based Acoustic Modelling: An Empirical Investigation

Acoustic Modeling Based On Chinese Phonetics Knowledge

A New Acoustic Modeling of Inter-Syllable Context-Dependent Units for Putonghua Continuous Speech Recognition

DLD: An Optimized Chinese Speech Recognition Model Based on Deep Learning

Application of the Artificial Intelligence Algorithm in the Automatic Segmentation of Mandarin Dialect Accent

Research on speech recognition models in the Chinese dictation machine

The Research of Acoustic Layer Recognition Based on Pinyin Model

Selection of acoustic modeling unit for Tibetan speech recognition based on deep learning

ACOUSTIC MODEL COMPARISONS FOR AN EMBEDDED PHONEME-BASED MANDARIN NAME DIALING SYSTEM

Deep joint learning for language recognition