Automated Speech Recognition System for Dispatching Call Recordings in The Underground Coal Mines.

Guoyuan Lin,Lei Zhao,Jueting Liu,Zemeng Liu,Minda Yao,Wei Chen ,Yingchun Liu,Zehua Wang,Hengbo Li
DOI: https://doi.org/10.1145/3614008.3614062
2023-01-01
Abstract:In this paper, we proposed an automated speech recognition system focus on the dispatching call recordings in the underground coal mines which promoted the development of intelligent coal mining. The main challenges of the speech recognition system are the noise of recordings and the dialect speech. We employed a voice activity detection module to preprocess the recordings, this module is able to reduce the noise and segment the long recording speech; then the Conformer model with CTC algorithm is utilized to train the ASR module. To get better performance, the WenetSpeech pretrained model is embedded for fine-tuning. The result shows that compared with the other general speech recognition systems, our ASR system has great advance in recognizing the dispatching call recordings of Huaibei dialect.
What problem does this paper attempt to address?