An AI-Based Automatic Speech Recognition for Chongqing Dialect Older Adults with Cognitive Impairment

Meiwei Zhang,Wenyuan Li,Lihua Chen,Qiushi Cui,Juan Yu,Weihua Yu,Xianglin Wang,Junjin Liu,Maosheng Gao,Yang Lü
DOI: https://doi.org/10.1109/imsa61967.2024.10652871
2024-01-01
Abstract:Mini-Mental State Examination (MMSE) is a standard method for the clinical diagnosis of cognitive impairment and is performed through conversations between doctors and patients. This paper presents an idea of using artificial intelligence (AI) based automatic speech recognition (ASR) technology to replace clinicians in MMSE. The paper is focused on the AI-based ASR technology for identifying Chongqing (western China) dialect older adults with cognitive impairment. A dual-input gated convolutional neural network (DIGCNN) model is designed to meet the two challenges - the pronunciation of Chongqing dialect and the slurred speech of older adults with cognitive impairment. In the proposed model, the verbal communication mechanism between doctors and patients in the clinic scenes is simulated. The proposed model was compared with other deep neural network (DNN) based ASR models, indicating its outperformance. The proposed method was applied to real clinic subjects in the MMSE scoring tests and reached an accuracy rate of 92% in comparison with the total average MMSE score marked by expert clinicians. It is a successful and significant step towards the application of the AI-based ASR technology to the diagnosis of patients with cognitive impairment and Alzheimer disease.
What problem does this paper attempt to address?