Speaker Recognition with Voice Evoked EEG

Lang Hu, Li Zhu, Hui Huang, Guang Lin, Bin Ren, Jianhai Zhang
DOI: https://doi.org/10.1109/bibm52615.2021.9669324
2021-01-01
Abstract: Traditional speaker recognition is based on individual differences in the voice's acoustic parameters that arise from the structural characteristics of the vocal organs. However, it considers recognition accuracy only from the speaker's side and ignores the listener's side. In this paper, we explore speaker recognition based on voice-evoked EEG (electroencephalography), covering experiment design, feature extraction, classification, and channel selection. The subject (listener) was asked to listen to audio-text stimuli from four speakers, and the subject's EEG was used to decode the speaker. The extracted time-domain and time-frequency-domain features were fed into a Siamese network trained with both inter-class and intra-class samples. The empirical results show that 1) the delta band (0.1-3 Hz) and the high gamma band (51-80 Hz) yield higher recognition accuracy than the other frequency bands; 2) the frontal and parietal lobes play an important role; 3) recognition performance improves when the subject pays attention; 4) when the listener is familiar with the speakers, recognition accuracy is significantly higher than in the unfamiliar case. This work provides insight into the underlying brain mechanisms and a data processing method for auditory brain-computer interfaces (BCI).
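As an illustration of the pairing idea behind Siamese training, the sketch below builds intra-class pairs (two EEG trials evoked by the same speaker) and inter-class pairs (trials evoked by different speakers) from a list of speaker labels. The abstract does not state how pairs were sampled, so the exhaustive pairing, the function name `make_pairs`, and the example labels are all assumptions made for illustration, not the authors' method.

```python
from itertools import combinations


def make_pairs(labels):
    """Return (i, j, same) index pairs over all trials (hypothetical helper).

    same == 1 marks an intra-class pair (both trials evoked by the same speaker);
    same == 0 marks an inter-class pair (trials evoked by different speakers).
    """
    pairs = []
    for i, j in combinations(range(len(labels)), 2):
        pairs.append((i, j, 1 if labels[i] == labels[j] else 0))
    return pairs


# Assumed toy data: 8 EEG trials, two per speaker, speakers numbered 0..3.
speaker_labels = [0, 0, 1, 1, 2, 2, 3, 3]
pairs = make_pairs(speaker_labels)
intra = [p for p in pairs if p[2] == 1]
inter = [p for p in pairs if p[2] == 0]
print(len(pairs), len(intra), len(inter))
```

With four speakers, inter-class pairs heavily outnumber intra-class pairs under exhaustive pairing, so a practical pipeline would likely subsample or reweight them; that choice is likewise not specified in the abstract.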