Esaa: An Eeg-Speech Auditory Attention Detection Database

Longhan Xie,Jia Li,Peiwen Li,Siqi Cai,Enze Su,Haizhou Li
DOI: https://doi.org/10.1109/O-COCOSDA202257103.2022.9997944
2022-11-01
Abstract:Humans are able to listen to a particular sound source in a noisy environment, an ability which is referred to as the cocktail party effect. Auditory attention detection (AAD) sheds light on the neural mechanisms of the cocktail party problem and enables neuro-steered hearing prostheses. In this paper, we build a database for AAD research, which consists of competing speech stimuli and associated human neural responses, i.e, electroencephalography (EEG) recordings, namely EEG-Speech AAD (ESAA) database. This is the first AAD database with speech stimuli in a tonal language (Mandarin), which contains 12.7 hours of data collected from 20 subjects. Moreover, we develop an AAD baseline as a reference model for decoding which speech stream a listening subject is attending to (speaker attention detection), and a baseline for decoding which spatial locus a listening subject is attending to (speaker locus attention detection) on the ESAA database. We achieve the accuracy of 84.6% and 84.3% for speaker and speaker locus attention detection with 64-channel and 1-second decision window, respectively.
Engineering,Medicine,Computer Science
What problem does this paper attempt to address?