'Are you even listening?' - EEG-based decoding of absolute auditory attention to natural speech

Arnout Roebben, Nicolas Heintz, Simon Geirnaert, Tom Francart, Alexander Bertrand
DOI: https://doi.org/10.1101/2023.12.14.571397
2024-06-25
Abstract: Objective. In this study, we use electroencephalography (EEG) recordings to determine whether a subject is actively listening to a presented speech stimulus. More precisely, we aim to discriminate between an active listening condition, and a distractor condition where subjects focus on an unrelated distractor task while being exposed to a speech stimulus. We refer to this task as absolute auditory attention decoding. Approach. We re-use an existing EEG dataset where the subjects watch a silent movie as a distractor condition, and introduce a new dataset with two distractor conditions (silently reading a text and performing arithmetic exercises). We focus on two EEG features, namely neural envelope tracking (NET) and spectral entropy (SE). Additionally, we investigate whether the detection of such an active listening condition can be combined with a selective auditory attention decoding task, where the goal is to decide to which of multiple competing speakers the subject is attending. The latter is a key task in so-called neuro-steered hearing devices that aim to suppress unattended audio, while preserving the attended speaker. Main results. Contrary to a previous hypothesis of higher SE being related with actively listening rather than passively listening (without any distractors), we find significantly lower SE in the active listening condition compared to the distractor conditions. Nevertheless, the NET is consistently significantly higher when actively listening. Similarly, we show that the accuracy of a selective auditory attention decoding task improves when evaluating the accuracy only on the highest NET segments. However, the reverse is observed when evaluating the accuracy only on the lowest SE segments. Significance. We conclude that the NET is more reliable for decoding absolute auditory attention as it is consistently higher when actively listening, whereas the relation of the SE between actively and passively listening seems to depend on the nature of the distractor.
Neuroscience
What problem does this paper attempt to address?
### The problems the paper attempts to solve

The paper uses electroencephalography (EEG) recordings to determine whether a subject is actively listening to a presented speech stimulus. Specifically, it aims to distinguish an active listening condition from a distraction condition, in which the subject focuses on a task unrelated to the speech stimulus while still being exposed to it. The authors refer to this task as Absolute Auditory Attention Decoding (aAAD).

### Research background

The human auditory system can focus on a single speaker of interest while filtering out competing auditory stimuli, a process known as selective auditory attention. Algorithms that decode this selective attention from neural activity (for example, from EEG recordings) have attracted wide attention in recent years. However, existing research has mainly focused on Selective Auditory Attention Decoding (sAAD), that is, identifying which of multiple competing speakers the subject is attending to. Absolute Auditory Attention Decoding (aAAD), that is, determining whether the subject is actively listening to the presented auditory stimulus at all, has received less attention.

### Research methods

1. **Data sets**:
   - Re-used an existing EEG data set in which the subjects watched a silent movie as the distraction condition.
   - Introduced a new data set with two distraction conditions: silently reading a text and performing arithmetic exercises.
2. **Feature extraction**:
   - **Neural Envelope Tracking (NET)**: NET measures how strongly the neural response tracks the speech envelope, on the assumption that tracking is stronger when the subject attends to the speech.
   - **Spectral Entropy (SE)**: SE is a spectral EEG feature that measures the regularity and predictability of the EEG signal.
3. **Experimental design**:
   - **Active listening condition**: Subjects were instructed to pay attention to the presented speech stimulus.
   - **Distraction condition**: Subjects were instructed to focus on the distraction task (watching a silent movie, silently reading a text, or performing arithmetic exercises) while being exposed to the speech stimulus.

### Main results

- **NET**: NET was consistently and significantly higher in the active listening condition than in the distraction conditions.
- **SE**: Contrary to a previous hypothesis that SE is higher during active listening than during passive listening (without distractors), SE was significantly lower in the active listening condition than in the distraction conditions.
- **Combination with sAAD**: sAAD accuracy improved when it was evaluated only on the segments with the highest NET, whereas the reverse was observed when it was evaluated only on the segments with the lowest SE.

### Conclusions

- **NET**: NET is consistently higher in the active listening condition, which makes it the more reliable feature for decoding absolute auditory attention.
- **SE**: The relation of SE between active and passive listening seems to depend on the nature of the distractor, which makes SE a less reliable marker of absolute auditory attention.

### Significance

The results indicate that NET is the more reliable feature for decoding whether a subject is actively listening. In addition, detecting active listening can complement sAAD in neuro-steered hearing devices, which aim to suppress unattended audio while preserving the attended speaker: sAAD accuracy is higher on the segments with the highest NET.
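
To make the two features more concrete, the following is a minimal Python sketch, not the authors' pipeline. It assumes that SE is the Shannon entropy of the normalized power spectrum and that NET is the Pearson correlation between the speech envelope and an envelope reconstructed from time-lagged EEG by a ridge-regularized least-squares decoder, a common choice in the attention-decoding literature. The function names (`spectral_entropy`, `lagged`, `fit_decoder`, `net`, `demo`), the lag count, the ridge value, and the synthetic data are all hypothetical and are not taken from the paper.

```python
import numpy as np


def spectral_entropy(x, normalize=True):
    """Shannon entropy of the normalized power spectrum of a 1-D signal.

    Assumes the signal is not constant (otherwise the spectrum sums to zero).
    """
    x = np.asarray(x, dtype=float)
    power = np.abs(np.fft.rfft(x - x.mean())) ** 2
    power = power[1:]                  # drop the DC bin
    p = power / power.sum()            # spectrum as a probability distribution
    p = p[p > 0]                       # treat 0 * log(0) as 0
    h = -np.sum(p * np.log(p))
    return h / np.log(power.size) if normalize else h


def lagged(eeg, n_lags):
    """Row t holds eeg[t], eeg[t+1], ..., eeg[t+n_lags-1], zero-padded at the end.

    Later EEG samples are used because the neural response follows the stimulus.
    """
    n, c = eeg.shape
    out = np.zeros((n, c * n_lags))
    for k in range(n_lags):
        out[: n - k, k * c:(k + 1) * c] = eeg[k:]
    return out


def fit_decoder(eeg, envelope, n_lags=8, ridge=1e-3):
    """Ridge-regularized least-squares decoder mapping lagged EEG to the envelope."""
    X = lagged(eeg, n_lags)
    R = X.T @ X
    R = R + ridge * np.trace(R) / R.shape[0] * np.eye(R.shape[0])
    return np.linalg.solve(R, X.T @ envelope)


def net(eeg, envelope, decoder, n_lags=8):
    """Pearson correlation between the reconstructed and the true envelope."""
    reconstruction = lagged(eeg, n_lags) @ decoder
    return np.corrcoef(reconstruction, envelope)[0, 1]


def demo(seed=0, n=4000, channels=8, delay=2):
    """Synthetic illustration only: 'attended' EEG contains a delayed copy of
    the envelope, 'distracted' EEG is pure noise."""
    rng = np.random.default_rng(seed)
    env = rng.standard_normal(n)               # stand-in for a centred envelope
    mix = np.linspace(0.5, 1.5, channels)      # hypothetical channel weights
    attended = np.roll(env, delay)[:, None] * mix + rng.standard_normal((n, channels))
    distracted = rng.standard_normal((n, channels))
    half = n // 2
    decoder = fit_decoder(attended[:half], env[:half])   # train on first half
    return (net(attended[half:], env[half:], decoder),   # test on second half
            net(distracted[half:], env[half:], decoder))


if __name__ == "__main__":
    net_attended, net_distracted = demo()
    print("NET (attended):  ", round(net_attended, 3))
    print("NET (distracted):", round(net_distracted, 3))
```

In this toy setting the NET of the "attended" segment should be clearly higher than that of the "distracted" segment, mirroring the direction of the effect the paper reports. A real analysis would additionally need band-pass filtering, envelope extraction from the audio, and per-subject cross-validation, none of which is shown here.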