Abstract:Abstract Objective. Auditory attention in complex scenarios can be decoded by electroencephalography (EEG)-based cortical speech-envelope tracking. The relative root-mean-square (RMS) intensity is a valuable cue for the decomposition of speech into distinct characteristic segments. To improve auditory attention decoding (AAD) performance, this work proposed a novel segmented AAD approach to decode target speech envelopes from different RMS-level-based speech segments. Approach. Speech was decomposed into higher- and lower-RMS-level speech segments with a threshold of −10 dB relative RMS level. A support vector machine classifier was designed to identify higher- and lower-RMS-level speech segments, using clean target and mixed speech as reference signals based on corresponding EEG signals recorded when subjects listened to target auditory streams in competing two-speaker auditory scenes. Segmented computational models were developed with the classification results of higher- and lower-RMS-level speech segments. Speech envelopes were reconstructed based on segmented decoding models for either higher- or lower-RMS-level speech segments. AAD accuracies were calculated according to the correlations between actual and reconstructed speech envelopes. The performance of the proposed segmented AAD computational model was compared to those of traditional AAD methods with unified decoding functions. Main results. Higher- and lower-RMS-level speech segments in continuous sentences could be identified robustly with classification accuracies that approximated or exceeded 80% based on corresponding EEG signals at 6 dB, 3 dB, 0 dB, −3 dB and −6 dB signal-to-mask ratios (SMRs). Compared with unified AAD decoding methods, the proposed segmented AAD approach achieved more accurate results in the reconstruction of target speech envelopes and in the detection of attentional directions. Moreover, the proposed segmented decoding method had higher information transfer rates (ITRs) and shorter minimum expected switch times compared with the unified decoder. Significance. This study revealed that EEG signals may be used to classify higher- and lower-RMS-level-based speech segments across a wide range of SMR conditions (from 6 dB to −6 dB). A novel finding was that the specific information in different RMS-level-based speech segments facilitated EEG-based decoding of auditory attention. The significantly improved AAD accuracies and ITRs of the segmented decoding method suggests that this proposed computational model may be an effective method for the application of neuro-controlled brain–computer interfaces in complex auditory scenes.

Auditory Attention Detection via Cross-Modal Attention

Decoding auditory attention (in real time) with eeg

EEG-Based Auditory Attention Detection via Frequency and Channel Neural Attention

Auditory attention detection with EEG channel attention

Low Latency Auditory Attention Detection with Common Spatial Pattern Analysis of EEG Signals.

A Neural-Inspired Architecture for EEG-Based Auditory Attention Detection

Detecting the Locus of Auditory Attention Based on the Spectro-Spatial-temporal Analysis of EEG.

EEG-Based Fast Auditory Attention Detection in Real-Life Scenarios Using Time-Frequency Attention Mechanism.

Using Ear-EEG to Decode Auditory Attention in Multiple-speaker Environment

A Multi-Plane Decoupled Convolutional Network for EEG-Based Auditory Attention Detection

EEG-Based Short-Time Auditory Attention Detection Using Multi-Task Deep Learning.

EEG-based auditory attention decoding with audiovisual speech for hearing-impaired listeners

EEG-based auditory attention decoding using speech-level-based segmented computational models

Auditory Attention Detection in Real-Life Scenarios Using Common Spatial Patterns from EEG

Music-oriented Auditory Attention Detection from Electroencephalogram.

Improved Decoding of Attentional Selection in Multi-Talker Environments with Self-Supervised Learned Speech Representation

Esaa: An Eeg-Speech Auditory Attention Detection Database

Auditory Attention Decoding in Four-Talker Environment with EEG

Auditory Attention Decoding with Task-Related Multi-View Contrastive Learning

A Biologically Inspired Attention Network for EEG-Based Auditory Attention Detection

Decoding Selective Auditory Attention with EEG Using a Transformer Model