TAnet: A New Temporal Attention Network for EEG-based Auditory Spatial Attention Decoding with a Short Decision Window

Yuting Ding,Fei Chen
2024-05-14
Abstract:Auditory spatial attention detection (ASAD) is used to determine the direction of a listener's attention to a speaker by analyzing her/his electroencephalographic (EEG) signals. This study aimed to further improve the performance of ASAD with a short decision window (i.e., <1 s) rather than with long decision windows ranging from 1 to 5 seconds in previous studies. An end-to-end temporal attention network (i.e., TAnet) was introduced in this work. TAnet employs a multi-head attention (MHA) mechanism, which can more effectively capture the interactions among time steps in collected EEG signals and efficiently assign corresponding weights to those EEG time steps. Experiments demonstrated that, compared with the CNN-based method and recent ASAD methods, TAnet provided improved decoding performance in the KUL dataset, with decoding accuracies of 92.4% (decision window 0.1 s), 94.9% (0.25 s), 95.1% (0.3 s), 95.4% (0.4 s), and 95.5% (0.5 s) with short decision windows (i.e., <1 s). As a new ASAD model with a short decision window, TAnet can potentially facilitate the design of EEG-controlled intelligent hearing aids and sound recognition systems.
Signal Processing,Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to improve the performance of electroencephalogram (EEG) - based auditory spatial attention decoding (ASAD) under the condition of a short decision window (i.e., less than 1 second). Traditional ASAD methods usually use a longer decision window (1 to 5 seconds), which is not ideal in application scenarios requiring a rapid response. Therefore, this paper proposes a new end - to - end temporal attention network (TAnet), aiming to more effectively capture and process the temporal features in EEG signals through the multi - head attention mechanism (MHA), thereby achieving higher decoding accuracy within an extremely short decision window (such as 0.1 to 0.5 seconds). Specifically, TAnet utilizes the multi - head attention mechanism to dynamically allocate weights to different EEG time steps, which can more accurately capture the dynamic changes of auditory attention. The experimental results show that TAnet performs excellently on the KUL dataset. In particular, under a 0.1 - second decision window, the decoding accuracy rate reaches 92.4%, and as the decision window increases, the accuracy rate further improves to a maximum of 95.5% (0.5 seconds). These results are significantly better than other existing ASAD methods, such as STAnet and EEG - Graph Net.