Abstract:Steady-state visual evoked potential (SSVEP) is a key technique of electroencephalography (EEG)-based brain-computer interfaces (BCI), which has been widely applied to neurological function assessment and postoperative rehabilitation. However, accurate decoding of the user's intended based on the SSVEP-EEG signals is challenging due to the low signal-to-noise ratio and large individual variability of the signals. To address these issues, we proposed a parallel multi-band fusion convolutional neural network (PMF-CNN). Multi frequency band signals were served as the input of PMF-CNN to fully utilize the time-frequency information of EEG. Three parallel modules, spatial self-attention (SAM), temporal self-attention (TAM), and squeeze-excitation (SEM), were proposed to automatically extract multi-dimensional features from spatial, temporal, and frequency domains, respectively. A novel spatial-temporal-frequency representation were designed to capture the correlation of electrode channels, time intervals, and different sub-harmonics by using SAM, TAM, and SEM, respectively. The three parallel modules operate independently and simultaneously. A four layers CNN classification module was designed to fuse parallel multi-dimensional features and achieve the accurate classification of SSVEP-EEG signals. The PMF-CNN was further interpreted by using brain functional connectivity analysis. The proposed method was validated using two large publicly available datasets. After trained using our proposed dual-stage training pattern, the classification accuracies were 99.37% and 93.96%, respectively, which are superior to the current state-of-the-art SSVEP-EEG classification algorithms. The algorithm exhibits high classification accuracy and good robustness, which has the potential to be applied to postoperative rehabilitation.

MSFNet: Multi-Scale Fusion Network for Brain-Controlled Speaker Extraction

Audio-Visual Speech Enhancement with Deep Multi-modality Fusion

A Multi-Scale Fusion Convolutional Neural Network Based on Attention Mechanism for the Visualization Analysis of EEG Signals Decoding

BASEN: Time-Domain Brain-Assisted Speech Enhancement Network with Convolutional Cross Attention in Multi-talker Conditions

NeuroHeed: Neuro-Steered Speaker Extraction using EEG Signals

NeuroSpex: Neuro-Guided Speaker Extraction with Cross-Modal Attention

Multimodal Speech Recognition Using EEG and Audio Signals: A Novel Approach for Enhancing ASR Systems

AMFFCN: Attentional Multi-layer Feature Fusion Convolution Network for Audio-visual Speech Enhancement

NeuroHeed+: Improving Neuro-steered Speaker Extraction with Joint Auditory Attention Detection

Binaural Selective Attention Model for Target Speaker Extraction

Improving Visual Speech Enhancement Network by Learning Audio-visual Affinity with Multi-head Attention

PMF-CNN: Parallel multi-band fusion convolutional neural network for SSVEP-EEG decoding

Sparsity-Driven EEG Channel Selection for Brain-Assisted Speech Enhancement

MBCFNet: A Multimodal Brain–Computer Fusion Network for Human Intention Recognition

Extracting the Auditory Attention in a Dual-Speaker Scenario From EEG Using a Joint CNN-LSTM Model

Independent Feature Enhanced Crossmodal Fusion for Match-Mismatch Classification of Speech Stimulus and EEG Response

MSVTNet: Multi-Scale Vision Transformer Neural Network for EEG-Based Motor Imagery Decoding

MSHANet: a multi-scale residual network with hybrid attention for motor imagery EEG decoding

MMASleepNet: A multimodal attention network based on electrophysiological signals for automatic sleep staging

X-CrossNet: A complex spectral mapping approach to target speaker extraction with cross attention speaker embedding fusion

EEG-based Auditory Attention Detection with Spiking Graph Convolutional Network