Squeeze and Excitation-Based Multiscale CNN for Classification of Steady-State Visual Evoked Potentials
Jing Jin,Xiao Wu,Ian Daly,Weijie Chen,Xinjie He,Xingyu Wang,Andrzej Cichocki
DOI: https://doi.org/10.1109/jiot.2024.3488745
IF: 10.6
2024-01-01
IEEE Internet of Things Journal
Abstract:Brain-Computer interface (BCI) technology enables the control of external devices by recognizing user intentions. Steady-state visual evoked potential (SSVEP)-based BCI technology has been widely applied in the field of Internet of things (IoT) device control, including smart healthcare, smart homes, and robotics, and has achieved significant results. However, as the field of BCI-based IoT device control is still in its development stage, there remains considerable room for improvement in terms of accuracy, efficiency, and cost. Therefore, enhancing the classification accuracy of SSVEP decoding using a short time window, reducing both human and material costs, and improving work efficiency are crucial for the theoretical research and engineering applications of BCI technology in IoT device control. Based on this, we propose a novel approach to address the challenge of high accuracy feature extraction within brief timeframes. Our approach integrates a multi-scale convolutional neural network with a squeeze excitation module (SEMSCNN). This fusion leverages CNNs’ local feature learning capacity and the advantageous feature importance distinction offered by the squeeze excitation mechanism. First, the EEG signals are band-pass filtered into distinct frequency bands and frequency band and channel features are extracted by a two-layer convolution. Then, temporal features are extracted via a multi-branch convolution of different scales. Finally, the squeeze and excitation (SE) module is introduced to learn the interdependence between features to improve the quality of the extracted features. The first stage of training exploits statistical commonalities across research participants by learning the global model, and the second stage fine-tunes each participants features separately by exploiting participant-specific differences in features. We evaluate our SEMSCNN model on two large public datasets, Benchmark and BETA, and we compare our model to other state-of-the-art models in order to evaluate the effectiveness of our proposed network.Our experimental results indicate that our method effectively improves the accuracy of target recognition and information transfer rate under short-duration stimuli, showing a significant advantage compared to other baseline methods. This provides a broad prospect for the practical application of BCIs in the field of IoT.