Abstract:A common problem in neural recordings is the low signal-to-noise ratio (SNR), particularly when using non-invasive techniques like magneto- or electroencephalography (M/EEG). To address this problem, experimental designs often include repeated trials, which are then averaged to improve the SNR or to infer statistics that can be used in the design of a denoising spatial filter. However, collecting enough repeated trials is often impractical and even impossible in some paradigms, while analyses on existing data sets may be hampered when these do not contain such repeated trials. Therefore, we present a data-driven method that takes advantage of the knowledge of the presented stimulus, to achieve a joint noise reduction and dimensionality reduction without the need for repeated trials. The method first estimates the stimulus-driven neural response using the given stimulus, which is then used to find a set of spatial filters that maximize the SNR based on a generalized eigenvalue decomposition. As the method is fully data-driven, the dimensionality reduction enables researchers to perform their analyses without having to rely on their knowledge of brain regions of interest, which increases accuracy and reduces the human factor in the results. In the context of neural tracking of a speech stimulus using EEG, our method resulted in more accurate short-term temporal response function (TRF) estimates, higher correlations between predicted and actual neural responses, and higher attention decoding accuracies compared to existing TRF-based decoding methods. We also provide an extensive discussion on the central role played by the generalized eigenvalue decomposition in various denoising methods in the literature, and address the conceptual similarities and differences with our proposed method.

Spatially Selective Deep Non-linear Filters for Speaker Extraction

Multi-channel Speech Separation Using Spatially Selective Deep Non-linear Filters

Temporal-Spatial Neural Filter: Direction Informed End-to-End Multi-channel Target Speech Separation

Exploiting spatial information with the informed complex-valued spatial autoencoder for target speaker extraction

SpatialNet: Extensively Learning Spatial Information for Multichannel Joint Speech Separation, Denoising and Dereverberation

Spotforming: spatial filtering with distributed arrays for position-selective sound acquisition

Robust Spatial Filtering Network for Separating Speech in the Direction of Interest

Informed spatial filtering for sound extraction using distributed microphone arrays

Deep informed spatio-spectral filtering for multi-channel speech extraction against steering vector uncertainties

Spatially constrained vs. unconstrained filtering in neural spatiospectral filters for multichannel speech enhancement

Localizing Spatial Information in Neural Spatiospectral Filters

3S-TSE: Efficient Three-Stage Target Speaker Extraction for Real-Time and Low-Resource Applications

Analysis of spatial filtering in neural spatiospectral filters and its dependence on training target characteristics

Mask-Weighted Spatial Likelihood Coding for Speaker-Independent Joint Localization and Mask Estimation

Selective Listening by Synchronizing Speech with Lips

Binaural Selective Attention Model for Target Speaker Extraction

Dual-Channel Target Speaker Extraction Based on Conditional Variational Autoencoder and Directional Information

Enhanced Neural Beamformer with Spatial Information for Target Speech Extraction

Target Speaker Extraction by Directly Exploiting Contextual Information in the Time-Frequency Domain

PlumberNet: Fixing interference leakage after GEV beamforming

Stimulus-aware spatial filtering for single-trial neural response and temporal response function estimation in high-density EEG with applications in auditory research