Blind Source Separation of Single-Channel Mixtures via Multi-Encoder Autoencoders

Matthew B. Webster,Joonnyong Lee
2024-03-09
Abstract:The task of blind source separation (BSS) involves separating sources from a mixture without prior knowledge of the sources or the mixing system. Single-channel mixtures and non-linear mixtures are a particularly challenging problem in BSS. In this paper, we propose a novel method for addressing BSS with single-channel non-linear mixtures by leveraging the natural feature subspace specialization ability of multi-encoder autoencoders. During the training phase, our method unmixes the input into the separate encoding spaces of the multi-encoder network and then remixes these representations within the decoder for a reconstruction of the input. Then to perform source inference, we introduce a novel encoding masking technique whereby masking out all but one of the encodings enables the decoder to estimate a source signal. To this end, we also introduce a sparse mixing loss that encourages sparse remixing of source encodings throughout the decoder and a so-called zero reconstruction loss on the decoder for coherent source estimations. To analyze and evaluate our method, we conduct experiments on a toy dataset, designed to demonstrate this property of feature subspace specialization, and with real-world biosignal recordings from a polysomnography sleep study for extracting respiration from electrocardiogram and photoplethysmography signals.
Signal Processing,Machine Learning
What problem does this paper attempt to address?
The paper aims to address the problem of Blind Source Separation (BSS) in single-channel mixed signals, particularly for nonlinear mixtures. Specifically, the authors propose a novel method that utilizes multi-encoder autoencoders to handle the task of blind source separation for single-channel nonlinear mixed signals. During the training phase, the method decomposes the input signal into different encoding spaces through multiple encoders, and then the decoder recombines these representations to reconstruct the original input. For source signal estimation, the authors introduce a novel encoding masking technique, where only the output of one encoder is retained during inference, while the outputs of other encoders are masked. This allows the decoder to estimate the source signal associated with the active encoder. Additionally, two regularization loss functions are proposed: sparse mixing loss and zero reconstruction loss. The former encourages sparse mixing of source signals during the decoding process, while the latter ensures that the contribution of masked source signals to the final output is minimized. The paper evaluates the performance of the proposed method on two datasets: one is a synthetic dataset composed of triangular and circular shapes, used to intuitively demonstrate the effectiveness of the method; the other is a real-world dataset containing electrocardiogram (ECG) and photoplethysmogram (PPG) signals obtained from polysomnography studies, with the aim of extracting respiratory signals. Experimental results show that the proposed multi-encoder autoencoder method can effectively separate these source signals and outperforms existing heuristic and supervised methods in real-world application scenarios.