Abstract:The CEEMDAN algorithm is one of the modern methods used in the analysis of non-stationary signals. This research presents a study of the effectiveness of this method in audio source separation to know the limits of its work. It concluded two conditions related to frequencies and amplitudes of mixed signals to be separated by CEEMDAN. The performance of the algorithm in separating noise from speech and separating speech signals from each other is studied. The research reached a conclusion that CEEMDAN can remove some types of noise from speech (speech improvement), and it cannot separate speech signals from each other (cocktail party). Simulation is done using Matlab environment and Noizeus database.

What problem does this paper attempt to address?

The main problem that this paper attempts to solve is to evaluate the effectiveness of the Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN) algorithm in separating sound sources in an underdetermined environment. Specifically, the research focuses on the following aspects: 1. **Separating Noise and Speech**: Explore the ability of the CEEMDAN algorithm to separate noise and speech from mixed signals, especially its performance under different Signal - to - Noise Ratio (SNR) conditions. 2. **Separating Speech Signals**: Investigate whether the CEEMDAN algorithm can effectively separate two different speech signals. ### Main Problems and Solutions #### 1. Separating Noise and Speech - **Background**: In many practical applications, such as automatic conference transcription, automatic caption generation for audio/video clips, voice interaction in Internet of Things devices, and hearing aids, separating noise and speech is very important. - **Method**: Use the CEEMDAN algorithm to process speech signals containing noise, and evaluate its performance through multiple evaluation metrics (such as SDR, SAR, and PESQ). - **Results**: The research shows that the CEEMDAN algorithm performs well under high SNR conditions, but its performance degrades under low SNR conditions. In addition, this algorithm can provide both the improved speech signal and the separated noise signal during the separation process. #### 2. Separating Speech Signals - **Background**: Separating multiple speech signals in the same channel (i.e., the "cocktail party problem") is a more challenging task. - **Method**: By mixing two different speech signals and inputting them into the CEEMDAN algorithm, observe whether it can effectively separate these signals. - **Results**: Experiments show that the CEEMDAN algorithm cannot effectively separate two different speech signals. This may be because the speech signals have a large overlap in frequency and amplitude, which does not meet the working conditions of the CEEMDAN algorithm. ### Working Conditions of the CEEMDAN Algorithm Through experiments on sine - wave signals, the researchers have summarized two main conditions for the CEEMDAN algorithm to effectively separate signals: 1. **Frequency Ratio**: The frequency ratio of the two signals should be less than 0.6 or greater than 1.6. 2. **Amplitude Ratio**: The amplitude ratio of the two signals should be in the range of [0.3, 3]. ### Conclusion - The CEEMDAN algorithm can effectively separate noise and speech signals under specific conditions, especially under high SNR conditions. - However, this algorithm is not suitable for separating two different speech signals because the speech signals have a large overlap in frequency and amplitude, which does not meet the working conditions of the algorithm. ### Future Prospects - CEEMDAN can be combined with other systems to improve its accuracy in noise separation. - Explore the application of deep - learning methods in speech - signal separation to further improve the separation effect. Through the above research, the author hopes to provide valuable insights into the application of the CEEMDAN algorithm in the field of sound - source separation and make suggestions for future research directions.

Study of the Performance of CEEMDAN in Underdetermined Speech Separation

Using Energy Difference for Speech Separation of Dual-microphone Close-talk System

Study on Speech Signal Denoising Algorithm Based on EMD

On End-to-end Multi-channel Time Domain Speech Separation in Reverberant Environments

Research on ultrasonic signal processing algorithm based on CEEMDAN joint wavelet packet thresholding

Speaker and Direction Inferred Dual-channel Speech Separation

Combined Improved CEEMDAN and Wavelet Transform Sea Wave Interference Suppression

U-NET: A Supervised Approach for Monaural Source Separation

Unseen Noise Estimation Using Separable Deep Auto Encoder for Speech Enhancement

Online speaker diarization of meetings guided by speech separation

Linear versus deep learning methods for noisy speech separation for EEG-informed attention decoding

Empirical mode decomposition and normalshrink tresholding for speech denoising

Analysis of Speech Separation Performance Degradation on Emotional Speech Mixtures

Developing an Effective Training Dataset to Enhance the Performance of AI-based Speaker Separation Systems

Enhanced complete ensemble EMD with superior noise handling capabilities: A robust signal decomposition method for power systems analysis

End-to-end Networks for Supervised Single-channel Speech Separation

Incorporating Phase-Encoded Spectrum Masking into Speaker-Independent Monaural Source Separation

A comprehensive study of speech separation: spectrogram vs waveform separation

Research on ground-penetrating radar denoising algorithm based on CEEMD and Permutation Entropy

Speech Endpoint Identification Based on Empirical Mode Decomposition

Audio-visual End-to-end Multi-channel Speech Separation, Dereverberation and Recognition