Study of the Performance of CEEMDAN in Underdetermined Speech Separation

Rawad Melhem,Riad Hamadeh,Assef Jafar
2024-11-18
Abstract:The CEEMDAN algorithm is one of the modern methods used in the analysis of non-stationary signals. This research presents a study of the effectiveness of this method in audio source separation to know the limits of its work. It concluded two conditions related to frequencies and amplitudes of mixed signals to be separated by CEEMDAN. The performance of the algorithm in separating noise from speech and separating speech signals from each other is studied. The research reached a conclusion that CEEMDAN can remove some types of noise from speech (speech improvement), and it cannot separate speech signals from each other (cocktail party). Simulation is done using Matlab environment and Noizeus database.
Sound,Artificial Intelligence,Audio and Speech Processing
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is to evaluate the effectiveness of the Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN) algorithm in separating sound sources in an underdetermined environment. Specifically, the research focuses on the following aspects: 1. **Separating Noise and Speech**: Explore the ability of the CEEMDAN algorithm to separate noise and speech from mixed signals, especially its performance under different Signal - to - Noise Ratio (SNR) conditions. 2. **Separating Speech Signals**: Investigate whether the CEEMDAN algorithm can effectively separate two different speech signals. ### Main Problems and Solutions #### 1. Separating Noise and Speech - **Background**: In many practical applications, such as automatic conference transcription, automatic caption generation for audio/video clips, voice interaction in Internet of Things devices, and hearing aids, separating noise and speech is very important. - **Method**: Use the CEEMDAN algorithm to process speech signals containing noise, and evaluate its performance through multiple evaluation metrics (such as SDR, SAR, and PESQ). - **Results**: The research shows that the CEEMDAN algorithm performs well under high SNR conditions, but its performance degrades under low SNR conditions. In addition, this algorithm can provide both the improved speech signal and the separated noise signal during the separation process. #### 2. Separating Speech Signals - **Background**: Separating multiple speech signals in the same channel (i.e., the "cocktail party problem") is a more challenging task. - **Method**: By mixing two different speech signals and inputting them into the CEEMDAN algorithm, observe whether it can effectively separate these signals. - **Results**: Experiments show that the CEEMDAN algorithm cannot effectively separate two different speech signals. This may be because the speech signals have a large overlap in frequency and amplitude, which does not meet the working conditions of the CEEMDAN algorithm. ### Working Conditions of the CEEMDAN Algorithm Through experiments on sine - wave signals, the researchers have summarized two main conditions for the CEEMDAN algorithm to effectively separate signals: 1. **Frequency Ratio**: The frequency ratio of the two signals should be less than 0.6 or greater than 1.6. 2. **Amplitude Ratio**: The amplitude ratio of the two signals should be in the range of [0.3, 3]. ### Conclusion - The CEEMDAN algorithm can effectively separate noise and speech signals under specific conditions, especially under high SNR conditions. - However, this algorithm is not suitable for separating two different speech signals because the speech signals have a large overlap in frequency and amplitude, which does not meet the working conditions of the algorithm. ### Future Prospects - CEEMDAN can be combined with other systems to improve its accuracy in noise separation. - Explore the application of deep - learning methods in speech - signal separation to further improve the separation effect. Through the above research, the author hopes to provide valuable insights into the application of the CEEMDAN algorithm in the field of sound - source separation and make suggestions for future research directions.