Maximum likelihood based estimation with quasi oppositional chemical reaction optimization algorithm for speech signal enhancement

Inderjeet Kaur,Vinay Kumar Nassa,T. Kavitha,Prakash Mohan,S. Velmurugan
DOI: https://doi.org/10.1007/s41870-022-01032-6
2022-07-18
International Journal of Information Technology
Abstract:In recent times, speech enhancement (SE) becomes a significant process in the field of speech signal processing. Since the speech signal in real time gets affected by background noise, the efficacy of the speech-based applications gets severely affected. The SE techniques have gained considerable attention in the usage of speaker recognition, video conferences, speech broadcast, speech-enabled biometric systems, smartphones, hearing aid, and so on. Numerous conventional SE and machine learning (ML) models have been employed for processing and removing the additive noise from the speech signal. This paper presents a novel Maximum Likelihood Based Estimation with Quasi Oppositional Chemical Reaction Optimization (MLE-QOCRO) algorithm for speech signal enhancement. The presented MLE-QOCRO algorithm follows three-stage processes namely preprocessing, mask generation, and spectral filtering. Besides, the mask generation process involves QOCRO algorithm to classify the speech signal into noise or speech frames. In addition, MLE based spectral filtering technique is employed at the final stage to get enhanced speech signals. A set of experiments were performed to highlight the improved performance of the proposed MLE-QOCRO algorithm, and the results are investigated in terms of several performance measures. Speech signal augmentation using the unique MLE-QOCRO algorithm, which combines maximum likelihood estimation with quasi-opposing chemical reaction optimization, is the subject of this paper. The QOCRO algorithm is used in the mask generation process in the model to distinguish between noise and speech frames in the input speech signal. A spectral filtering method based on MLE is also utilized to improve the voice signal at the very end. The resultant experimental values pointed out the enhanced outcome of the proposed MLE-QOCRO algorithm over the other compared techniques.
What problem does this paper attempt to address?