A watermark detection scheme based on non-parametric model applied to mute machine voice
Yangxia Hu,Wenhuan Lu,Jianguo Wei,Junhai Xu,Maode Ma
DOI: https://doi.org/10.1007/s11042-023-15572-x
IF: 2.577
2023-05-05
Multimedia Tools and Applications
Abstract:With the development of artificial intelligence and human-computer interaction, performance of man-machine voice dialogue system is becoming better and better. We proposed a new watermark detection method based on non-parametric model to mute machine voice when there are two or more robots around. We took a random sequence composed of 1 and − 1 as watermark in our experiment. In the embedding process, we modeled coefficients of speech frames after 3-level DWT (Discrete wavelet transform) though KDE (Kernel Density Estimation) of non-parametric test, and in watermark detection process, we designed a detector of ML (Maximum Likelihood), and calculated decision threshold by Neyman-Pearson criterion. We found proposed detector could respond when test speech signal was watermarked, and could further mute machine voice. We calculated the theoretical detection rates with false alarm rates from 0 to 1, and compared the theoretical values with experimental values. We found experimental values were very close to theoretical values, and they were almost close to 1 when false alarm rates were above 0.3. Compared with existing synthetic speech detection algorithms, our proposal was simpler and cost less, and was appropriate to detect watermark based on small samples. And our algorithm had a good imperceptibility and robustness, and average detection rates were all above 98% for some common noise attacks.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering