Speech Sanitizer: Speech Content Desensitization and Voice Anonymization

Jianwei Qian,Haohua Du,Jiahui Hou,Linlin Chen,Taeho Jung,Xiangyang Li
DOI: https://doi.org/10.1109/tdsc.2019.2960239
2021-01-01
IEEE Transactions on Dependable and Secure Computing
Abstract:Voice input users' speech recordings are being collected by service providers and shared with third parties, who may abuse users' voiceprints, identify them by voice, and learn their sensitive speech content. In this work, we design Speech Sanitizer to perturb users' speech recordings so that the sanitized speech can be safely shared with third parties. First, we desensitize speech content by identifying sensitive words, localizing them in the audio using DTW-based keyword spotting, and substituting them with safe words. Both common and personalized sensitive words are identified and replaced. Then, we anonymize users' voiceprints with a carefully designed voice conversion mechanism that is resistant to de-anonymization attacks. Meanwhile, we try to preserve the utility of the sanitized speech, measured by the accuracy of speech recognition performed on it. We implement Speech Sanitizer and present extensive experimental results that validate the effectiveness and efficiency of our algorithms. It is demonstrated that we are able to reduce the chance of a user's voice being identified from 50 people by 83.7 percent while keeping the drop of speech recognition accuracy within 19.1 percent. We can also easily relax the privacy level to improve speech recognition accuracy.
What problem does this paper attempt to address?