Abstract:The growing use of voice user interfaces has led to a surge in the collection and storage of speech data. While data collection allows for the development of efficient tools powering most speech services, it also poses serious privacy issues for users as centralized storage makes private personal speech data vulnerable to cyber threats. With the increasing use of voice-based digital assistants like Amazon's Alexa, Google's Home, and Apple's Siri, and with the increasing ease with which personal speech data can be collected, the risk of malicious use of voice-cloning and speaker/gender/pathological/etc. recognition has increased. This thesis proposes solutions for anonymizing speech and evaluating the degree of the anonymization. In this work, anonymization refers to making personal speech data unlinkable to an identity while maintaining the usefulness (utility) of the speech signal (e.g., access to linguistic content). We start by identifying several challenges that evaluation protocols need to consider to evaluate the degree of privacy protection properly. We clarify how anonymization systems must be configured for evaluation purposes and highlight that many practical deployment configurations do not permit privacy evaluation. Furthermore, we study and examine the most common voice conversion-based anonymization system and identify its weak points before suggesting new methods to overcome some limitations. We isolate all components of the anonymization system to evaluate the degree of speaker PPI associated with each of them. Then, we propose several transformation methods for each component to reduce as much as possible speaker PPI while maintaining utility. We promote anonymization algorithms based on quantization-based transformation as an alternative to the most-used and well-known noise-based approach. Finally, we endeavor a new attack method to invert anonymization.

Speech Sanitizer: Speech Content Desensitization and Voice Anonymization

VoiceMask: Anonymize and Sanitize Voice Input on Mobile Devices

MicPro: Microphone-based Voice Privacy Protection

PDAssess: A Privacy-preserving Free-speech Based Parkinson's Disease Daily Assessment System

Anonymizing Speech: Evaluating and Designing Speaker Anonymization Techniques

Hidebehind: Enjoy Voice Input with Voiceprint Unclonability and Anonymity.

Speaker Anonymization for Personal Information Protection Using Voice Conversion Techniques

WaveFuzz: A Clean-Label Poisoning Attack to Protect Your Voice

V-Cloak: Intelligibility-, Naturalness- & Timbre-Preserving Real-Time Voice Anonymization

Privacy-Utility Balanced Voice De-Identification Using Adversarial Examples

VoiceCloak

A Non-intrusive and Adaptive Speaker De-Identification Scheme Using Adversarial Examples

NPU-NTU System for Voice Privacy 2024 Challenge

Two-Stage Voice Anonymization for Enhanced Privacy

On the Impact of Voice Anonymization on Speech Diagnostic Applications: a Case Study on COVID-19 Detection

Preserving spoken content in voice anonymisation with character-level vocoder conditioning

On the Generation and Removal of Speaker Adversarial Perturbation for Voice-Privacy Protection

Anonymising Elderly and Pathological Speech: Voice Conversion Using DDSP and Query-by-Example

Towards Privacy-Preserving Speech Data Publishing.

Adversarial speech for voice privacy protection from Personalized Speech generation

Improving Voice Conversion for Dissimilar Speakers Using Perceptual Losses