Abstract: Voice data generated on instant messaging or social media applications contains unique user voiceprints that may be abused by malicious adversaries for identity inference or identity theft. Existing voice anonymization techniques, e.g., signal processing and voice conversion/synthesis, degrade perceptual quality. In this paper, we develop a voice anonymization system, named V-Cloak, which attains real-time voice anonymization while preserving the intelligibility, naturalness and timbre of the audio. The anonymizer is a one-shot generative model that modulates the features of the original audio at different frequency levels. We train the anonymizer with a carefully designed loss function that combines an anonymity loss with an intelligibility loss and a psychoacoustics-based naturalness loss. The anonymizer supports both untargeted and targeted anonymization to achieve the anonymity goals of unidentifiability and unlinkability. We have conducted extensive experiments on four datasets, i.e., LibriSpeech (English), AISHELL (Chinese), CommonVoice (French) and CommonVoice (Italian), five Automatic Speaker Verification (ASV) systems (two DNN-based, two statistical and one commercial), and eleven Automatic Speech Recognition (ASR) systems (for different languages). Experimental results confirm that V-Cloak outperforms five baselines in terms of anonymity performance. We also demonstrate that V-Cloak trained only on the VoxCeleb1 dataset against ECAPA-TDNN ASV and DeepSpeech2 ASR has transferable anonymity against other ASVs and cross-language intelligibility for other ASRs. Furthermore, we verify the robustness of V-Cloak against various de-noising techniques and adaptive attacks. Hopefully, V-Cloak may provide a cloak for us in a prism world.
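
To make the three-part training objective concrete, the following is a minimal sketch of how an anonymity term, an intelligibility term and a naturalness term might be combined. The abstract does not specify the form of any of these terms, so everything here is an assumption made for illustration: the cosine-similarity anonymity term, the weighted-sum combination, and the names `anonymity_loss`, `total_loss`, `w_intel` and `w_nat` are hypothetical and are not taken from V-Cloak.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two speaker-embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def anonymity_loss(anon_emb, orig_emb, target_emb=None):
    """Hypothetical anonymity term (an assumption, not the paper's definition).

    Untargeted: penalize similarity to the original speaker.
    Targeted: penalize distance from a chosen target speaker.
    """
    if target_emb is None:
        return cosine_similarity(anon_emb, orig_emb)
    return 1.0 - cosine_similarity(anon_emb, target_emb)


def total_loss(anonymity, intelligibility, naturalness, w_intel=1.0, w_nat=1.0):
    """Assumed weighted sum of the three terms named in the abstract."""
    return anonymity + w_intel * intelligibility + w_nat * naturalness


# Untargeted case: the anonymized embedding is orthogonal to the original,
# so the assumed anonymity term contributes nothing to the total.
anon = anonymity_loss([1.0, 0.0], [0.0, 1.0])
print(total_loss(anon, intelligibility=0.2, naturalness=0.1))
```

In this sketch the untargeted mode corresponds to moving away from the original voiceprint, and the targeted mode to moving toward a chosen one; the weights trade anonymity against audio quality.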

IncogniText: Privacy-enhancing Conditional Text Anonymization via LLM-based Private Attribute Randomization

Robust Utility-Preserving Text Anonymization Based on Large Language Models

$A^{4}NT$: Author Attribute Anonymity by Adversarial Training of Neural Machine Translation

Adanonymizer: Interactively Navigating and Balancing the Duality of Privacy and Output Performance in Human-LLM Interaction

Latent Diffusion Models for Attribute-Preserving Image Anonymization

Keep It Private: Unsupervised Privatization of Online Text

Textwash -- automated open-source text anonymisation

Neural Text Sanitization with Explicit Measures of Privacy Risk

Evaluating the Efficacy of AI Techniques in Textual Anonymization: A Comparative Study

TextObfuscator: Making Pre-trained Language Model a Privacy Protector via Obfuscating Word Representations

Towards Quantifying The Privacy Of Redacted Text

Textual Differential Privacy for Context-Aware Reasoning with Large Language Model

Neural Text Sanitization with Privacy Risk Indicators: An Empirical Analysis

Anonymization Prompt Learning for Facial Privacy-Preserving Text-to-Image Generation

Dynamic Anonymization for Marginal Publication

Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Contextual Integrity Theory

V-Cloak: Intelligibility-, Naturalness- & Timbre-Preserving Real-Time Voice Anonymization

Evaluating the disclosure risk of anonymized documents via a machine learning-based re-identification attack

Context-Aware Full Body Anonymization using Text-to-Image Diffusion Models

NAP^2: A Benchmark for Naturalness and Privacy-Preserving Text Rewriting by Learning from Human

Conditional Anonymity with Non-Probabilistic Adversary