Abstract:In the rapidly evolving landscape of artificial intelligence, generative models such as Generative Adversarial Networks (GANs) and Diffusion Models have become cornerstone technologies, driving innovation in diverse fields from art creation to healthcare. Despite their potential, these models face the significant challenge of data memorization, which poses risks to privacy and the integrity of generated content. Among various metrics of memorization detection, our study delves into the memorization scores calculated from encoder layer embeddings, which involves measuring distances between samples in the embedding spaces. Particularly, we find that the memorization scores calculated from layer embeddings of Vision Transformers (ViTs) show an notable trend - the latter (deeper) the layer, the less the memorization measured. It has been found that the memorization scores from the early layers' embeddings are more sensitive to low-level memorization (e.g. colors and simple patterns for an image), while those from the latter layers are more sensitive to high-level memorization (e.g. semantic meaning of an image). We also observe that, for a specific model architecture, its degree of memorization on different levels of information is unique. It can be viewed as an inherent property of the architecture. Building upon this insight, we introduce a unique fingerprinting methodology. This method capitalizes on the unique distributions of the memorization score across different layers of ViTs, providing a novel approach to identifying models involved in generating deepfakes and malicious content. Our approach demonstrates a marked 30% enhancement in identification accuracy over existing baseline methods, offering a more effective tool for combating digital misinformation.

A Geometric Framework for Understanding Memorization in Generative Models

Losing dimensions: Geometric memorization in generative diffusion

Understanding Memorization in Generative Models via Sharpness in Probability Landscapes

Embedding Space Selection for Detecting Memorization and Fingerprinting in Generative Models

Understanding (Un)Intended Memorization in Text-to-Image Generative Models

MemoryGAN: GAN Generator As Heterogeneous Memory for Compositional Image Synthesis

A Multi-Perspective Analysis of Memorization in Large Language Models

Memory Triggers: Unveiling Memorization in Text-To-Image Generative Models through Word-Level Duplication

Learning to Generate with Memory

SoK: Memorization in General-Purpose Large Language Models

Generative Modeling with Explicit Memory

MFS: A Brain-Inspired Memory Formation System for GAN

Understanding the Local Geometry of Generative Model Manifolds

An Inversion-based Measure of Memorization for Diffusion Models

Generating Memorable Images Based on Human Visual Memory Schemas

Uncovering Latent Memories: Assessing Data Leakage and Memorization Patterns in Frontier AI Models

Generalized Clustering and Multi-Manifold Learning with Geometric Structure Preservation

On Memorization in Diffusion Models

Investigating Data Memorization in 3D Latent Diffusion Models for Medical Image Synthesis

Detecting, Explaining, and Mitigating Memorization in Diffusion Models

Towards a Theoretical Understanding of Memorization in Diffusion Models