Abstract: The rapid development of generative diffusion models has significantly advanced the field of style transfer. However, most current diffusion-based style transfer methods rely on a slow iterative optimization process, e.g., model fine-tuning or textual inversion of the style concept. In this paper, we introduce FreeStyle, a style transfer method built upon a pre-trained large diffusion model that requires no further optimization. Moreover, our method performs style transfer from a text description of the desired style alone, eliminating the need for style images. Specifically, we propose a dual-stream encoder and single-stream decoder architecture that replaces the conventional U-Net in diffusion models. In the dual-stream encoder, two distinct branches take the content image and the style text prompt as inputs, decoupling content from style. In the decoder, we further modulate the features from the two streams according to the given content image and the corresponding style text prompt for precise style transfer. Our experimental results demonstrate high-quality synthesis and high fidelity across various content images and style text prompts. Compared with state-of-the-art methods that require training, FreeStyle notably reduces the computational burden, saving thousands of iterations, while achieving comparable or superior performance across multiple evaluation metrics, including CLIP Aesthetic Score, CLIP Score, and Preference. We have released the code anonymously at: https://anonymous.4open.science/r/FreeStyleAnonymous-0F9B
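The data flow described in the abstract (two encoder streams, one decoder, no optimization loop) can be illustrated with a toy sketch. This is not the released implementation: the scalar "layers", the function names `encode`, `decode`, and `stylize`, the noise input to the style stream, and the modulation strengths `b` and `s` are all hypothetical stand-ins chosen for illustration.

```python
import math

def encode(x, cond, weights):
    """Toy encoder shared by both streams; returns the features of every level."""
    feats, h = [], list(x)
    for w in weights:
        # cond is a scalar stand-in for text conditioning (an assumption)
        h = [math.tanh(w * v + cond) for v in h]
        feats.append(h)
    return feats

def decode(content_feats, style_feats, weights, b=1.0, s=0.5):
    """Toy single-stream decoder that mixes both streams at every level.

    b and s are hypothetical modulation strengths for the content and
    style features; they are not values taken from the paper.
    """
    h = content_feats[-1]  # start from the deepest content features
    levels = zip(weights, reversed(content_feats), reversed(style_feats))
    for w, f_c, f_s in levels:
        h = [math.tanh(w * hv + b * cv + s * sv)
             for hv, cv, sv in zip(h, f_c, f_s)]
    return h

def stylize(content, style_cond, noise, enc_w, dec_w, b=1.0, s=0.5):
    """Run both encoder streams once, then decode; nothing is trained."""
    content_feats = encode(content, 0.0, enc_w)     # content stream: image only
    style_feats = encode(noise, style_cond, enc_w)  # style stream: text-conditioned
    return decode(content_feats, style_feats, dec_w, b, s)

content = [0.2, -0.4, 0.9]  # stand-in for a content image latent
noise = [0.1, 0.3, -0.2]    # assumed input of the style stream
enc_w = [0.8, 1.1]          # frozen "pre-trained" weights, never updated
dec_w = [0.9, 0.7]

out_a = stylize(content, 0.5, noise, enc_w, dec_w)          # style prompt A
out_b = stylize(content, -0.5, noise, enc_w, dec_w)         # style prompt B
out_c = stylize(content, 0.5, noise, enc_w, dec_w, s=0.0)   # style switched off
out_d = stylize(content, -0.5, noise, enc_w, dec_w, s=0.0)
```

In this sketch, changing the style conditioning changes the output while the content stream stays fixed, and setting `s` to zero removes the style stream's influence entirely. That is the sense in which the two streams are decoupled here; how the actual method weights the two feature sets is not specified in the abstract.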

UATST: Towards Unpaired Arbitrary Text-Guided Style Transfer with Cross-Space Modulation

TeSTNeRF: Text-Driven 3D Style Transfer Via Cross-Modal Learning

Artistic Style Transfer with Internal-external Learning and Contrastive Learning

Correlation-based and Content-Enhanced Network for Video Style Transfer

Diverse Image Style Transfer Via Invertible Cross-Space Mapping

Style Permutation for Diversified Arbitrary Style Transfer

Learning Structure-Aware Transformations for Arbitrary Image Style Transfer

A Unified Arbitrary Style Transfer Framework via Adaptive Contrastive Learning

ITstyler: Image-optimized Text-based Style Transfer

Name Your Style: An Arbitrary Artist-aware Image Style Transfer

TextStyler: A CLIP-based approach to text-guided style transfer

Bridging Text and Image for Artist Style Transfer via Contrastive Learning

StyleStudio: Text-Driven Style Transfer with Selective Control of Style Elements

CLAST: Contrastive Learning for Arbitrary Style Transfer

Unified Style Transfer

Language-Driven Image Style Transfer

CLIPstyler: Image Style Transfer with a Single Text Condition

Style Transfer as Unsupervised Machine Translation

Style Transfer in Text: Exploration and Evaluation

MSSRNet: Manipulating Sequential Style Representation for Unsupervised Text Style Transfer

FreeStyle: Free Lunch for Text-guided Style Transfer using Diffusion Models