Abstract:Simply by rearranging the regions of an image, we can create a new image of any subject matter. The definition of regions is user definable, ranging from regularly and irregularly-shaped blocks, concentric rings, or even individual pixels. Our method extends and improves recent work in the generation of optical illusions by simultaneously learning not only the content of the images, but also the parameterized transformations required to transform the desired images into each other. By learning the image transforms, we allow any source image to be pre-specified; any existing image (e.g. the Mona Lisa) can be transformed to a novel subject. We formulate this process as a constrained optimization problem and address it through interleaving the steps of image diffusion with an energy minimization step. Unlike previous methods, increasing the number of regions actually makes the problem easier and improves results. We demonstrate our approach in both pixel and latent spaces. Creative extensions, such as using infinite copies of the source image and employing multiple source images, are also given.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is how to generate completely new images by rearranging regions (such as tiles) in an image. Specifically, the researchers hope to develop a method that can not only learn the content of the image, but also learn the parametric transformations required to convert a known source image into a new theme. This enables any existing image (for example, "Mona Lisa") to be transformed into a new theme image through predefined transformations. ### Core of the Problem 1. **Generating New Images from Static Source Images**: Given a source image (such as a work of art), the goal is to create new images depicting completely different themes, but only using the tiles in the source image. 2. **Dynamically Discovering Transformations**: Most previous work relied on pre - specified transformations, while this paper proposes a method that can dynamically discover the optimal transformation during the generation process. 3. **Constrained Optimization Problem**: When generating a new image, the pixel values of the source image must be strictly adhered to, and the pixel colors cannot be changed arbitrarily. This means that the best match needs to be found among the limited tile combinations to generate a high - quality new image. ### Main Challenges - **Dynamic Matching Problem**: How to dynamically adjust the arrangement of tiles during the generation process to meet the requirements of different themes. - **Optimization under Constraints**: How to generate a new image that meets expectations while keeping the pixel values of the source image unchanged. - **Extension to More Tiles and More Complex Transformations**: Increasing the number of tiles and the complexity of transformations can improve the quality of the results instead of causing performance degradation as in other methods. ### Solution The researchers proposed a framework that alternates between image denoising and energy minimization by combining the diffusion model and the energy minimization step. They introduced dynamic matching techniques, used the Hungarian algorithm (Kuhn - Munkres algorithm) to dynamically adjust the arrangement of tiles, and optimized the early noise - adding strategy through the rollout mechanism to ensure the stability and efficiency of the generation process. ### Experimental Verification The paper shows multiple experimental results, proving the effectiveness of this method. In particular, when using complex source images (such as famous paintings), the quality of the generated new images is significantly improved, and the generation effect is better as the number of tiles increases. ### Summary This paper solves the problem of how to generate new images from static source images. Especially under strict constraints, it realizes high - quality new image generation by dynamically discovering the optimal transformation. This method is not only visually appealing but also has broad application potential in the fields of art creation and image processing.

Making Images from Images: Interleaving Denoising and Transformation

Novel 3D-Aware Composition Images Synthesis for Object Display with Diffusion Model.

Selective Image Abstraction

Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models

Interpolating between Images with Diffusion Models

Guided Image Synthesis via Initial Image Editing in Diffusion Model

Enhancing Conditional Image Generation with Explainable Latent Space Manipulation

SyncDiffusion: Coherent Montage via Synchronized Joint Diffusions

Factorized Diffusion: Perceptual Illusions by Noise Decomposition

Intelligent Painter: Picture Composition With Resampling Diffusion Model

Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models

Diffusion Illusions: Hiding Images in Plain Sight

Coherent and Multi-modality Image Inpainting via Latent Space Optimization

Generative Photomontage

SpotDiffusion: A Fast Approach For Seamless Panorama Generation Over Time

Factorized Diffusion Architectures for Unsupervised Image Generation and Segmentation

Arbitrary-Scale Image Generation and Upsampling using Latent Diffusion Model and Implicit Neural Decoder

Towards Coherent Image Inpainting Using Denoising Diffusion Implicit Models

PixelSynth: Generating a 3D-Consistent Experience from a Single Image

ReNoise: Real Image Inversion Through Iterative Noising

Move Anything with Layered Scene Diffusion