Abstract:Neural style transfer is used as an optimization technique that combines two different images – a content image and a style reference image – to produce an output image that retains the appearance of the content image but has been modified to match the actual style of the style reference image. This is achieved by fine-tuning the output image to match the style reference images and the statistics for both content and style in the content image. These statistics are extracted from the images using a convolutional network. Primitive models such as WCT were improved upon by models such as PhotoWCT, whose spatial and temporal limitations were improved upon by Deep Photo Style Transfer. Eventually, wavelet transforms were introduced to perform photorealistic style transfer. A wavelet-corrected transfer based on whitening and colouring transforms, i.e., WCT2, was proposed that allowed the preservation of core content and eliminated the need for any post-processing steps and constraints. A model called Domain-Aware Universal Style Transfer also came into the picture. It supported both artistic and photorealistic style transfer. This study provides an overview of the neural style transfer technique. The recent advancements and improvements in the field, including the development of multi-scale and adaptive methods and the integration of semantic segmentation, are discussed and elaborated upon. Experiments have been conducted to determine the roles of encoder-decoder architecture and Haar wavelet functions. The optimum levels at which these can be leveraged for effective style transfer are ascertained. The study also highlights the contrast between VGG-16 and VGG-19 structures and analyzes various performance parameters to establish which works more efficiently for particular use cases. On comparing quantitative metrics across Gatys, AdaIN, and WCT, a gradual upgrade was seen across the models, as AdaIN was performing 99.92 percent better than the primitive Gatys model in terms of processing time. Over 1000 iterations, we found that VGG-16 and VGG-19 have comparable style loss metrics, but there is a difference of 73.1 percent in content loss. VGG-19, however, is displaying a better overall performance since it can keep both content and style losses at bay.

Deep Learning-Based Application of Image Style Transfer

Style Fader Generative Adversarial Networks for Style Degree Controllable Artistic Style Transfer

Optimal Transport of Deep Feature for Image Style Transfer

Creative and Diverse Artwork Generation Using Adversarial Networks

Artistic Style Transfer with Internal-external Learning and Contrastive Learning

Learning Structure-Aware Transformations for Arbitrary Image Style Transfer

GLStyleNet: Exquisite Style Transfer Combining Global and Local Pyramid Features

AdaCM: Adaptive ColorMLP for Real-Time Universal Photo-realistic Style Transfer

Digital Image Art Style Transfer Algorithm Based on CycleGAN

A model integrating attention mechanism and generative adversarial network for image style transfer

Gated-GAN: Adversarial Gated Networks for Multi-Collection Style Transfer

Progress and Impediments in Deep Learning-Driven Image Style Transfer

Research on Image Generation and Style Transfer Algorithm Based on Deep Learning

A Generative Adversarial Network AMS-CycleGAN for Multi-Style Image Transformation

Image neural style transfer: A review

Loss functions for Style Transfer with CycleGAN

A comparison of art style transfer in Cycle-GAN based on different generators

Image Neural Style Transfer with Preserving the Salient Regions.

A deep learning-based neural style transfer optimization approach

Deep Learning-Based Image Style Transformation Research on Landscape Paintings of Wei, Jin and North-South Dynasties

Contour-enhanced CycleGAN framework for style transfer from scenery photos to Chinese landscape paintings