Abstract: The architecture is based on a multi-decomposition GAN model. The network learns a decomposition of the generated image's features. The texture representation extracts the high-frequency texture features of the generated image so that they conform to the characteristics of cartoon images; it is guided by the discriminator, which further enhances the cartoon texture of the generated image. The structure-content representation gives the generated image sparse color blocks and clear edges by mimicking the global content of cartoon images, and it preserves the structure and content of the original image with the help of a pre-trained network.

Background: Cartoon images play a vital role in film production, scientific and educational animation, video games, and other fields, and are one of the key visual expressions of artistic creation. Because hand-crafted cartoon images demand a great deal of time and effort from professional artists, a way to automatically transform real-world images into cartoon images of different styles is needed. Although cartoon styles vary from artist to artist, cartoon images are generally highly simplified and abstract, with clear edges, smooth color shading, and relatively simple textures. Existing image cartoonization methods, however, tend to introduce two main problems during style transfer: (1) the generated images lack obvious cartoon-style textures; and (2) the generated images are prone to structural confusion, color artifacts, and loss of the original image content. Striking a good balance between style transfer and content preservation therefore remains a major challenge in image cartoonization.

Methods: In this paper, we propose a GAN-based multi-attention mechanism for image cartoonization to address the above issues.
The method combines the residual blocks that extract deep features in the generator with an attention mechanism. The adaptive feature correction performed by the attention module strengthens the generative model's perception of cartoon images and improves the cartoon features of the generated images. We also introduce an attention mechanism into the convolution blocks of the discriminator to further reduce the visual quality problems caused by the style transfer process. With attention in both the generator and the discriminator of the generative adversarial network, the generated images show clear cartoon-style features while their visual quality improves.

Results: Extensive quantitative, qualitative, and ablation experiments demonstrate the advantages of our method for image cartoonization and the role of each module in the method.
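
To make the idea of pairing a residual block with an attention module concrete, here is a minimal sketch in plain Python. The abstract does not say which kind of attention is used, so this assumes a simple channel attention (a squeeze-and-excitation style gate). All names (`channel_attention`, `attention_residual_block`, `transform`, `weights`, `bias`) are hypothetical, and `transform` merely stands in for the convolution layers of a real residual block.

```python
import math

def global_avg_pool(feat):
    """feat is a list of C channels, each an H x W nested list; returns C means."""
    return [sum(sum(row) for row in ch) / (len(ch) * len(ch[0])) for ch in feat]

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def channel_attention(feat, weights, bias):
    """Rescale each channel by a gate in (0, 1) computed from pooled statistics.

    weights (C x C) and bias (length C) are assumed learned parameters.
    """
    pooled = global_avg_pool(feat)
    gates = [sigmoid(sum(w * p for w, p in zip(row, pooled)) + b)
             for row, b in zip(weights, bias)]
    return [[[g * v for v in row] for row in ch] for g, ch in zip(gates, feat)]

def attention_residual_block(feat, transform, weights, bias):
    """Compute x + attention(transform(x)), i.e. a skip connection around
    an attention-corrected feature branch."""
    branch = channel_attention(transform(feat), weights, bias)
    return [[[x + y for x, y in zip(rx, ry)] for rx, ry in zip(cx, cy)]
            for cx, cy in zip(feat, branch)]

# Toy input: 2 channels of 2 x 2 features, identity in place of convolutions.
feat = [[[1.0, 2.0], [3.0, 4.0]], [[0.0, 0.0], [0.0, 0.0]]]
zero_w = [[0.0, 0.0], [0.0, 0.0]]
out = attention_residual_block(feat, lambda f: f, zero_w, [0.0, 0.0])
print(out)  # every gate is sigmoid(0) = 0.5, so each value becomes 1.5x
```

In a real model the gate parameters would be trained together with the convolutions, and the same kind of gate could in principle be placed in the discriminator's convolution blocks, as the Methods paragraph describes.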

Multi‐style cartoonization: Leveraging multiple datasets with generative adversarial networks

GAN-based Multi-Style Photo Cartoonization

Creative and Diverse Artwork Generation Using Adversarial Networks

MCLGAN: a multi-style cartoonization method based on style condition information

CartoonGAN: Generative Adversarial Networks for Photo Cartoonization.

Style Fader Generative Adversarial Networks for Style Degree Controllable Artistic Style Transfer

GAN‐Based Multi‐Decomposition Photo Cartoonization

Caster: Cartoon Style Transfer Via Dynamic Cartoon Style Casting

Attribute-Guided Sketch Generation

GC-GAN: Photo Cartoonization Using Guided Cartoon Generative Adversarial Network

3D Cartoon Face Generation with Controllable Expressions from a Single GAN Image

CartoonLossGAN: Learning Surface and Coloring of Images for Cartoonization

TSGAN: A two-stage interpretable learning method for image cartoonization

Style attention based global-local aware GAN for personalized facial caricature generation

Gated-GAN: Adversarial Gated Networks for Multi-Collection Style Transfer

Learning to Incorporate Texture Saliency Adaptive Attention to Image Cartoonization

CariGAN: Caricature generation through weakly paired adversarial learning

Two Birds with One Stone: Transforming and Generating Facial Images with Iterative GAN

Two Birds with One Stone: Iteratively Learn Facial Attributes with GANs.

MW-GAN: Multi-Warping GAN for Caricature Generation With Multi-Style Geometric Exaggeration

AniGAN: Style-Guided Generative Adversarial Networks for Unsupervised Anime Face Generation