Abstract: Although Chinese calligraphy generation has achieved style transfer, generating calligraphy by specifying the calligrapher, font, and character style remains challenging. To address this, we propose a new Chinese calligraphy generation model, 'Moyun', which replaces the U-Net in the diffusion model with Vision Mamba and introduces the TripleLabel control mechanism to achieve controllable calligraphy generation. The model was tested on our large-scale dataset 'Mobao' of over 1.9 million images, and the results demonstrate that 'Moyun' can effectively control the generation process and produce calligraphy in the specified style. Even for characters the calligrapher has not written, 'Moyun' can generate calligraphy that matches the calligrapher's style.

What problem does this paper attempt to address?

This paper addresses how to generate Chinese calligraphy in a specific style by specifying the calligrapher, font, and character style. Although existing Chinese calligraphy generation techniques have achieved style transfer, generating works for a specified calligrapher, font, and character style remains challenging. Specifically:

1. **Style Control**: Existing models struggle to precisely control the style of the generated calligraphy, especially when the calligrapher, font, and character style are specified.
2. **Structure Matching**: Calligraphy generated by existing models still differs from real calligraphy works, especially in stroke structure and brushstroke details.
3. **Dataset Scale**: Existing datasets are small and their annotations are not detailed, which limits what the models can learn.

To address these challenges, the authors propose a new Chinese calligraphy generation model, "Moyun". Its main innovations are:

- **Vision Mamba**: The U-Net in the diffusion model is replaced with Vision Mamba to better capture the structural relationships between strokes.
- **TripleLabel Control Mechanism**: A multi-label control mechanism combines calligrapher, font, and character labels to control the generation process, achieving controllable calligraphy generation.
- **Large-scale Dataset**: A large-scale dataset, "Mobao", containing more than 1.9 million high-resolution binarized images is constructed, enriching the model's training data.

With these improvements, "Moyun" controls style more accurately and generates works that are highly similar to real calligraphy. It can even generate characters that a calligrapher has never written, in that calligrapher's style.
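The idea of conditioning generation on three labels can be illustrated with a minimal sketch. It is not the authors' implementation: the class name `TripleLabelEmbedder`, the element-wise sum used to combine the labels, the random vectors standing in for learned embeddings, and the example label values are all assumptions made for illustration. The summary above does not say how the three labels are actually fused or fed into the Vision Mamba backbone.

```python
import random


class TripleLabelEmbedder:
    """Hypothetical sketch: map (calligrapher, font, character) labels to one conditioning vector."""

    def __init__(self, calligraphers, fonts, characters, dim=8, seed=0):
        rng = random.Random(seed)
        self.dim = dim
        # One lookup table per label type. Random vectors stand in for
        # embeddings that a real model would learn during training.
        self.tables = {
            "calligrapher": self._make_table(calligraphers, dim, rng),
            "font": self._make_table(fonts, dim, rng),
            "character": self._make_table(characters, dim, rng),
        }

    @staticmethod
    def _make_table(labels, dim, rng):
        return {label: [rng.gauss(0.0, 1.0) for _ in range(dim)] for label in labels}

    def embed(self, calligrapher, font, character):
        vectors = [
            self.tables["calligrapher"][calligrapher],
            self.tables["font"][font],
            self.tables["character"][character],
        ]
        # Assumption: the three embeddings are fused by element-wise sum.
        # Concatenation or cross-attention would be equally plausible choices.
        return [sum(components) for components in zip(*vectors)]


# Illustrative label vocabularies (not taken from the Mobao dataset).
embedder = TripleLabelEmbedder(
    calligraphers=["Wang Xizhi", "Yan Zhenqing"],
    fonts=["regular", "running"],
    characters=["永", "和"],
)

# The vector that would condition each denoising step of the diffusion model.
cond = embedder.embed("Yan Zhenqing", "running", "永")
```

Because each label has its own table in this sketch, any combination of known labels yields a conditioning vector, including combinations that never occur together in the training data. That is one plausible reading of how a label-conditioned model could produce a character a calligrapher never wrote, though the summary does not confirm the mechanism.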

Moyun: A Diffusion-Based Model for Style-Specific Chinese Calligraphy Generation
