Controllable Anime Image Editing via Probability of Attribute Tags

Zhenghao Song,Haoran Mo,Chengying Gao
DOI: https://doi.org/10.1111/cgf.15245
IF: 2.5
2024-10-26
Computer Graphics Forum
Abstract:Editing anime images via probabilities of attribute tags allows controlling the degree of the manipulation in an intuitive and convenient manner. Existing methods fall short in the progressive modification and preservation of unintended regions in the input image. We propose a controllable anime image editing framework based on adjusting the tag probabilities, in which a probability encoding network (PEN) is developed to encode the probabilities into features that capture continuous characteristic of the probabilities. Thus, the encoded features are able to direct the generative process of a pre‐trained diffusion model and facilitate the linear manipulation. We also introduce a local editing module that automatically identifies the intended regions and constrains the edits to be applied to those regions only, which preserves the others unchanged. Comprehensive comparisons with existing methods indicate the effectiveness of our framework in both one‐shot and linear editing modes. Results in additional applications further demonstrate the generalization ability of our approach.
computer science, software engineering
What problem does this paper attempt to address?