Abstract:Icons are ubiquitous visual elements in graphic design, yet their creation is often complex and time-consuming. To resolve this problem, we draw inspiration from the booming text-to-image field and propose Text-Guided Icon Set Expansion, a novel task that helps users design high-quality icons using textual descriptions. Besides, users can control the style consistency of the created icons by inputting a few hand-crafted icons as style reference. Despite its practicality, the task poses two unique challenges. (i) Abstract Concept Visualization. Abstract concepts like technology and health are frequently encountered in icon creation, but their visualization is not straightforward and requires a grounding process that translates them into physical, easy-to-depict objects. (ii) Fine-grained Style Transfer. Unlike ordinary images, icons exhibit richer fine-grained stylistic elements, including tones, line widths, shapes, shadow effects, etc., which puts higher demands on capturing and preserving detailed styles during icon generation. To address the challenges, we propose IconDM, a method based on pre-trained text-to-image (T2I) diffusion models. Our approach incorporates a one-time domain adaptation process and an online style transfer process. In domain adaptation, we enhance the existing T2I model's capability to understand abstract concepts by fine-tuning it on high-quality icon-text pairs. To achieve this, we construct a large-scale dataset IconBank containing 2.3 million icon samples, and leverage a state-of-the-art vision-language model to generate textual descriptions for each icon. In style transfer, we introduce a Style Enhancement Module into the T2I model. It explicitly extracts the fine-grained style features from the given reference icons and is jointly optimized with the T2I model during DreamBooth tuning. To assess IconDM, we present IconBench, a structured evaluation suite with 30 icon sets and 100 concepts (including 50 abstract concepts). Quantitative results, qualitative analysis, and extensive ablation studies demonstrate the effectiveness of IconDM.

Synthetically Trained Icon Proposals for Parsing and Summarizing Infographics

Design Semiotics Based Icon Design

Understanding Infographics through Textual and Visual Tag Prediction

IconDM: Text-Guided Icon Set Expansion Using Diffusion Models

FlexIcon: Flexible Icon Colorization via Guided Images and Palettes

User-Centric Semi-Automated Infographics Authoring and Recommendation

An Intelligent Approach to Automatically Discovering Visual Insights

IconShop: Text-Guided Vector Icon Synthesis with Autoregressive Transformers

EvIcon: Designing High-Usability Icon with Human-in-the-loop Exploration and IconCLIP

Text-to-Viz: Automatic Generation of Infographics from Proportion-Related Natural Language Statements.

Towards Automated Infographic Design: Deep Learning-based Auto-Extraction of Extensible Timeline

Auto-Icon+: An Automated End-to-End Code Generation Tool for Icon Designs in UI Development

IconQA: A New Benchmark for Abstract Diagram Understanding and Visual Language Reasoning

SummVis: Interactive Visual Analysis of Models, Data, and Evaluation for Text Summarization

InfoColorizer: Interactive Recommendation of Color Palettes for Infographics

TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition

SeeBel: Seeing is Believing

Infographics Wizard: Flexible Infographics Authoring and Design Exploration

Visual Analytics for Efficient Image Exploration and User-Guided Image Captioning

Interactive design generation and optimization from generative adversarial networks in spatial computing