Abstract:Geometric modeling is a fundamental problem in computer graphics. The continuous growth of 3D models in public repositories has shifted research focus from computing local, low-level geometry features such as curvature and textures to high-level semantic information such as shape parts information and shape structural characteristics (symmetry, parallelism, etc.). However, automatically computing such semantic information is essentially an ill-posed problem due to the ambiguities in the definition of shape semantics. This thesis focuses on developing interactive approaches to aid the shape analysis process. We leverage user assistance to exploit shape semantics, and to respect and preserve them during manipulation. In particular, this thesis aims at advancing state-of-the-art interactive techniques in two specific geometric applications: shape segmentation and shape manipulation. First, we introduce two interactive tools for shape segmentation, which we call cross-boundary brushes and dot scissor. Both tools offer very simple and easy-to-use user interfaces that operate at interactive rates. In contrast to existing state-of-the-art interactive segmentation tools, our tools allow the user to cut out meaningful and functional components in most cases using only a single mouse stroke or click near boundary regions, making them very convenient to use. We adopt the concept of isolines of harmonic fields as cutting boundaries in designing both tools. We show that the propagation properties and the differentiating power of the harmonic fields allow effective computation of shape semantic boundaries for segmentation purposes. Second, we developed an editing framework that first extracts the shape's structural features and preserves them during user manipulation. In contrast to traditional shape editing frameworks, the system operates at the component level and takes a shape's structural characteristics such as inter-relations among semantic components as modeling constraints, enabling an effective structure-preserving editing tool. We show that user assistance is essential in accurately revealing complex shape structures. We use a semi-automatic shape segmentation process as a prior step to facilitate the analysis of shape structures and inter-relations, and show that these shape analysis results play an important role in preserving a shape's global features during user manipulation.

Creating Language-driven Spatial Variations of Icon Images

Semantic-based Interactive Shape Analysis and Manipulation

Design Semiotics Based Icon Design

Text-Driven Image Editing via Learnable Regions

SGEdit: Bridging LLM with Text2Image Generative Model for Scene Graph-based Image Editing

Adjusting Image Attributes of Localized Regions with Low-level Dialogue

FlexIcon: Flexible Icon Colorization via Guided Images and Palettes

OBJECT 3DIT: Language-guided 3D-aware Image Editing

Learning to Follow Object-Centric Image Editing Instructions Faithfully

IconShop: Text-Guided Vector Icon Synthesis with Autoregressive Transformers

GG-Editor: Locally Editing 3D Avatars with Multimodal Large Language Model Guidance

Enjoy Your Editing: Controllable GANs for Image Editing via Latent Space Navigation

Leveraging LLMs for On-the-Fly Instruction Guided Image Editing

DM-Align: Leveraging the Power of Natural Language Instructions to Make Changes to Images

DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing

Instruction-Guided Editing Controls for Images and Multimedia: A Survey in LLM era

ZONE: Zero-Shot Instruction-Guided Local Editing

LDEdit: Towards Generalized Text Guided Image Manipulation via Latent Diffusion Models

Empowering Visual Creativity: A Vision-Language Assistant to Image Editing Recommendations

Interactive design generation and optimization from generative adversarial networks in spatial computing

ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing