DirectGPT: A Direct Manipulation Interface to Interact with Large Language Models

Damien Masson,Sylvain Malacria,Géry Casiez,Daniel Vogel
DOI: https://doi.org/10.1145/3613904.3642462
2024-03-19
Abstract:We characterize and demonstrate how the principles of direct manipulation can improve interaction with large language models. This includes: continuous representation of generated objects of interest; reuse of prompt syntax in a toolbar of commands; manipulable outputs to compose or control the effect of prompts; and undo mechanisms. This idea is exemplified in DirectGPT, a user interface layer on top of ChatGPT that works by transforming direct manipulation actions to engineered prompts. A study shows participants were 50% faster and relied on 50% fewer and 72% shorter prompts to edit text, code, and vector images compared to baseline ChatGPT. Our work contributes a validated approach to integrate LLMs into traditional software using direct manipulation. Data, code, and demo available at <a class="link-external link-https" href="https://osf.io/3wt6s" rel="external noopener nofollow">this https URL</a>.
Human-Computer Interaction
What problem does this paper attempt to address?