Data Formulator: AI-powered Concept-driven Visualization Authoring

Chenglong Wang,John Thompson,Bongshin Lee
2023-10-28
Abstract:With most modern visualization tools, authors need to transform their data into tidy formats to create visualizations they want. Because this requires experience with programming or separate data processing tools, data transformation remains a barrier in visualization authoring. To address this challenge, we present a new visualization paradigm, concept binding, that separates high-level visualization intents and low-level data transformation steps, leveraging an AI agent. We realize this paradigm in Data Formulator, an interactive visualization authoring tool. With Data Formulator, authors first define data concepts they plan to visualize using natural languages or examples, and then bind them to visual channels. Data Formulator then dispatches its AI-agent to automatically transform the input data to surface these concepts and generate desired visualizations. When presenting the results (transformed table and output visualizations) from the AI agent, Data Formulator provides feedback to help authors inspect and understand them. A user study with 10 participants shows that participants could learn and use Data Formulator to create visualizations that involve challenging data transformations, and presents interesting future research directions.
Human-Computer Interaction,Artificial Intelligence
What problem does this paper attempt to address?
This paper attempts to address the obstacles caused by data transformation steps to users during the data visualization process. Most modern visualization tools require data to be in a tidy format, which means that each variable to be visualized is a column and each observation is a row. When the input data is not in this format, users need to perform data transformation, even if the original data is clean and contains all necessary information. However, data transformation usually requires programming experience or expertise in specific tools, which increases the user's burden, and switching between visualization creation and data transformation also incurs additional overhead. Therefore, the challenge of data transformation has become an obstacle in visualization creation. To meet this challenge, the paper proposes a brand - new visualization creation method, namely concept binding, which uses artificial intelligence agents to separate high - level visualization intentions ("what to visualize") and low - level data transformation steps ("how to format data for visualization"), thereby reducing the burden of data transformation. Specifically, the paper introduces Data Formulator, an interactive visualization creation tool that allows users to define the data concepts they plan to visualize through natural language or examples, and then bind these concepts to visual channels. Data Formulator will automatically call its backend artificial intelligence agent to generate the required visualization results based on the input data and provide feedback to help users check and understand the results. In this way, Data Formulator aims to lower the threshold for users in data transformation, enabling users without programming experience to easily create complex visualization charts.