LEVA: Using Large Language Models to Enhance Visual Analytics

Yuheng Zhao,Yixing Zhang,Yu Zhang,Xinyi Zhao,Junjie Wang,Zekai Shao,Cagatay Turkay,Siming Chen
DOI: https://doi.org/10.1109/TVCG.2024.3368060
2024-03-09
Abstract:Visual analytics supports data analysis tasks within complex domain problems. However, due to the richness of data types, visual designs, and interaction designs, users need to recall and process a significant amount of information when they visually analyze data. These challenges emphasize the need for more intelligent visual analytics methods. Large language models have demonstrated the ability to interpret various forms of textual data, offering the potential to facilitate intelligent support for visual analytics. We propose LEVA, a framework that uses large language models to enhance users' VA workflows at multiple stages: onboarding, exploration, and summarization. To support onboarding, we use large language models to interpret visualization designs and view relationships based on system specifications. For exploration, we use large language models to recommend insights based on the analysis of system status and data to facilitate mixed-initiative exploration. For summarization, we present a selective reporting strategy to retrace analysis history through a stream visualization and generate insight reports with the help of large language models. We demonstrate how LEVA can be integrated into existing visual analytics systems. Two usage scenarios and a user study suggest that LEVA effectively aids users in conducting visual analytics.
Human-Computer Interaction
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve The paper aims to address several key challenges faced by users during the Visual Analytics (VA) process to improve the efficiency of data interpretation and analysis. Specifically, these challenges include: 1. **High System Entry Difficulty**: - Users need to familiarize themselves with the data mapping and interaction methods of the VA system when using it for the first time, often leading to a steep learning curve. - The ambiguity of data information (such as the meaning, structure, type, and transformation of data) can lead to misunderstandings. - Users lack visualization literacy, making the initial stage of data analysis more laborious. 2. **Easily Losing Direction During Data Exploration**: - Users often spend a significant amount of time on interactive analysis of complex data and visualizations. - Forming hypotheses requires finding parts of multiple insights related to the task and discovering new insights through interaction, making the exploration phase more challenging. 3. **Difficulty Summarizing Final Insights**: - Summarizing and reporting findings is the final step in the VA system but is usually a very time-consuming process. - Users need to verify hypotheses through multiple iterative explorations, but not every exploration yields valuable knowledge. - Analysts often spend a lot of time and effort sifting through, refining, and manually drafting reports. To address these issues, the authors propose a framework called LEVA (LLM-Enhanced Visual Analytics), which leverages Large Language Models (LLMs) to provide support in the three stages of the VA workflow (entry, exploration, and summarization). The goal of LEVA is to help users perform visual analysis more efficiently through intelligent methods, thereby reducing user effort and time costs.