Demonstration of DB-GPT: Next Generation Data Interaction System Empowered by Large Language Models

Siqiao Xue,Danrui Qi,Caigao Jiang,Wenhui Shi,Fangyin Cheng,Keting Chen,Hongjun Yang,Zhiping Zhang,Jianshan He,Hongyang Zhang,Ganglin Wei,Wang Zhao,Fan Zhou,Hong Yi,Shaodong Liu,Hongjun Yang,Faqiang Chen
2024-04-25
Abstract:The recent breakthroughs in large language models (LLMs) are positioned to transition many areas of software. The technologies of interacting with data particularly have an important entanglement with LLMs as efficient and intuitive data interactions are paramount. In this paper, we present DB-GPT, a revolutionary and product-ready Python library that integrates LLMs into traditional data interaction tasks to enhance user experience and accessibility. DB-GPT is designed to understand data interaction tasks described by natural language and provide context-aware responses powered by LLMs, making it an indispensable tool for users ranging from novice to expert. Its system design supports deployment across local, distributed, and cloud environments. Beyond handling basic data interaction tasks like Text-to-SQL with LLMs, it can handle complex tasks like generative data analysis through a Multi-Agents framework and the Agentic Workflow Expression Language (AWEL). The Service-oriented Multi-model Management Framework (SMMF) ensures data privacy and security, enabling users to employ DB-GPT with private LLMs. Additionally, DB-GPT offers a series of product-ready features designed to enable users to integrate DB-GPT within their product environments easily. The code of DB-GPT is available at Github(
Artificial Intelligence,Machine Learning
What problem does this paper attempt to address?
The paper attempts to address the problem of how to leverage large language models (LLMs) to enhance data interaction tasks, providing users with reliable data understanding and insights. Specifically, the paper proposes an open-source Python library named DB-GPT, which aims to solve the following issues through technical means such as a multi-agent framework, a declarative workflow expression language (AWEL), and a service-oriented multi-model management framework (SMMF): 1. **Natural Language Data Interaction**: Enabling users to describe data interaction tasks through natural language and receive context-aware responses. 2. **Complex Data Interaction Task Handling**: Automating the handling of complex database interaction tasks through a multi-agent framework, such as generating data analysis. 3. **Flexible Workflow Arrangement**: Allowing users to flexibly design and execute multi-agent workflows through AWEL. 4. **Data Privacy and Security**: Supporting users in using private LLMs in a local environment through SMMF, ensuring data privacy and security. 5. **Product-Ready Features**: Providing a range of product-ready features, such as advanced knowledge extraction, fine-grained Text-to-SQL model tuning, multi-language support, and a user-friendly front-end interface. Through these technical means, DB-GPT can not only handle basic data interaction tasks but also tackle more complex tasks, such as generating data analysis, thereby providing users with a comprehensive and powerful data interaction solution.