Abstract:We introduce {\lambda}-Tune, a framework that leverages Large Language Models (LLMs) for automated database system tuning. The design of {\lambda}-Tune is motivated by the capabilities of the latest generation of LLMs. Different from prior work, leveraging LLMs to extract tuning hints for single parameters, {\lambda}-Tune generates entire configuration scripts, based on a large input document, describing the tuning context. {\lambda}-Tune generates alternative configurations, using a principled approach to identify the best configuration, out of a small set of candidates. In doing so, it minimizes reconfiguration overheads and ensures that evaluation costs are bounded as a function of the optimal run time. By treating prompt generation as a cost-based optimization problem, {\lambda}-Tune conveys the most relevant context to the LLM while bounding the number of input tokens and, therefore, monetary fees for LLM invocations. We compare {\lambda}-Tune to various baselines, using multiple benchmarks and PostgreSQL and MySQL as target systems for tuning, showing that {\lambda}-Tune is significantly more robust than prior approaches.

What problem does this paper attempt to address?

The paper attempts to address the issue of database system tuning automation. Specifically, the authors propose a framework called 𝜆-Tune, which leverages large language models (LLMs) to automate the tuning process of database systems. Unlike previous works, 𝜆-Tune not only extracts tuning suggestions for individual parameters but also generates complete configuration scripts and selects the best configuration through a series of optimization steps. The main goal of the paper is to reduce the overhead of reconfiguration, ensure that the evaluation cost is within a function range of the optimal runtime, and minimize the number of input tokens by treating prompt generation as a cost-based optimization problem, thereby controlling the cost of invoking LLMs. ### Background of the Paper The performance of database management systems is influenced by various tuning choices, including system configuration parameter settings and physical design choices (such as indexing, sorting, or partitioning). Therefore, automated database system tuning has been a hot research topic. In recent years, although using machine learning to find near-optimal configurations has been effective, it has the problem of high training and exploration costs. This has prompted new research directions, namely using large language models (LLMs) to heuristically prune the tuning search space. ### Contributions of the Paper 1. **Proposing the 𝜆-Tune Framework**: This framework uses the latest generation of large language models (such as GPT-4 and Claude 3) to optimize various tuning choices, including system parameter settings and physical design decisions. 2. **Three Key Components**: - **Prompt Generation**: An automated prompt generation step that generates customized prompts based on the input workload (analyzing SQL queries), hardware specifications, and database system. - **Configuration Selection**: Incremental evaluation of multiple candidate configurations to avoid significant overhead caused by particularly slow configurations. - **Configuration Evaluation**: Minimizing the cost of index reconfiguration through lazy index creation and dynamic programming algorithms to optimize query execution order. 3. **Experimental Evaluation**: Through experiments on Postgres and MySQL, using Join Order Benchmark (JOB) and TPC-H benchmarks, 𝜆-Tune demonstrates robustness and superiority compared to other automated database system tuning tools (such as GP-Tuner, DB-Bert, UDO, LlamaTune, and ParamTree). ### Key Technologies - **Prompt Generation**: By compressing workload representation methods, decomposing the input SQL queries into smaller mergeable fragments, selecting the most relevant information fragments, and generating efficient prompts. - **Configuration Selection**: Avoiding inefficient configurations from monopolizing the entire tuning process through multi-round evaluations and geometrically increasing timeout mechanisms. - **Configuration Evaluation**: Minimizing the cost of index reconfiguration through lazy index creation and dynamic programming algorithms. ### Experimental Results The experimental results show that 𝜆-Tune performs excellently in multiple benchmarks, consistently identifying the best-performing configurations, significantly outperforming existing automated database system tuning tools. In summary, this paper proposes the 𝜆-Tune framework, leveraging large language models to address multiple challenges in database system tuning, improving the efficiency and robustness of the tuning process.

λ-Tune: Harnessing Large Language Models for Automated Database System Tuning

LLMTune: Accelerate Database Knob Tuning with Large Language Models

Is Large Language Model Good at Database Knob Tuning? A Comprehensive Experimental Evaluation

LATuner: an LLM-Enhanced Database Tuning System Based on Adaptive Surrogate Model

MLETune: Streamlining Database Knob Tuning Via Multi-LLMs Experts Guided Deep Reinforcement Learning

LMTuner: An user-friendly and highly-integrable Training Framework for fine-tuning Large Language Models

GPTuner: A Manual-Reading Database Tuning System via GPT-Guided Bayesian Optimization

A Demonstration of GPTuner: A GPT-Based Manual-Reading Database Tuning System

Fine Tuning LLM for Enterprise: Practical Guidelines and Recommendations

Stay Tuned: An Empirical Study of the Impact of Hyperparameters on LLM Tuning in Real-World Applications

DB-GPT: Large Language Model Meets Database

IterSelectTune: An Iterative Training Framework for Efficient Instruction-Tuning Data Selection

An inquiry into machine learning-based automatic configuration tuning services on real-world database management systems

L-TUNING: Synchronized Label Tuning for Prompt and Prefix in LLMs

EE-Tuning: An Economical yet Scalable Solution for Tuning Early-Exit Large Language Models

Unveiling the Secret Recipe: A Guide For Supervised Fine-Tuning Small LLMs

Can Modern LLMs Tune and Configure LSM-based Key-Value Stores?

Large Language Models for Human-Machine Collaborative Particle Accelerator Tuning through Natural Language

A Unified and Efficient Coordinating Framework for Autonomous DBMS Tuning

A Large-scale Empirical Study on Fine-tuning Large Language Models for Unit Testing

Large Language Models for Tuning Evolution Strategies