Lauca: A Workload Duplicator for Benchmarking Transactional Database Performance
Siyang Weng,QingShuai Wang,Luyi Qu,Rong Zhang,Peng Cai,Weining Qian,Aoying Zhou
DOI: https://doi.org/10.1109/tkde.2024.3360116
IF: 9.235
2024-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract:Generating synthetic workloads is essential and critical to the performance evaluation of database systems. When benchmarking database performance for a specific application, the similarity between synthetic workloads and real application workloads determines the credibility of the evaluation results. However, it meets a great challenge to catch workload characteristics for a target online transaction processing (OLTP) application considering the complexity of transaction executions. To address this problem, we propose a workload duplicator (Lauca) that can generate synthetic workloads with highly similar performance metrics compared to a specific application on both centralized and distributed databases. By carefully studying the application-driven workload generation problem, we present Transaction Logic, Data Access Distribution and Partition Access Distribution to characterize runtime workloads and propose novel generation algorithms to guarantee the high fidelity of synthetic workloads. To the best of our knowledge, Lauca is the first application-driven transactional workload generator. We conduct extensive experiments based on TPC-C, SmallBank and YCSB on both centralized and distributed databases. The experimental results show that Lauca consistently generates high-quality synthetic workloads.
computer science, information systems, artificial intelligence,engineering, electrical & electronic