Automatic Database Index Tuning: A Survey
Yang Wu, Xuanhe Zhou, Yong Zhang, Guoliang Li
DOI: https://doi.org/10.1109/tkde.2024.3422006
IF: 9.235
2024-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract: Index tuning, which adjusts index settings to optimize database performance, plays a crucial role in efficient data retrieval within database systems. Recently, with the growth of data volumes, the increasing complexity of workloads, and the diversification of database applications, various Automatic Index Tuning (AIT) methods have been proposed to address these challenges. In this paper, we provide a comprehensive survey of Automatic Index Tuning. First, we give an overview of AIT techniques from multiple aspects, including (i) problem definition, (ii) workflow, (iii) framework, (iv) index types, (v) index interaction, (vi) changing factors, and (vii) automation level, and we trace their development history. Second, we summarize techniques in the main modules of AIT: preprocessing, index benefit estimation, and index selection. Preprocessing involves workload compression, index candidate generation, feature representation of workloads and databases, and workload reduction. Index benefit estimation approaches are categorized into empirical methods and machine-learning-based methods. Index selection covers algorithms for offline AIT and online AIT. Moreover, we summarize the commonly used datasets in AIT and discuss the applications of index tuning in commercial and open-source database products. Finally, we outline potential future research directions. Our survey aims to enhance both general knowledge and in-depth insights into AIT and to inspire researchers to address the ongoing challenges.
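
The three modules named in the abstract (preprocessing, index benefit estimation, and index selection) can be illustrated with a minimal Python sketch. It is not taken from the survey: the `Query` type, the constant scan costs, the single-column candidates, and the greedy benefit-per-size heuristic under a storage budget are all assumptions chosen for illustration. Real AIT systems typically estimate costs with the query optimizer or a learned model.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Query:
    # Hypothetical workload entry (an assumption, not from the survey).
    table: str
    filter_columns: tuple  # columns used in WHERE predicates
    frequency: int         # how often the query occurs in the workload

# Assumed toy cost model: a query costs less if any usable index exists.
FULL_SCAN_COST = 100.0
INDEX_SCAN_COST = 10.0

def generate_candidates(workload):
    """Preprocessing: one single-column candidate per filtered column."""
    return sorted({(q.table, c) for q in workload for c in q.filter_columns})

def query_cost(query, indexes):
    """Toy estimate: index scan if some filter column is indexed."""
    usable = any((query.table, c) in indexes for c in query.filter_columns)
    return INDEX_SCAN_COST if usable else FULL_SCAN_COST

def workload_cost(workload, indexes):
    """Benefit estimation building block: frequency-weighted total cost."""
    return sum(q.frequency * query_cost(q, indexes) for q in workload)

def greedy_select(workload, index_sizes, budget):
    """Index selection: repeatedly add the candidate with the best
    benefit-per-size ratio that still fits in the storage budget."""
    chosen, used = set(), 0
    candidates = generate_candidates(workload)
    while True:
        base = workload_cost(workload, chosen)
        best, best_ratio = None, 0.0
        for cand in candidates:
            size = index_sizes[cand]
            if cand in chosen or used + size > budget:
                continue
            benefit = base - workload_cost(workload, chosen | {cand})
            ratio = benefit / size
            if ratio > best_ratio:
                best, best_ratio = cand, ratio
        if best is None:  # nothing fits or nothing helps
            return chosen
        chosen.add(best)
        used += index_sizes[best]
```

Recomputing each benefit against the currently chosen set is a simple way to account for index interaction: a candidate that helps only queries already served by a chosen index has zero marginal benefit and is never picked.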