Index Selection for NoSQL Database with Deep Reinforcement Learning

Shun Yao, Hongzhi Wang, Yu Yan
DOI: https://doi.org/10.48550/arXiv.2006.08842
2020-06-16
Abstract: We propose a new approach to index selection for NoSQL databases. For different workloads, we select different indexes and their parameters to optimize database performance. The approach builds a deep reinforcement learning model that selects an optimal index for a given fixed workload and adapts to a changing workload. Experimental results show that the Deep Reinforcement Learning Index Selection Approach (DRLISA) improves performance to varying degrees compared with traditional single index structures.
Databases, Artificial Intelligence
What problem does this paper attempt to address?
This paper addresses how to dynamically select the optimal index structure and its parameters for a given workload in a NoSQL database, in order to optimize database performance. A traditional single-index structure struggles to handle the full range of workloads, and existing NoSQL database solutions usually do not combine several index structures for simultaneous use, even though practical applications often need this. Index selection should also suit most workloads and remain compatible with as many index types as possible. Moreover, the workload changes as the database is used, and the optimal index choice varies greatly across hardware environments.

To solve these problems, the authors propose DRLISA, a NoSQL database index selection method based on deep reinforcement learning. The method recommends index structures together with their parameters, uses a reinforcement learning model as a heuristic for index selection under dynamic workloads, and selects the optimal index according to the Q-value in the reinforcement learning network. In this way, DRLISA reduces the cost for system administrators of choosing appropriate indexes and provides a framework for developing an automatic index selection system.

The main contributions of the paper are:
1. Recommending index parameters along with index structures.
2. Using a reinforcement learning model to handle index selection under dynamic workloads.
3. Selecting indexes based on the Q-value in the reinforcement learning network.

The experimental results show that DRLISA improves performance to varying degrees over traditional single-index structures under different workloads, which supports the effectiveness and adaptability of the method.
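
The idea of choosing an index by its Q-value can be illustrated with a minimal sketch. This is not the authors' implementation: it replaces the deep network with a plain dictionary of Q-values, treats the workload as a single read ratio, and uses a toy reward function in place of measured database performance. The index names, parameter values, and every function name here (`select_index`, `update_q`, `simulated_reward`, `train`) are hypothetical and chosen only for illustration.

```python
import random

# Hypothetical action space: (index structure, parameter) pairs.
# These names and values are illustrative, not taken from the paper.
ACTIONS = [("btree", 64), ("btree", 256), ("hash", 1024), ("lsm", 4)]


def select_index(q_values, epsilon=0.0, rng=random):
    """Pick the action with the highest Q-value; explore with probability epsilon."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q_values.get(a, 0.0))


def update_q(q_values, action, reward, alpha=0.5):
    """Move the stored Q-value toward the observed reward (single-state update)."""
    old = q_values.get(action, 0.0)
    q_values[action] = old + alpha * (reward - old)


def simulated_reward(action, read_ratio):
    """Toy stand-in for a benchmark: hash favors reads, lsm favors writes."""
    structure, _param = action
    if structure == "hash":
        return read_ratio
    if structure == "lsm":
        return 1.0 - read_ratio
    return 0.5  # btree is assumed to be a middling choice in this toy model


def train(read_ratio, episodes=500, seed=0):
    """Learn Q-values for one fixed workload using epsilon-greedy exploration."""
    rng = random.Random(seed)
    q_values = {}
    for _ in range(episodes):
        action = select_index(q_values, epsilon=0.2, rng=rng)
        update_q(q_values, action, simulated_reward(action, read_ratio))
    return q_values


if __name__ == "__main__":
    for ratio in (0.9, 0.1):
        q = train(read_ratio=ratio)
        print(ratio, select_index(q))
```

Retraining with a different `read_ratio` stands in for a changing workload: the learned Q-values shift and the greedy choice follows them. A real system would condition the Q-values on workload features and obtain rewards from actual measurements rather than from a fixed formula.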