Database Tuning using Natural Language Processing

Immanuel Trummer
DOI: https://doi.org/10.1145/3503780.3503788
2021-12-01
ACM SIGMOD Record
Abstract:Introduction. We have seen significant advances in the state of the art in natural language processing (NLP) over the past few years [20]. These advances have been driven by new neural network architectures, in particular the Transformer model [19], as well as the successful application of transfer learning approaches to NLP [13]. Typically, training for specific NLP tasks starts from large language models that have been pre-trained on generic tasks (e.g., predicting obfuscated words in text [5]) for which large amounts of training data are available. Using such models as a starting point reduces task-specific training cost as well as the number of required training samples by orders of magnitude [7]. These advances motivate new use cases for NLP methods in the context of databases.
computer science, information systems, software engineering
What problem does this paper attempt to address?