Naturalness Evaluation of Natural Language Generation in Task-oriented Dialogues using BERT

Ye Liu, Wolfgang Maier, Wolfgang Minker, Stefan Ultes
DOI: https://doi.org/10.48550/arXiv.2109.02938
2021-09-07
Computation and Language
Abstract: This paper presents an automatic method for evaluating the naturalness of natural language generation in dialogue systems. Whereas this evaluation has previously relied on expensive and time-consuming human labor, we introduce the novel task of automatically evaluating the naturalness of generated language. Our proposed method, based on fine-tuning the BERT model, shows robust results and outperforms the baselines: support vector machines, bi-directional LSTMs, and BLEURT. In addition, the training speed and evaluation performance of the naturalness model are improved by transfer learning from linguistic knowledge of quality and informativeness.