Evaluating and Enhancing the Robustness of Retrieval-Based Dialogue Systems with Adversarial Examples.

Jia Li,Chongyang Tao,Nanyun Peng,Wei Wu,Dongyan Zhao,Rui Yan
DOI: https://doi.org/10.1007/978-3-030-32233-5_12
2019-01-01
Abstract:Retrieval-based dialogue systems have shown strong performances on both consistency and fluency according to several recent studies. However, their robustness towards malicious attacks remains largely untested. In this paper, we generate adversarial examples in black-box settings to evaluate the robustness of retrieval-based dialogue systems. On three representative retrieval-based dialogue models, our attacks reduce R-10@1 by 38.3%, 45.0% and 31.5% respectively on the Ubuntu dataset. Moreover, with adversarial training using our generated adversarial examples, we significantly improve the robustness of retrieval-based dialogue systems. We conduct thorough analysis to understand the robustness of retrieval-based dialog systems. Our results provide new insights to facilitate future work on building more robust dialogue systems.
What problem does this paper attempt to address?