Enhancing Model Robustness Via Lexical Distilling

Wentao Qin,Dongyan Zhao
DOI: https://doi.org/10.1007/978-3-030-88483-3_26
2021-01-01
Abstract:Humans are prone to making typos in writing, which, though, doesn’t affect understanding the whole sentence. However, neural models in natural language processing(NLP) would collapse when confronted with such tiny mistakes. This problem results from that neural models incline to entangle information, i.e., replacing a single aspect of the input text leads to significant changes in all components of the representation. Therefore, a trivial noise in a sentence can bring about a dramatic performance drop of the model. In this paper, we propose a novel and general framework to enhance the robustness of a model. The whole framework is trained in an adversarial style, which enables the model to encode the original sentence and the sentence refined by a lexical distiller to a similar sentence representation. We verify the effectiveness of the proposed framework in auto-encoder task. Experimental results show that our framework enhances the robustness of the model in different aspects.
What problem does this paper attempt to address?