Formality Style Transfer with Shared Latent Space

Yunli Wang, Yu Wu, Lili Mou, Zhoujun Li, Wen-Han Chao
DOI: https://doi.org/10.18653/v1/2020.coling-main.203
2020-01-01
Abstract: Conventional approaches for formality style transfer borrow models from neural machine translation, which typically requires massive parallel data for training. However, the dataset for formality style transfer is considerably smaller than translation corpora. Moreover, we observe that informal and formal sentences closely resemble each other, which is different from the translation task where two languages have different vocabularies and grammars. In this paper, we present a new approach, Sequence-to-Sequence with Shared Latent Space (S2S-SLS), for formality style transfer, where we propose two auxiliary losses and adopt joint training of bi-directional transfer and auto-encoding. Experimental results show that S2S-SLS (with either RNN or Transformer architectures) consistently outperforms baselines in various settings, especially when we have limited data.
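As a rough illustration of what "joint training of bi-directional transfer and auto-encoding" could look like at the data level, the sketch below expands one parallel sentence pair into four training examples: two transfer directions and two reconstructions. It is a minimal sketch under stated assumptions, not the authors' implementation. The `Example` type, the `target_style` tag, and the function name are hypothetical, and the two auxiliary losses are left out because the abstract does not define them.

```python
from typing import List, NamedTuple


class Example(NamedTuple):
    """One training example for a single shared encoder-decoder (hypothetical type)."""
    source: str
    target_style: str  # assumed style tag telling the decoder which style to emit
    target: str


def joint_training_examples(informal: str, formal: str) -> List[Example]:
    """Expand one parallel pair into transfer and auto-encoding examples."""
    return [
        # Bi-directional transfer: each sentence is mapped to its counterpart.
        Example(informal, "formal", formal),
        Example(formal, "informal", informal),
        # Auto-encoding: each sentence is reconstructed in its own style.
        Example(informal, "informal", informal),
        Example(formal, "formal", formal),
    ]


if __name__ == "__main__":
    pair = ("gotta go now, c u later", "I have to leave now. See you later.")
    for example in joint_training_examples(*pair):
        print(example)
```

Because all four directions pass through the same model, every parallel pair yields four supervised signals instead of one, which is one plausible reason such joint training would help when parallel data is limited.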