Semi-Supervised Seq2seq Joint-Stochastic-Approximation Autoencoders with Applications to Semantic Parsing

Yunfu Song, Zhijian Ou
DOI: https://doi.org/10.1109/lsp.2019.2953999
2019-01-01
IEEE Signal Processing Letters
Abstract: Developing Semi-Supervised Seq2Seq ($S^4$) learning for sequence transduction tasks in natural language processing (NLP), e.g. semantic parsing, is challenging, since both the input and the output sequences are discrete. This discrete nature causes trouble for methods that need gradients from either the input space or the output space. Recently, a new learning method called joint stochastic approximation was developed for unsupervised learning of fixed-dimensional autoencoders; it theoretically avoids gradient propagation through discrete latent variables, a problem from which Variational Auto-Encoders (VAEs) suffer. In this letter, we propose seq2seq Joint-stochastic-approximation Auto-Encoders (JAEs) and apply them to $S^4$ learning for NLP sequence transduction tasks. Further, we propose bi-directional JAEs (called bi-JAEs) to leverage not only unpaired input sequences (the most commonly studied case) but also unpaired output sequences. Experiments on two benchmark datasets for semantic parsing show that JAEs consistently outperform VAEs in $S^4$ learning and that bi-JAEs yield further improvements.