Hierarchical Latent Alignment for Non-Autoregressive Generation under High Compression Ratio.

Wang Xu,Yongliang Ma,Kehai Chen,Ming Zhou,Muyun Yang,Tiejun Zhao
DOI: https://doi.org/10.1587/transinf.2023edp7111
2024-01-01
IEICE Transactions on Information and Systems
Abstract:Non-autoregressive generation has attracted more and more attention due to its fast decoding speed. Latent alignment objectives, such as CTC, are designed to capture the monotonic alignments between the predicted and output tokens, which have been used for machine translation and sentence summarization. However, our preliminary experiments revealed that CTC performs poorly on document abstractive summarization, where a high compression ratio between the input and output is involved. To address this issue, we conduct a theoretical analysis and propose Hierarchical Latent Alignment (HLA). The basic idea is a two-step alignment process: we first align the sentences in the input and output, and subsequently derive token-level alignment using CTC based on aligned sentences. We evaluate the effectiveness of our proposed approach on two widely used datasets XSUM and CNNDM. The results indicate that our proposed method exhibits remarkable scalability even when dealing with high compression ratios.
What problem does this paper attempt to address?