Enhancing Language Generation with Effective Checkpoints of Pre-trained Language Model.

Jeonghyeok Park, Hai Zhao
DOI: https://doi.org/10.18653/v1/2021.findings-acl.237
2021-01-01
Abstract: This work empirically explores how to effectively exploit intermediate output from pre-trained language models (PrLMs) for language generation tasks. For this purpose, we propose an improved method that integrates public checkpoints of PrLMs for maximum convenience, and we perform extensive experiments on 6 different kinds of PrLMs, including BERT, ELECTRA, GPT2, Multi-lingual BERT, and XLM RoBERTa. Evaluation with automatic metrics shows that our approach significantly improves generation quality, with gains of up to 1.8 BLEU points for neural machine translation (Korean-to-English, Korean-to-Chinese) and 1.8 ROUGE points for text summarization.