Abstract: Cross-lingual summarization (CLS) is the task of condensing a lengthy source-language text into a concise summary in a target language. This presents a dual challenge, demanding both cross-language semantic understanding (i.e., semantic alignment) and effective information compression. Traditionally, researchers have tackled these challenges with two types of methods: pipeline methods (e.g., translate-then-summarize) and end-to-end methods. The former is intuitive but prone to error propagation, particularly for low-resource languages. The latter has shown impressive performance thanks to multilingual pre-trained models (mPTMs). However, mPTMs (e.g., mBART) are primarily trained on resource-rich languages, which limits their semantic alignment capabilities for low-resource languages. To address these issues, this paper combines the intuitiveness of pipeline methods with the effectiveness of mPTMs and proposes a two-stage fine-tuning method for low-resource cross-lingual summarization (TFLCLS). In the first stage, because mPTMs are deficient in semantic alignment for low-resource languages, a semantic alignment fine-tuning method is employed to enhance the mPTMs' understanding of such languages. In the second stage, because mPTMs are not originally tailored for information compression and CLS requires the model to align and compress simultaneously, an adaptive joint fine-tuning method is introduced. This method further enhances the semantic alignment and information compression abilities of the mPTMs trained in the first stage. To evaluate the performance of TFLCLS, a low-resource CLS dataset, named Vi2ZhLow, is constructed from scratch; in addition, two further low-resource CLS datasets, En2ZhLow and Zh2EnLow, are synthesized from widely used large-scale CLS datasets. Experimental results show that TFLCLS outperforms state-of-the-art methods by 18.88%, 12.71% and 16.91% in ROUGE-2 on the three datasets, respectively, even when limited to only 5,000 training samples.
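
For readers unfamiliar with the reported metric, the following is a minimal sketch of how a ROUGE-2 score (bigram overlap between a generated summary and a reference summary) can be computed. It is an illustration only, not the paper's evaluation script: the function names `bigrams` and `rouge_2` are hypothetical, inputs are assumed to be pre-tokenized lists, and the stemming and tokenization choices of standard ROUGE toolkits are omitted. The abstract does not state which tokenizer was used for Chinese or whether the quoted gains are relative or absolute.

```python
from collections import Counter


def bigrams(tokens):
    """Count adjacent token pairs in a token list (hypothetical helper)."""
    return Counter(zip(tokens, tokens[1:]))


def rouge_2(candidate_tokens, reference_tokens):
    """Return ROUGE-2 precision, recall, and F1 for one summary pair.

    Assumes both inputs are already tokenized; overlap is the clipped
    bigram match count, i.e. the multiset intersection of the two counters.
    """
    cand = bigrams(candidate_tokens)
    ref = bigrams(reference_tokens)
    overlap = sum((cand & ref).values())
    if overlap == 0:
        # Also covers empty or single-token inputs, which have no bigrams.
        return {"precision": 0.0, "recall": 0.0, "f1": 0.0}
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    f1 = 2 * precision * recall / (precision + recall)
    return {"precision": precision, "recall": recall, "f1": f1}


if __name__ == "__main__":
    candidate = "the cat sat on the mat".split()
    reference = "the cat lay on the mat".split()
    print(rouge_2(candidate, reference))
```

In the usage example, three of the five candidate bigrams also occur in the reference, so precision and recall are both 3/5.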

A two-stage fine-tuning method for low-resource cross-lingual summarization
