Abstract: Keyphrase generation helps people quickly obtain key information from a long document, such as a social media post or a scientific article. It has made significant progress in recent years, especially with models trained on keyphrases concatenated in a predefined order. However, when beam search is used for keyphrase generation, models tend to repeatedly generate the highest-priority keyphrase type in every beam branch, which weakens generation performance on the lower-priority ("underdog") keyphrase type. To tackle this, we introduce the One2MultiSeq paradigm, which trains the model on two keyphrase sequences concatenated in completely opposite orders. Moreover, social media content is often colloquial, informal, and multimodal (comprising images as well as text), so models need prior knowledge to process it effectively. Contemporary models lack this capacity, which limits their ability to handle such discrete data elements. To overcome this, we adopt the pretrained model BART as our backbone architecture and add a copy mechanism to further strengthen its keyphrase generation capabilities. Experimental results show that our method outperforms relatively advanced models, with gains of 3.51, 1.55, and 2.47 percentage points in F1@1, F1@3, and MAP@5, respectively, on the unimodal Twitter dataset; 3.23, 2.68, and 4.07 on the multimodal Tweet dataset; and 4.32, 0.32, and 7.07 in F1@3, F1@5, and MAP@5, respectively, on the StackExchange dataset.
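As a rough illustration of training on two oppositely ordered target sequences, the sketch below builds a forward and a reversed concatenation from the same keyphrases. It is a minimal sketch under stated assumptions, not the paper's implementation: the function name `build_targets`, the ` ; ` separator, and the reading of "opposite order" as a full reversal of the concatenated list are all hypothetical.

```python
from typing import List, Tuple


def build_targets(
    high_priority: List[str],
    low_priority: List[str],
    sep: str = " ; ",  # hypothetical separator, not taken from the paper
) -> Tuple[str, str]:
    """Return two target strings whose keyphrase orders are opposite.

    Assumption: the forward target lists the high-priority type first,
    and the backward target is the exact reverse of that list, so the
    low-priority type leads the second sequence.
    """
    forward = list(high_priority) + list(low_priority)
    backward = forward[::-1]
    return sep.join(forward), sep.join(backward)


# Toy keyphrases for one post; both targets would be paired with the same source text.
fwd, bwd = build_targets(["nlp", "bart"], ["keyphrase generation"])
print(fwd)
print(bwd)
```

Pairing one source with both targets means each keyphrase type appears at the start of a sequence in at least one training example, which is the intuition the abstract gives for reducing the bias toward the highest-priority type.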

Training with One2MultiSeq: CopyBART for social media keyphrase generation
