Abstract:Comparative Opinion Quintuple Extraction (COQE) is an emerging task focused on identifying comparative relationships within sentence-level comments. COQE involves extracting a quintuple that encapsulates a specific comparative relation, encompassing a subject, object, comparative aspect, opinion, and preference. Previous approaches to COQE relied on pipeline methods, which unfortunately suffered from error propagation during the sequential process. In our work, we present an efficient and highly effective end-to-end solution for COQE, improving three key aspects: encoding, decoding, and learning. First, in terms of feature encoding, we introduce external syntactic dependency structural features to enhance the modeling of opinion elements. We also propose a novel dynamic structural pruning (DSP) mechanism to refine and optimize the raw syntax structure, ensuring alignment with the end task. Second, regarding decoding, we address the challenge of non-coherent relationships among quintuple constituents by formulating COQE as a set prediction problem. We employ a non-autoregressive decoding scheme to simultaneously generate all possible quintuples in parallel, increasing decoding speed while retaining the advantages of a generative approach . Finally, from a learning perspective, we introduce three advanced training targets: bipartite matching learning loss, cross-element contrastive loss , and mention boundary learning loss. These targets promote interaction learning between quadruples and elements. Through extensive experiments on three benchmark datasets, our end-to-end COQE system demonstrates significant performance improvements over both current pipeline baselines and state-of-the-art joint systems by 2 to 3 points. We also demonstrate that our proposed dynamic pruning strategy over syntactic dependency features is important to the overall performance, and meanwhile our system is capable of producing explainable predictions. Furthermore, the non-autoregressive decoding greatly enhances the inference efficiency, and multiple advanced training losses help achieve better learning process.

ComOM at VLSP 2023: A Dual-Stage Framework with BERTology and Unified Multi-Task Instruction Tuning Model for Vietnamese Comparative Opinion Mining

Overview of the VLSP 2023 -- ComOM Shared Task: A Data Challenge for Comparative Opinion Mining from Vietnamese Product Reviews

Enhancing Comparative Opinion Mining in Vietnamese Product Reviews: A Hybrid Generative Model Approach with Knowledge Base Integration

Unveiling Comparative Sentiments in Vietnamese Product Reviews: A Sequential Classification Framework

Comparative Opinion Mining in Product Reviews: Multi-perspective Prompt-based Learning

VlogQA: Task, Dataset, and Baseline Models for Vietnamese Spoken-Based Machine Reading Comprehension

VLSP 2021 - ViMRC Challenge: Vietnamese Machine Reading Comprehension

Investigating Monolingual and Multilingual BERTModels for Vietnamese Aspect Category Detection

Overview of the VLSP 2022 -- Abmusu Shared Task: A Data Challenge for Vietnamese Abstractive Multi-document Summarization

Multi-Dialect Vietnamese: Task, Dataset, Baseline Models and Challenges

Vietnamese Sentiment Analysis: An Overview and Comparative Study of Fine-tuning Pretrained Language Models

UIT-OpenViIC: A Novel Benchmark for Evaluating Image Captioning in Vietnamese

BERT-VBD: Vietnamese Multi-Document Summarization Framework

Scaling up COMETKIWI: Unbabel-IST 2023 Submission for the Quality Estimation Shared Task

Software Mention Recognition with a Three-Stage Framework Based on BERTology Models at SOMD 2024

VLUE: A New Benchmark and Multi-task Knowledge Transfer Learning for Vietnamese Natural Language Understanding

End-to-end comparative opinion quintuple extraction as bipartite set prediction with dynamic structure pruning

DAMO-NLP at SemEval-2023 Task 2: A Unified Retrieval-augmented System for Multilingual Named Entity Recognition

A Unified Span-Based Approach for Opinion Mining with Syntactic Constituents

Benchmarking LLMs on the Semantic Overlap Summarization Task

Sentence Extraction-Based Machine Reading Comprehension for Vietnamese