Structured Tree Alignment for Evaluation of (Speech) Constituency Parsing

Freda Shi, Kevin Gimpel, Karen Livescu

2024-06-20

Abstract: We present the structured average intersection-over-union ratio (STRUCT-IOU), a similarity metric between constituency parse trees motivated by the problem of evaluating speech parsers. STRUCT-IOU enables comparison of a constituency parse tree (over automatically recognized spoken word boundaries) with the ground-truth parse (over written words). To compute the metric, we project the ground-truth parse tree to the speech domain by forced alignment, align the projected ground-truth constituents with the predicted ones under certain structured constraints, and calculate the average IOU score across all aligned constituent pairs. STRUCT-IOU takes word boundaries into account and overcomes the challenge that the predicted words and ground truth may not have perfect one-to-one correspondence. Extending to the evaluation of text constituency parsing, we demonstrate that STRUCT-IOU can address token-mismatch issues and that it shows higher tolerance to syntactically plausible parses than PARSEVAL (Black et al., 1991).
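The final step of the abstract, averaging IOU over aligned constituent pairs, can be illustrated with a minimal Python sketch. It assumes each constituent is reduced to a `(start, end)` time span in seconds and that an alignment is already given as index pairs. The names `interval_iou` and `average_iou` are hypothetical, and the plain mean over aligned pairs is an assumption taken from the abstract's wording. The sketch does not implement the forced-alignment projection or the search for the best alignment under the structured constraints.

```python
from typing import List, Tuple

Span = Tuple[float, float]  # (start_sec, end_sec) of one constituent; illustrative type


def interval_iou(a: Span, b: Span) -> float:
    """Intersection-over-union of two time intervals."""
    inter = max(0.0, min(a[1], b[1]) - max(a[0], b[0]))
    union = (a[1] - a[0]) + (b[1] - b[0]) - inter
    return inter / union if union > 0 else 0.0


def average_iou(gold: List[Span], pred: List[Span],
                alignment: List[Tuple[int, int]]) -> float:
    """Mean IOU over aligned (gold_index, pred_index) pairs.

    Assumption: the alignment is supplied by the caller; finding the best
    alignment under structured constraints is not shown here.
    """
    if not alignment:
        return 0.0
    return sum(interval_iou(gold[i], pred[j]) for i, j in alignment) / len(alignment)


# Toy example: a root constituent with two children, where the predicted
# parser places the internal boundary at 0.5 s instead of 1.0 s.
gold_spans = [(0.0, 2.0), (0.0, 1.0), (1.0, 2.0)]
pred_spans = [(0.0, 2.0), (0.0, 0.5), (0.5, 2.0)]
pairs = [(0, 0), (1, 1), (2, 2)]
score = average_iou(gold_spans, pred_spans, pairs)
print(round(score, 4))
```

In this toy case the shifted boundary lowers the score without zeroing it, which is the kind of partial credit an IOU-based measure gives when boundaries do not match exactly.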

Computation and Language, Data Structures and Algorithms

What problem does this paper attempt to address?

The paper addresses the following issues:

1. **Developing a New Evaluation Metric**: The paper introduces STRUCT-IOU, a metric for evaluating constituency parsing of speech. It aims to overcome the limitations of traditional evaluation methods, such as PARSEVAL, when applied to speech data.
2. **Adapting to the Characteristics of Speech Parsing**: Word boundaries produced by automatic speech recognition may be inaccurate. STRUCT-IOU takes word boundaries into account and can measure the similarity between parse trees even when the predicted words and the ground-truth words do not correspond one-to-one.
3. **Addressing Token Mismatch Issues in Text Parsing**: Although motivated by speech parsing, STRUCT-IOU also applies to the evaluation of text parsing. It can handle token mismatches and shows higher tolerance for syntactically plausible parses than PARSEVAL.

In summary, the paper proposes STRUCT-IOU as an evaluation metric for constituency parsing of both speech and text, particularly when word boundaries are uncertain or tokens do not match.
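For contrast with the PARSEVAL-style evaluation mentioned above, here is a small Python sketch of exact-match bracket scoring. It is a simplification under stated assumptions: brackets are unlabeled `(start, end)` word-index spans, both trees share the same tokenization, and the function name `bracket_f1` is hypothetical. It is not the EVALB implementation.

```python
from typing import Set, Tuple

Bracket = Tuple[int, int]  # (start_word_index, end_word_index), end exclusive


def bracket_f1(gold: Set[Bracket], pred: Set[Bracket]) -> float:
    """Unlabeled bracket F1: a predicted span counts only if it matches exactly."""
    matched = len(gold & pred)
    if matched == 0:
        return 0.0
    precision = matched / len(pred)
    recall = matched / len(gold)
    return 2 * precision * recall / (precision + recall)


# Toy example over four words: only the root span matches exactly,
# so the two near-miss child spans receive no credit at all.
gold_brackets = {(0, 4), (0, 2), (2, 4)}
pred_brackets = {(0, 4), (0, 1), (1, 4)}
f1 = bracket_f1(gold_brackets, pred_brackets)
print(round(f1, 4))
```

Exact matching of this kind presupposes a shared token sequence, which is why mismatched tokens or uncertain word boundaries are awkward for it, and why an overlap-based score that awards partial credit behaves differently.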
