Syntax-aware Multilingual Semantic Role Labeling

Shexia He,Zuchao Li,Hai Zhao
DOI: https://doi.org/10.48550/arXiv.1909.00310
2019-09-10
Abstract:Recently, semantic role labeling (SRL) has earned a series of success with even higher performance improvements, which can be mainly attributed to syntactic integration and enhanced word representation. However, most of these efforts focus on English, while SRL on multiple languages more than English has received relatively little attention so that is kept underdevelopment. Thus this paper intends to fill the gap on multilingual SRL with special focus on the impact of syntax and contextualized word representation. Unlike existing work, we propose a novel method guided by syntactic rule to prune arguments, which enables us to integrate syntax into multilingual SRL model simply and effectively. We present a unified SRL model designed for multiple languages together with the proposed uniform syntax enhancement. Our model achieves new state-of-the-art results on the CoNLL-2009 benchmarks of all seven languages. Besides, we pose a discussion on the syntactic role among different languages and verify the effectiveness of deep enhanced representation for multilingual SRL.
Computation and Language
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is the impact of grammar and context word representations on performance improvement in Multilingual Semantic Role Labeling (SRL), especially for non - English languages. Although significant progress has been made in English SRL in recent years, these achievements are mainly concentrated on English, and there is relatively little research on SRL in other languages. Therefore, this paper aims to fill this gap by introducing a grammar - guided parameter pruning method and context word representations to improve the performance of multilingual SRL and explore the grammatical roles among different languages. Specifically, the paper proposes a new parameter pruning method based on grammar rules, which can effectively integrate grammar information into the multilingual SRL model, thereby simplifying and efficiently processing multilingual data. In addition, the paper also explores the effectiveness of deep enhanced representations for multilingual SRL, especially in different languages. Through experiments on the CoNLL - 2009 benchmark dataset, the paper shows that its proposed model has achieved new best results in all seven languages, which is the first comprehensive update of the performance of multilingual SRL since 2009. These results not only verify the effectiveness of the proposed parameter pruning method for multilingual SRL, but also prove the crucial role of context word representations in improving the performance of multilingual SRL.