Enhanced Universal Dependency Parsing with Automated Concatenation of Embeddings

Xinyu Wang,Zixia Jia,Yong Jiang,Kewei Tu
DOI: https://doi.org/10.48550/arXiv.2107.02416
2021-07-06
Abstract:This paper describes the system used in submission from SHANGHAITECH team to the IWPT 2021 Shared Task. Our system is a graph-based parser with the technique of Automated Concatenation of Embeddings (ACE). Because recent work found that better word representations can be obtained by concatenating different types of embeddings, we use ACE to automatically find the better concatenation of embeddings for the task of enhanced universal dependencies. According to official results averaged on 17 languages, our system ranks 2nd over 9 teams.
Computation and Language,Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to improve the performance of Enhanced Universal Dependency Parsing (EUD). Specifically, the authors hope to optimize the combinations of different types of word embeddings through the Automated Concatenation of Embeddings (ACE) technique, thereby enhancing the performance of EUD parsers on multiple languages. ### Problem Background 1. **Enhanced Universal Dependencies (EUD)** - EUD is an extension of traditional Universal Dependencies (UD). It aims to make some implicit inter - word relationships more explicit and adds some dependency labels to help distinguish different types of arguments and modifiers. - The representation form of EUD is an enhanced graph containing re - entrancy, cycles, and null nodes. This representation can express richer grammatical relationships than a rooted tree, but it is more difficult to learn. 2. **Existing Challenges** - The complex structure of EUD makes the learning process more difficult. - How to select and combine different pre - trained word embeddings (such as context - aware embeddings and non - context - aware embeddings) to obtain better word representations and further improve the performance of the parser. ### Solution To address these challenges, the authors propose the following methods: - **Data Pre - processing**: Convert the EUD graph into a two - lexical structure similar to Semantic Dependency Parsing (SDP). By reducing re - entrancy and null nodes, many SDP methods can be applied to the EUD task. - **System Architecture**: Use a graph - based biaffine parser and introduce an ACE module to automatically find better embedding combinations. - **ACE Module**: Efficiently search for the best embedding combinations for each language through reinforcement learning, and then provide them as input to the biaffine parser. ### Experimental Results According to the official evaluation results, in terms of the average ELAS score (F1 - score over the set of enhanced dependencies in the system output and the gold standard) on 17 languages, the performance of this system ranks second, only second to one team. In particular, this system achieved the first - place result on Arabic. ### Summary This paper significantly improves the performance of multi - language EUD parsing tasks by introducing the ACE module, demonstrating the effectiveness of automatically selecting and combining different types of word embeddings.