Structural Embedding of Syntactic Trees for Machine Comprehension

Rui Liu,Junjie Hu,Wei Wei,Zi Yang,Eric Nyberg
DOI: https://doi.org/10.48550/arXiv.1703.00572
2017-09-01
Abstract:Deep neural networks for machine comprehension typically utilizes only word or character embeddings without explicitly taking advantage of structured linguistic information such as constituency trees and dependency trees. In this paper, we propose structural embedding of syntactic trees (SEST), an algorithm framework to utilize structured information and encode them into vector representations that can boost the performance of algorithms for the machine comprehension. We evaluate our approach using a state-of-the-art neural attention model on the SQuAD dataset. Experimental results demonstrate that our model can accurately identify the syntactic boundaries of the sentences and extract answers that are syntactically coherent over the baseline methods.
Computation and Language
What problem does this paper attempt to address?
The problem that this paper attempts to solve is that in machine comprehension tasks, existing deep neural network models mainly utilize word or character embeddings, without making full use of the structured language information of sentences, such as syntactic trees (including constituent trees and dependency trees). The paper proposes a new algorithm framework - Structural Embedding of Syntactic Trees (SEST), aiming to utilize this structured information and encode it into vector representations to improve the performance of machine - comprehension algorithms. Specifically, evaluated by using the state - of - the - art neural attention model on the SQuAD dataset, the experimental results show that this model can accurately identify the syntactic boundaries of sentences and extract grammatically coherent answers, thus outperforming the baseline methods.