Modeling Human Sentence Processing with Left-Corner Recurrent Neural Network Grammars

Ryo Yoshida,Hiroshi Noji,Yohei Oseki
2023-10-05
Abstract:In computational linguistics, it has been shown that hierarchical structures make language models (LMs) more human-like. However, the previous literature has been agnostic about a parsing strategy of the hierarchical models. In this paper, we investigated whether hierarchical structures make LMs more human-like, and if so, which parsing strategy is most cognitively plausible. In order to address this question, we evaluated three LMs against human reading times in Japanese with head-final left-branching structures: Long Short-Term Memory (LSTM) as a sequential model and Recurrent Neural Network Grammars (RNNGs) with top-down and left-corner parsing strategies as hierarchical models. Our computational modeling demonstrated that left-corner RNNGs outperformed top-down RNNGs and LSTM, suggesting that hierarchical and left-corner architectures are more cognitively plausible than top-down or sequential architectures. In addition, the relationships between the cognitive plausibility and (i) perplexity, (ii) parsing, and (iii) beam size will also be discussed.
Computation and Language
What problem does this paper attempt to address?
The problems that this paper attempts to solve are as follows: In computational linguistics, researchers have been exploring whether language models (LMs) can become more human - like by explicitly modeling the hierarchical structure of natural languages. However, most previous studies have ignored the parsing strategies in hierarchical models. This paper aims to explore whether the hierarchical structure makes LMs more human - like and, if so, which parsing strategy is the most cognitively reasonable. To this end, the author evaluated the performance of three language models on human reading time in Japanese (which has a head - final left - branching structure): the Long - Short - Term Memory network (LSTM) as a sequential model, and Recursive Neural Network Grammars (RNNGs) with top - down and left - corner parsing strategies as hierarchical models. Specifically, the author addresses this problem through the following points: 1. **Evaluating the performance of different models**: Compare the performance of LSTM, top - down RNNG, and left - corner RNNG in predicting human reading time. 2. **Comparison of parsing strategies**: Explore which parsing strategy (top - down vs. left - corner) is more cognitively reasonable. 3. **Model evaluation metrics**: Use perplexity, parsing accuracy, and psychometric predictive power to evaluate the performance of the models. Through these methods, the author hopes to reveal the influence of the hierarchical structure and different parsing strategies on language models in simulating human sentence - processing processes.