Integrating large language models and active inference to understand eye movements in reading and dyslexia

Francesco Donnarumma,Mirco Frosolone,Giovanni Pezzulo

2024-02-24

Abstract:We present a novel computational model employing hierarchical active inference to simulate reading and eye movements. The model characterizes linguistic processing as inference over a hierarchical generative model, facilitating predictions and inferences at various levels of granularity, from syllables to sentences. Our approach combines the strengths of large language models for realistic textual predictions and active inference for guiding eye movements to informative textual information, enabling the testing of predictions. The model exhibits proficiency in reading both known and unknown words and sentences, adhering to the distinction between lexical and nonlexical routes in dual-route theories of reading. Notably, our model permits the exploration of maladaptive inference effects on eye movements during reading, such as in dyslexia. To simulate this condition, we attenuate the contribution of priors during the reading process, leading to incorrect inferences and a more fragmented reading style, characterized by a greater number of shorter saccades. This alignment with empirical findings regarding eye movements in dyslexic individuals highlights the model's potential to aid in understanding the cognitive processes underlying reading and eye movements, as well as how reading deficits associated with dyslexia may emerge from maladaptive predictive processing. In summary, our model represents a significant advancement in comprehending the intricate cognitive processes involved in reading and eye movements, with potential implications for understanding and addressing dyslexia through the simulation of maladaptive inference. It may offer valuable insights into this condition and contribute to the development of more effective interventions for treatment.

Neurons and Cognition,Computation and Language

What problem does this paper attempt to address?

The problem this paper attempts to address is the construction of a novel computational model to simulate eye movement behavior during reading by combining large language models (LLM) and active inference. Specifically, the model aims to: 1. **Understand eye movement behavior during reading**: By viewing language processing as an inference process based on a hierarchical generative model, the model can make predictions and inferences at different levels of granularity (from syllables to sentences). 2. **Explore abnormal inference effects in reading disorders (such as dyslexia)**: By weakening the contribution of prior information during the reading process, the model can simulate eye movement patterns similar to those of dyslexic patients, characterized by more frequent short saccades. 3. **Validate the dual-route reading theory**: The model can distinguish between the lexical route and the nonlexical route, thereby better understanding the cognitive mechanisms involved in the reading process. 4. **Provide insights into the potential mechanisms of reading disorders**: By simulating abnormal predictive processing in reading disorders, the model helps to understand the cognitive basis of reading disorders and may offer valuable insights for therapeutic interventions. In summary, the goal of this paper is to combine large language models and active inference to establish a computational model capable of simulating both normal and abnormal reading processes, thereby gaining a deeper understanding of the cognitive processes involved in reading and eye movement behavior, particularly the mechanisms related to reading disorders.

Integrating large language models and active inference to understand eye movements in reading and dyslexia

Language models outperform cloze predictability in a cognitive model of reading

Integrating Large Language Model, EEG, and Eye-Tracking for Word-Level Neural State Classification in Reading Comprehension

Dynamical Cognitive Modeling of Syntactic Processing and Eye Movement Control in Reading

Understanding Dyslexia Through Personalized Large-Scale Computational Models

SEAM: An Integrated Activation-Coupled Model of Sentence Processing and Eye Movements in Reading

Integrating LLM, EEG, and Eye-Tracking Biomarker Analysis for Word-Level Neural State Classification in Semantic Inference Reading Comprehension

From Word Embedding to Reading Embedding Using Large Language Model, EEG and Eye-tracking

Machine-Learned Computational Models Can Enhance the Study of Text and Discourse: A Case Study Using Eye Tracking to Model Reading Comprehension

Eyettention: An Attention-based Dual-Sequence Model for Predicting Human Scanpaths during Reading

Simultaneous simulations of pure, surface and phonological acquired dyslexia within a full computational model of the primary systems hypothesis

Predictive models of reading difficulties considering neuropsycholinguistic profiles of atypical and ADHD-inattentive type readers, and eye-tracking measures

Identifying dyslexia in school pupils from eye movement and demographic data using artificial intelligence

ScanDL: A Diffusion Model for Generating Synthetic Scanpaths on Texts

Predictive Model for Dyslexia from Eye Fixation Events

Fine-Grained Prediction of Reading Comprehension from Eye Movements

DysLexML: Screening Tool for Dyslexia Using Machine Learning

Cross-Lingual Transfer of Cognitive Processing Complexity

Eye movement analyses indicate the underlying reading strategy in the recovery of lexical readers

Multifractal information on reading eye tracking data