SARA: Smart AI Reading Assistant for Reading Comprehension

Enkeleda Thaqi,Mohamed Mantawy,Enkelejda Kasneci
DOI: https://doi.org/10.1145/3649902.3655661
2024-04-10
Abstract:SARA integrates Eye Tracking and state-of-the-art large language models in a mixed reality framework to enhance the reading experience by providing personalized assistance in real-time. By tracking eye movements, SARA identifies the text segments that attract the user's attention the most and potentially indicate uncertain areas and comprehension issues. The process involves these key steps: text detection and extraction, gaze tracking and alignment, and assessment of detected reading difficulty. The results are customized solutions presented directly within the user's field of view as virtual overlays on identified difficult text areas. This support enables users to overcome challenges like unfamiliar vocabulary and complex sentences by offering additional context, rephrased solutions, and multilingual help. SARA's innovative approach demonstrates it has the potential to transform the reading experience and improve reading proficiency.
Human-Computer Interaction
What problem does this paper attempt to address?
The paper introduces an intelligent reading assistant system named SARA (Smart AI Reading Assistant), which aims to enhance users' reading experience and comprehension through mixed reality technology. The SARA system combines eye-tracking technology and advanced large language models to provide personalized real-time support within a mixed reality framework. Specifically, SARA can achieve the following functions: 1. **Text Position Recognition**: Determine the text position the user is reading by recognizing QR codes. 2. **Text Extraction**: Capture text images in the user's field of view using a camera and further extract the text content. 3. **Optical Character Recognition (OCR)**: Analyze the extracted text images to recognize the textual information. 4. **Eye-Tracking**: Monitor the user's gaze focus to determine their fixation points on the text. 5. **Reading Difficulty Classification**: Identify reading difficulties based on changes in the user's gaze duration and reading patterns. 6. **Provide Reading Support**: Offer assistance such as definitions, translations, simplifications, or paraphrases for identified difficulties, such as unfamiliar vocabulary or hard-to-understand passages. Through the above steps, SARA can effectively help users overcome obstacles in the reading process, improving reading efficiency and comprehension. Additionally, the system demonstrates the significant potential of mixed reality technology and advanced language models in enhancing reading assistance.