Evaluation of Automated Image Descriptions for Visually Impaired Students

Anett Hoppe, David Morris, Ralph Ewerth
DOI: https://doi.org/10.1007/978-3-030-78270-2_35
2021-06-30
Abstract: Illustrations are widely used in education, and sometimes, alternatives are not available for visually impaired students. Therefore, those students would benefit greatly from an automatic illustration description system, but only if those descriptions were complete, correct, and easily understandable using a screenreader. In this paper, we report on a study for the assessment of automated image descriptions. We interviewed experts to establish evaluation criteria, which we then used to create an evaluation questionnaire for sighted non-expert raters, and description templates. We used this questionnaire to evaluate the quality of descriptions which could be generated with a template-based automatic image describer. We present evidence that these templates have the potential to generate useful descriptions, and that the questionnaire identifies problems with description templates.
Human-Computer Interaction, Computer Vision and Pattern Recognition, Computers and Society
What problem does this paper attempt to address?
This paper addresses the accessibility problems that visually impaired students encounter with educational image resources. Because alternative texts (alt texts) and image descriptions are often missing, these students cannot make effective use of the image information in online educational resources. The paper investigates automatic image description technologies as a way to provide high-quality image descriptions and improve the students' learning experience. To this end, the authors carried out the following tasks:

1. **Requirement Analysis**: The authors interviewed three visually impaired experts with different backgrounds to understand what visually impaired users need from image descriptions, and derived the design principles for the description templates from those interviews.
2. **Description Template Design**: Based on the expert interviews, the authors designed structured description templates for four common image types (line charts and scatter plots, bar charts, node-link diagrams, and pie charts). The templates are in HTML, are easy to navigate with a screen reader, and present information at multiple levels, from an overview down to details.
3. **Evaluation Method Development**: The authors developed a structured questionnaire for evaluating the quality of the generated image descriptions. It examines whether a description is comprehensible and helps the reader form a mental image, and it assesses completeness and accuracy by comparing the description with the original image.
4. **Experimental Verification**: Non-expert volunteers compared the automatically generated image descriptions with best-example descriptions that served as the control group.
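To make the template idea in step 2 more concrete, the following is a minimal sketch of what an overview-to-details HTML description for a bar chart might look like. It is not the authors' template: the function name `describe_bar_chart`, its parameters, the heading levels, and the example data are all assumptions made for illustration. It only relies on standard HTML headings and table headers, which screen readers commonly use for navigation.

```python
from html import escape


def describe_bar_chart(title, x_label, y_label, bars):
    """Return an HTML description of a bar chart, from overview to details.

    Hypothetical helper for illustration only; `bars` is a list of
    (category, value) pairs. The paper's real templates are not reproduced.
    """
    if not bars:
        raise ValueError("bars must contain at least one (category, value) pair")

    highest = max(bars, key=lambda bar: bar[1])
    lowest = min(bars, key=lambda bar: bar[1])

    parts = [
        # Level 1: heading plus a one-sentence overview, so a reader can
        # decide early whether to continue.
        f"<h2>{escape(title)}</h2>",
        f"<p>Bar chart with {len(bars)} bars. The x-axis shows "
        f"{escape(x_label)}; the y-axis shows {escape(y_label)}.</p>",
        # Level 2: a short summary of the extremes.
        "<h3>Summary</h3>",
        f"<p>Highest: {escape(highest[0])} ({highest[1]}). "
        f"Lowest: {escape(lowest[0])} ({lowest[1]}).</p>",
        # Level 3: every data point in a table with header cells.
        "<h3>Data</h3>",
        "<table>",
        f'<tr><th scope="col">{escape(x_label)}</th>'
        f'<th scope="col">{escape(y_label)}</th></tr>',
    ]
    for category, value in bars:
        parts.append(
            f'<tr><th scope="row">{escape(category)}</th><td>{value}</td></tr>'
        )
    parts.append("</table>")
    return "\n".join(parts)


if __name__ == "__main__":
    # Made-up example data, not taken from the paper.
    example = [("2019", 120), ("2020", 95), ("2021", 140)]
    print(describe_bar_chart("Enrolment", "Year", "Students", example))
```

Putting the overview first and the full data table last is one way to let a reader stop as soon as they have enough information; the actual ordering and wording in the paper's templates may differ.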
The results show that for relatively simple chart types (such as bar charts and pie charts), the quality of the automatically generated descriptions is close to or even exceeds that of the control group, while for complex node-link diagrams, further optimization is required. In conclusion, this paper proposes a method that combines structured templates and computer vision technologies, aiming to provide high-quality image descriptions for visually impaired students and improve their online learning experience. The paper also points out the limitations of the current method in dealing with complex images and proposes future research directions.