Practical and Ethical Challenges of Large Language Models in Education: A Systematic Scoping Review

Lixiang Yan, Lele Sha, Linxuan Zhao, Yuheng Li, Roberto Martinez-Maldonado, Guanliang Chen, Xinyu Li, Yueqiao Jin, Dragan Gašević
DOI: https://doi.org/10.1111/bjet.13370
2023-07-22
Abstract: Educational technology innovations leveraging large language models (LLMs) have shown the potential to automate the laborious process of generating and analysing textual content. While various innovations have been developed to automate a range of educational tasks (e.g., question generation, feedback provision, and essay grading), there are concerns regarding the practicality and ethicality of these innovations. Such concerns may hinder future research and the adoption of LLM-based innovations in authentic educational contexts. To address this, we conducted a systematic scoping review of 118 peer-reviewed papers published since 2017 to pinpoint the current state of research on using LLMs to automate and support educational tasks. The findings revealed 53 use cases for LLMs in automating education tasks, categorised into nine main categories: profiling/labelling, detection, grading, teaching support, prediction, knowledge representation, feedback, content generation, and recommendation. We also identified several practical and ethical challenges, including low technological readiness, lack of replicability and transparency, and insufficient privacy and beneficence considerations. The findings were summarised into three recommendations for future studies: updating existing innovations with state-of-the-art models (e.g., GPT-3/4), embracing the initiative of open-sourcing models/systems, and adopting a human-centred approach throughout the developmental process. As the intersection of AI and education is continuously evolving, the findings of this study can serve as an essential reference point for researchers, allowing them to leverage the strengths, learn from the limitations, and uncover potential research opportunities enabled by ChatGPT and other generative AI models.
Computation and Language, Artificial Intelligence, Computers and Society
What problem does this paper attempt to address?
The paper addresses the practical and ethical challenges that arise when large language models (LLMs) are applied in educational technology. Although LLMs show considerable potential for automating the generation and analysis of textual content, concerns remain about the practicality and ethicality of these innovations, and such concerns may hinder future research and the adoption of LLMs in authentic educational contexts. Through a systematic scoping review, the paper therefore maps the current state of research and identifies the practical and ethical challenges of using LLMs to automate educational tasks. Specifically, it poses three research questions:

1. **RQ1**: What is the current state of research on using LLMs to automate educational tasks, particularly from the perspectives of educational tasks, stakeholders, LLMs, and machine learning tasks?
2. **RQ2**: What practical challenges do LLMs face when automating educational tasks, especially from the perspectives of technological readiness, model performance, and model replicability?
3. **RQ3**: What ethical challenges do LLMs face when automating educational tasks, particularly from the perspectives of system transparency, privacy, equity, and beneficence?

By answering these questions, the paper aims to give researchers a reference point from which to leverage the strengths of LLMs, learn from their limitations, and identify new research opportunities.