Abstract:The use of question-based activities (QBAs) is wide-spread in education, traditionally forming an integral part of the learning and assessment process. In this paper, we design and evaluate an automated question generation tool for formative and summative assessment in schools. We present an expert survey of one hundred and four teachers, demonstrating the need for automated generation of QBAs, as a tool that can significantly reduce the workload of teachers and facilitate personalized learning experiences. Leveraging the recent advancements in generative AI, we then present a modular framework employing transformer based language models for automatic generation of multiple-choice questions (MCQs) from textual content. The presented solution, with distinct modules for question generation, correct answer prediction, and distractor formulation, enables us to evaluate different language models and generation techniques. Finally, we perform an extensive quantitative and qualitative evaluation, demonstrating trade-offs in the use of different techniques and models.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is to automatically generate text - based multiple - choice questions (MCQs) in education, so as to reduce the workload of teachers and promote personalized learning experiences. Specifically: 1. **Teacher Workload**: Traditional question - based activities (QBAs) are widely used in education, but preparing these activities is time - consuming and labor - intensive. By surveying 104 teachers, the paper reveals the challenges that teachers face when preparing QBAs, such as lack of time, lack of high - quality resources, and lack of personalized adaptive resources. 2. **Automated Generation Technology**: Utilize recent generative artificial intelligence (AI) technologies, especially Transformer - based language models (such as T5 and GPT - 3), to develop an automated MCQ generation tool. This tool can automatically generate multiple - choice questions from educational texts, including three main modules: question generation, correct - answer prediction, and distractor generation. 3. **System Evaluation**: Conduct quantitative and qualitative evaluations of the generated MCQs to verify the effectiveness of the system and the quality of the generated questions. Evaluation metrics include perplexity, query well - formedness, answer accuracy (exact match, F1 - score, ROUGE), and distractor quality. 4. **Modular Framework**: Propose a modular framework that allows independent development and optimization of each module and integrates different language models to meet various educational needs. This flexibility not only improves the overall quality of MCQs but also makes the system more robust, scalable, and highly customizable. In conclusion, this paper aims to help teachers reduce their workload while improving teaching quality and students' learning experiences by developing an efficient, reliable, and privacy - protected automatic MCQ generation system.

Automating question generation from educational text

Automated Question and Answer Generation from Texts using Text-to-Text Transformers

Automatic Question Generation for Repeated Testing to Improve Student Learning Outcome

Multiple-Choice Question Generation: Towards an Automated Assessment Framework

Automatic Question Paper Generator Using Artificial Intelligence

A Systematic Review of Automatic Question Generation for Educational Purposes

Generating Answer Candidates for Quizzes and Answer-Aware Question Generators

How Teachers Can Use Large Language Models and Bloom's Taxonomy to Create Educational Quizzes

Towards Process-Oriented, Modular, and Versatile Question Generation that Meets Educational Needs

Pre-Training With Scientific Text Improves Educational Question Generation

Eduassesspro: An Automated Quiz Generator and Exam Creator

Q-Module-Bot: A Generative AI-Based Question and Answer Bot for Module Teaching Support

Automatic question generation for literature review writing support

Automatic Generation of Short Answer Questions for Reading Comprehension Assessment

Automated Question Generation for Science Tests in Arabic Language Using NLP Techniques

It is AI's Turn to Ask Humans a Question: Question-Answer Pair Generation for Children's Story Books

Automatic True/False Question Generation for Educational Purpose

Question Generation for Adaptive Education

Automated Educational Question Generation at Different Bloom's Skill Levels using Large Language Models: Strategies and Evaluation

How Useful are Educational Questions Generated by Large Language Models?