Exploring How Multiple Levels of GPT-Generated Programming Hints Support or Disappoint Novices

Ruiwei Xiao, Xinying Hou, John Stamper

DOI: https://doi.org/10.1145/3613905.3650937

2024-04-03

Abstract: Recent studies have integrated large language models (LLMs) into diverse educational contexts, including providing adaptive programming hints, a type of feedback that focuses on helping students move forward during problem-solving. However, most existing LLM-based hint systems are limited to a single hint type. To investigate whether and how different levels of hints can support students' problem-solving and learning, we conducted a think-aloud study with 12 novices using the LLM Hint Factory, a system providing four levels of hints from general natural language guidance to concrete code assistance, varying in format and granularity. We discovered that high-level natural language hints alone can be unhelpful or even misleading, especially when addressing next-step or syntax-related help requests. Adding lower-level hints, like code examples with in-line comments, can better support students. The findings open up future work on customizing help responses in content, format, and granularity to accurately identify and meet students' learning needs.

Human-Computer Interaction, Artificial Intelligence, Computers and Society

What problem does this paper attempt to address?

The paper asks how different levels of GPT-generated programming hints can support novices' problem-solving and learning. Most existing hint systems based on large language models (LLMs) offer only a single type of hint, so it is unclear whether and how hints at different levels help. To investigate this, the authors used a system called the "LLM Hint Factory," which provides four levels of hints ranging from general natural language guidance to concrete code assistance, varying in format and granularity. In a think-aloud study with 12 novices, they found that high-level natural language hints alone can be unhelpful or even misleading, especially for next-step or syntax-related help requests. Adding lower-level hints, such as code examples with in-line comments, can better support students. The findings point toward future help systems that tailor the content, format, and granularity of a response to the student's specific help-seeking context.

Howzat? Appealing to Expert Judgement for Evaluating Human and AI Next-Step Hints for Novice Programmers

Next-Step Hint Generation for Introductory Programming Using Large Language Models

One Step at a Time: Combining LLMs and Static Analysis to Generate Next-Step Hints for Programming Tasks

Automating Human Tutor-Style Programming Feedback: Leveraging GPT-4 Tutor Model for Hint Generation and GPT-3.5 Student Model for Hint Validation

Automatic Generation of Question Hints for Mathematics Problems using Large Language Models in Educational Technology

Large Language Models (GPT) for automating feedback on programming assignments

A Knowledge-Component-Based Methodology for Evaluating AI Assistants

Navigating Compiler Errors with AI Assistance -- A Study of GPT Hints in an Introductory Programming Course

The Impact of Looking Further Ahead: A Comparison of Two Data-driven Unsolicited Hint Types on Performance in an Intelligent Data-driven Logic Tutor

Exploring the Responses of Large Language Models to Beginner Programmers' Help Requests

Extending the Hint Factory for the assistance dilemma: A novel, data-driven HelpNeed Predictor for proactive problem-solving help

Hint-before-Solving Prompting: Guiding LLMs to Effectively Utilize Encoded Knowledge

Exploring the Potential of Large Language Models to Generate Formative Programming Feedback

The Continuous Hint Factory - Providing Hints in Vast and Sparsely Populated Edit Distance Spaces

Can LLMs plan paths with extra hints from solvers?

How Beginning Programmers and Code LLMs (Mis)read Each Other

AutoHint: Automatic Prompt Optimization with Hint Generation

Enhancing Computer Programming Education with LLMs: A Study on Effective Prompt Engineering for Python Code Generation

Feedback-Generation for Programming Exercises With GPT-4

Not the Silver Bullet: LLM-enhanced Programming Error Messages are Ineffective in Practice