Abstract: As Large Language Models (LLMs) gain in popularity, it is important to understand how novice programmers use them. We present a thematic analysis of 33 learners, aged 10-17, independently learning Python through 45 code-authoring tasks using Codex, an LLM-based code generator. We explore several questions about how learners used the code generator and analyze the properties of their written prompts and the generated code. Specifically, we explore (A) the context in which learners use Codex, (B) what learners ask of Codex, (C) properties of their prompts, in terms of relation to the task description, language, clarity, and prompt crafting patterns, (D) the correctness, complexity, and accuracy of the AI-generated code, and (E) how learners utilize AI-generated code in terms of placement, verification, and manual modifications. Furthermore, our analysis reveals four distinct coding approaches when writing code with an AI code generator: AI Single Prompt, where learners prompted Codex once to generate the entire solution to a task; AI Step-by-Step, where learners divided the problem into parts and used Codex to generate each part; Hybrid, where learners wrote some of the code themselves and used Codex to generate the rest; and Manual coding, where learners wrote the code themselves. The AI Single Prompt approach resulted in the highest correctness scores on code-authoring tasks, but the lowest correctness scores on subsequent code-modification tasks during training. Our results provide initial insight into how novice learners use AI code generators and into the challenges and opportunities associated with integrating them into self-paced learning environments. We conclude by discussing signs of over-reliance and self-regulation, as well as opportunities for curriculum and tool development.

AceCoder: An Effective Prompting Technique Specialized in Code Generation

Structured Chain-of-Thought Prompting for Code Generation

Towards Enhancing In-Context Learning for Code Generation

Promptly: Using Prompt Problems to Teach Learners How to Effectively Utilize AI Code Generators

EPiC: Cost-effective Search-based Prompt Engineering of LLMs for Code Generation

Genetic Auto-prompt Learning for Pre-trained Code Intelligence Language Models

Code Prompting: a Neural Symbolic Method for Complex Reasoning in Large Language Models

Enhancing Computer Programming Education with LLMs: A Study on Effective Prompt Engineering for Python Code Generation

Prompt Problems: A New Programming Exercise for the Generative AI Era

AgentCoder: Multi-Agent-based Code Generation with Iterative Testing and Optimisation

Prompt-based Code Completion via Multi-Retrieval Augmented Generation

Benchmarking and Explaining Large Language Model-based Code Generation: A Causality-Centric Approach

Code Generation Using Self-Interactive Assistant

Code-Aware Prompting: A study of Coverage Guided Test Generation in Regression Setting using LLM

What Makes Large Language Models Reason in (Multi-Turn) Code Generation?

How Novices Use LLM-Based Code Generators to Solve CS1 Coding Tasks in a Self-Paced Learning Environment

Advancing GenAI Assisted Programming--A Comparative Study on Prompt Efficiency and Code Quality Between GPT-4 and GLM-4

Prompting Techniques for Secure Code Generation: A Systematic Investigation

Bridging Code Semantic and LLMs: Semantic Chain-of-Thought Prompting for Code Generation