Abstract:Large Language Models (LLMs), such as ChatGPT, have demonstrated impressive capabilities in automatically generating code from provided natural language requirements. However, in real-world practice, it is inevitable that the requirements written by users might be ambiguous or insufficient. Current LLMs will directly generate programs according to those unclear requirements, regardless of interactive clarification, which will likely deviate from the original user intents. To bridge that gap, we introduce a novel framework named ClarifyGPT, which aims to enhance code generation by empowering LLMs with the ability to identify ambiguous requirements and ask targeted clarifying questions. Specifically, ClarifyGPT first detects whether a given requirement is ambiguous by performing a code consistency check. If it is ambiguous, ClarifyGPT prompts an LLM to generate targeted clarifying questions. After receiving question responses, ClarifyGPT refines the ambiguous requirement and inputs it into the same LLM to generate a final code solution. To evaluate our ClarifyGPT, we invite ten participants to use ClarifyGPT for code generation on two benchmarks: MBPP-sanitized and MBPP-ET. The results show that ClarifyGPT elevates the performance (Pass@1) of GPT-4 from 70.96% to 80.80% on MBPP-sanitized. Furthermore, to conduct large-scale automated evaluations of ClarifyGPT across different LLMs and benchmarks without requiring user participation, we introduce a high-fidelity simulation method to simulate user responses. The results demonstrate that ClarifyGPT can significantly enhance code generation performance compared to the baselines. In particular, ClarifyGPT improves the average performance of GPT-4 and ChatGPT across five benchmarks from 62.43% to 69.60% and from 54.32% to 62.37%, respectively. A human evaluation also confirms the effectiveness of ClarifyGPT in detecting ambiguous requirements and generating high-quality clarifying questions. We believe that ClarifyGPT can effectively facilitate the practical application of LLMs in real-world development environments.

ModelGPT: Unleashing LLM's Capabilities for Tailored Model Generation

Ladder: A Model-Agnostic Framework Boosting LLM-based Machine Translation to the Next Level

GPTA: Generative Prompt Tuning Assistant for Synergistic Downstream Neural Network Enhancement with LLMs

PatternGPT :A Pattern-Driven Framework for Large Language Model Text Generation

NetGPT: An AI-Native Network Architecture for Provisioning Beyond Personalized Generative Services

GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction

AutoML-GPT: Automatic Machine Learning with GPT

ClarifyGPT: A Framework for Enhancing LLM-Based Code Generation via Requirements Clarification

OMPGPT: A Generative Pre-trained Transformer Model for OpenMP

UniGen: A Unified Framework for Textual Dataset Generation Using Large Language Models

AcademicGPT: Empowering Academic Research

LLM4DS: Evaluating Large Language Models for Data Science Code Generation

NetGPT:An AI-Native Network Architecture for Provisioning Beyond Personalized Generative Services

CourseGPT-zh: an Educational Large Language Model Based on Knowledge Distillation Incorporating Prompt Optimization

GeneRAG: Enhancing Large Language Models with Gene-Related Task by Retrieval-Augmented Generation

Large Language Models as Data Preprocessors

GeneGPT: Augmenting Large Language Models with Domain Tools for Improved Access to Biomedical Information

SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding

3D-GPT: Procedural 3D Modeling with Large Language Models

GPT4AIGChip: Towards Next-Generation AI Accelerator Design Automation via Large Language Models

GeoCode-GPT: A Large Language Model for Geospatial Code Generation Tasks