Abstract:

Background: Prompt engineering, the practice of crafting effective prompts for large language models (LLMs), has attracted attention for its ability to harness the potential of LLMs. It is even more crucial in the medical domain because of the domain's specialized terminology and highly technical language. Clinical natural language processing applications must handle complex language and ensure privacy compliance. Prompt engineering offers a novel approach by designing tailored prompts that guide models in exploiting clinically relevant information from complex medical texts. Despite its promise, the efficacy of prompt engineering in the medical domain remains to be fully explored.

Objective: This study aims to review research efforts and technical approaches in prompt engineering for medical applications and to provide an overview of opportunities and challenges for clinical practice.

Methods: We queried databases indexing the fields of medicine, computer science, and medical informatics to identify relevant published papers. Because prompt engineering is an emerging field, preprint databases were also considered. We extracted multiple data items, such as the prompt paradigm, the LLMs involved, the languages of the study, the domain of the topic, the baselines, and several learning, design, and architecture strategies specific to prompt engineering. We included studies published between 2022 and 2024 that apply prompt engineering–based methods to the medical domain, covering prompt paradigms such as prompt learning (PL), prompt tuning (PT), and prompt design (PD).

Results: We included 114 recent prompt engineering studies. Among the 3 prompt paradigms, PD is the most prevalent (78 papers). In 12 papers, the terms PD, PL, and PT were used interchangeably. ChatGPT is the most commonly used LLM, and we identified 7 studies that used it on a sensitive clinical data set. Chain-of-thought, present in 17 studies, emerges as the most frequent PD technique. While PL and PT papers typically provide a baseline for evaluating prompt-based approaches, 61% (48/78) of the PD studies do not report any nonprompt-related baseline. Finally, we individually examine each key item of prompt engineering–specific information reported across papers and find that many studies neglect to mention these items explicitly, which poses a challenge for advancing prompt engineering research.

Conclusions: In addition to reporting on trends and the scientific landscape of prompt engineering, we provide reporting guidelines for future studies to help advance research in the medical field. We also provide tables and figures summarizing the available medical prompt engineering papers and hope that future contributions will build on these existing works to advance the field.

Prompt engineering paradigms for medical applications: scoping review and recommendations for better practices
