Abstract:Background: A large language model (LLM) is a machine learning model inferred from text data that captures subtle patterns of language use in context. Modern LLMs are based on neural network architectures that incorporate transformer methods. They allow the model to relate words together through attention to multiple words in a text sequence. LLMs have been shown to be highly effective for a range of tasks in natural language processing (NLP), including classification and information extraction tasks and generative applications. Objective: The aim of this adapted Delphi study was to collect researchers' opinions on how LLMs might influence health care and on the strengths, weaknesses, opportunities, and threats of LLM use in health care. Methods: We invited researchers in the fields of health informatics, nursing informatics, and medical NLP to share their opinions on LLM use in health care. We started the first round with open questions based on our strengths, weaknesses, opportunities, and threats framework. In the second and third round, the participants scored these items. Results: The first, second, and third rounds had 28, 23, and 21 participants, respectively. Almost all participants (26/28, 93% in round 1 and 20/21, 95% in round 3) were affiliated with academic institutions. Agreement was reached on 103 items related to use cases, benefits, risks, reliability, adoption aspects, and the future of LLMs in health care. Participants offered several use cases, including supporting clinical tasks, documentation tasks, and medical research and education, and agreed that LLM-based systems will act as health assistants for patient education. The agreed-upon benefits included increased efficiency in data handling and extraction, improved automation of processes, improved quality of health care services and overall health outcomes, provision of personalized care, accelerated diagnosis and treatment processes, and improved interaction between patients and health care professionals. In total, 5 risks to health care in general were identified: cybersecurity breaches, the potential for patient misinformation, ethical concerns, the likelihood of biased decision-making, and the risk associated with inaccurate communication. Overconfidence in LLM-based systems was recognized as a risk to the medical profession. The 6 agreed-upon privacy risks included the use of unregulated cloud services that compromise data security, exposure of sensitive patient data, breaches of confidentiality, fraudulent use of information, vulnerabilities in data storage and communication, and inappropriate access or use of patient data. Conclusions: Future research related to LLMs should not only focus on testing their possibilities for NLP-related tasks but also consider the workflows the models could contribute to and the requirements regarding quality, integration, and regulations needed for successful implementation in practice.

Evaluating large language models for use in healthcare: A framework for translational value assessment

Evaluating large language models in medical applications: a survey

A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics

Large language models in healthcare and medical domain: A review

Large language models in medicine: the potentials and pitfalls

A framework for human evaluation of large language models in healthcare derived from literature review

Critical Care Studies Using Large Language Models Based on Electronic Healthcare Records: A Technical Note

Attention is not all you need: the complicated case of ethically using large language models in healthcare and medicine

The long but necessary road to responsible use of large language models in healthcare research

A Comprehensive Survey on Evaluating Large Language Model Applications in the Medical Industry

Large language models in medical and healthcare fields: applications, advances, and challenges

Potential of Large Language Models in Health Care: Delphi Study

Testing and Evaluation of Health Care Applications of Large Language Models: A Systematic Review

Empathy and Equity: Key Considerations for Large Language Model Adoption in Health Care

The Role of Language Models in Modern Healthcare: A Comprehensive Review

Embracing Large Language Models for Medical Applications: Opportunities and Challenges

Ethical and regulatory challenges of large language models in medicine

Harnessing Large Language Models in Medical Research and Scientific Writing: A Closer Look to The Future