Abstract:Objective. Vaccination has engendered a spectrum of public opinions, with social media acting as a crucial platform for health-related discussions. The emergence of artificial intelligence technologies, such as large language models (LLMs), offers a novel opportunity to efficiently investigate public discourses. This research assesses the accuracy of ChatGPT, a widely used and freely available service built upon an LLM, for sentiment analysis to discern different stances toward Human Papillomavirus (HPV) vaccination. Methods. Messages related to HPV vaccination were collected from social media supporting different message formats: Facebook (long format) and Twitter (short format). A selection of 1,000 human-evaluated messages was input into the LLM, which generated multiple response instances containing its classification results. Accuracy was measured for each message as the level of concurrence between human and machine decisions, ranging between 0 and 1. Results. Average accuracy was notably high when 20 response instances were used to determine the machine decision of each message: .882 (SE = .021) and .750 (SE = .029) for anti- and pro-vaccination long-form; .773 (SE = .027) and .723 (SE = .029) for anti- and pro-vaccination short-form, respectively. Using only three or even one instance did not lead to a severe decrease in accuracy. However, for long-form messages, the language model exhibited significantly lower accuracy in categorizing pro-vaccination messages than anti-vaccination ones. Conclusions. ChatGPT shows potential in analyzing public opinions on HPV vaccination using social media content. However, understanding the characteristics and limitations of a language model within specific public health contexts remains imperative.

Working With AI to Persuade: Examining a Large Language Model's Ability to Generate Pro-Vaccination Messages

Artificial intelligence for health message generation: an empirical study using a large language model (LLM) and prompt engineering

What generative AI means for trust in health communications

Accuracy of a Large Language Model in Distinguishing Anti- And Pro-vaccination Messages on Social Media: The Case of Human Papillomavirus Vaccination

The potential of generative AI for personalized persuasion at scale

Attention is not all you need: the complicated case of ethically using large language models in healthcare and medicine

Persuasion with Large Language Models: a Survey

On the Conversational Persuasiveness of Large Language Models: A Randomized Controlled Trial

The Persuasive Power of Large Language Models

The effect of source disclosure on evaluation of AI-generated messages: A two-part study

Use of large language models as a scalable approach to understanding public health discourse

Artificial Intelligence and Public Health: An Exploratory Study

Meat consumption and preparation, and genetic susceptibility in relation to colorectal adenomas.

ChatGPT and the rise of large language models: the new AI-driven infodemic threat in public health

Influence of believed AI involvement on the perception of digital medical advice

AI language models are transforming the medical writing space – like it or not!

Generative AI and medical ethics: the state of play

Synthetic Lies: Understanding AI-Generated Misinformation and Evaluating Algorithmic and Human Solutions

Generative Artificial Intelligence and Large Language Models in Primary Care Medical Education

Should we tweet this? Generative response modeling for predicting reception of public health messaging on Twitter

Large language models are changing landscape of academic publications. A positive transformation?