Abstract:Background: The COVID-19 pandemic was a "wake up" call for public health agencies. Often, these agencies are ill-prepared to communicate with target audiences clearly and effectively for community-level activations and safety operations. The obstacle is a lack of data-driven approaches to obtaining insights from local community stakeholders. Thus, this study suggests a focus on listening at local levels given the abundance of geo-marked data and presents a methodological solution to extracting consumer insights from unstructured text data for health communication. Methods: This study demonstrates how to combine human and Natural Language Processing (NLP) machine analyses to reliably extract meaningful consumer insights from tweets about COVID and the vaccine. This case study employed Latent Dirichlet Allocation (LDA) topic modeling, Bidirectional Encoder Representations from Transformers (BERT) emotion analysis, and human textual analysis and examined 180,128 tweets scraped by Twitter Application Programming Interface's (API) keyword function from January 2020 to June 2021. The samples came from four medium-sized American cities with larger populations of people of color. Results: The NLP method discovered four topic trends: "COVID Vaccines," "Politics," "Mitigation Measures," and "Community/Local Issues," and emotion changes over time. The human textual analysis profiled the discussions in the selected four markets to add some depth to our understanding of the uniqueness of the different challenges experienced. Conclusions: This study ultimately demonstrates that our method used here could efficiently reduce a large amount of community feedback (e.g., tweets, social media data) by NLP and ensure contextualization and richness with human interpretation. Recommendations on communicating vaccination are offered based on the findings: (1) the strategic objective should be empowering the public; (2) the message should have local relevance; and, (3) communication needs to be timely.

Identifying informative tweets during a pandemic via a topic-aware neural language model

Not-NUTs at W-NUT 2020 Task 2: A BERT-based System in Identifying Informative COVID-19 English Tweets

Machine Learning Techniques for Sentiment Analysis of COVID-19-Related Twitter Data

Public discourse and sentiment during the COVID 19 pandemic: Using Latent Dirichlet Allocation for topic modeling on Twitter

A Dynamic Topic Identification and Labeling Approach of COVID-19 Tweets

Enhanced Sentiment Analysis and Topic Modeling During the Pandemic Using Automated Latent Dirichlet Allocation

Leveraging Natural Language Processing to Mine Issues on Twitter During the COVID-19 Pandemic

Testing the Generalization of Neural Language Models for COVID-19 Misinformation Detection

Transformer-Based Language Model Fine-Tuning Methods for COVID-19 Fake News Detection

Sentimental analysis of COVID-19 twitter data using deep learning and machine learning models

Exploring a Hybrid Deep Learning Framework to Automatically Discover Topic and Sentiment in COVID-19 Tweets

AI Assisted Attention Mechanism for Hybrid Neural Model to Assess Online Attitudes About COVID-19

A Domain-Agnostic Neurosymbolic Approach for Big Social Data Analysis: Evaluating Mental Health Sentiment on Social Media during COVID-19

Public Opinion About COVID-19 on a Microblog Platform in China: Topic Modeling and Multidimensional Sentiment Analysis of Social Media

An improved BERT method for the evolution of network public opinion of major infectious diseases: Case Study of COVID-19

A case study of using natural language processing to extract consumer insights from tweets in American cities for public health crises

An attention-based hybrid model for spatial and temporal sentiment analysis of COVID-19 related tweets in the contiguous United States

Twitter discussions and emotions about COVID-19 pandemic: a machine learning approach

Enhancing public health response: a framework for topics and sentiment analysis of COVID-19 in the UK using Twitter and the embedded topic model

Deep learning for COVID-19 topic modelling via Twitter: Alpha, Delta and Omicron

Dynamic topic modelling for exploring the scientific literature on coronavirus: an unsupervised labelling technique