Abstract:Background: Health care organizations are collecting increasing volumes of clinical text data. Topic models are a class of unsupervised machine learning algorithms for discovering latent thematic patterns in these large unstructured document collections. Objective: We aimed to comparatively evaluate several methods for estimating temporal topic models using clinical notes obtained from primary care electronic medical records from Ontario, Canada. Methods: We used a retrospective closed cohort design. The study spanned from January 01, 2011, through December 31, 2015, discretized into 20 quarterly periods. Patients were included in the study if they generated at least 1 primary care clinical note in each of the 20 quarterly periods. These patients represented a unique cohort of individuals engaging in high-frequency use of the primary care system. The following temporal topic modeling algorithms were fitted to the clinical note corpus: nonnegative matrix factorization, latent Dirichlet allocation, the structural topic model, and the BERTopic model. Results: Temporal topic models consistently identified latent topical patterns in the clinical note corpus. The learned topical bases identified meaningful activities conducted by the primary health care system. Latent topics displaying near-constant temporal dynamics were consistently estimated across models (eg, pain, hypertension, diabetes, sleep, mood, anxiety, and depression). Several topics displayed predictable seasonal patterns over the study period (eg, respiratory disease and influenza immunization programs). Conclusions: Nonnegative matrix factorization, latent Dirichlet allocation, structural topic model, and BERTopic are based on different underlying statistical frameworks (eg, linear algebra and optimization, Bayesian graphical models, and neural embeddings), require tuning unique hyperparameters (optimizers, priors, etc), and have distinct computational requirements (data structures, computational hardware, etc). Despite the heterogeneity in statistical methodology, the learned latent topical summarizations and their temporal evolution over the study period were consistently estimated. Temporal topic models represent an interesting class of models for characterizing and monitoring the primary health care system.

Quality indices for topic model selection and evaluation: a literature review and case study

Investigating topic modeling techniques through evaluation of topics discovered in short texts data across diverse domains

Improving Automated Bug Triaging with Specialized Topic Model.

Evaluating Topic Quality with Posterior Variability

Comparison of Methods for Estimating Temporal Topic Models From Primary Care Clinical Text Data: Retrospective Closed Cohort Study

A Topic Coverage Approach to Evaluation of Topic Models

Can Topic Models Be Used in Research Evaluations? Reproducibility, Validity, and Reliability when Compared with Semantic Maps

A Review of Stability in Topic Modeling: Metrics for Assessing and Techniques for Improving Stability

Topic and Keyword Re-Ranking for LDA-based Topic Modeling

Using Topic Modeling Methods for Short-Text Data: A Comparative Analysis

An Iterative Approach to Topic Modelling

Qualitative Insights Tool (QualIT): LLM Enhanced Topic Modeling

Revisiting Automated Topic Model Evaluation with Large Language Models

Computer-Assisted Text Analysis for Social Science: Topic Models and Beyond

A systematic review of the use of topic models for short text social media analysis

The utility of topic modelling for discourse studies: A critical evaluation

Towards Evaluation of Cultural-scale Claims in Light of Topic Model Sampling Effects

Agreeing to Disagree: Choosing Among Eight Topic-Modeling Methods

Pseudo-document simulation for comparing LDA, GSDMM and GPM topic models on short and sparse text using Twitter data

Reliability of Topic Modeling

Exploring the Power of Topic Modeling Techniques in Analyzing Customer Reviews: A Comparative Analysis