Abstract: Recent progress in multiplexed tissue imaging is advancing the study of tumor microenvironments and improving our understanding of treatment response and disease progression. Cellular neighborhood analysis is a popular computational approach for analyzing these complex image data. Despite its popularity, it faces significant challenges, including high computational demands that limit its feasibility for large-scale applications and the lack of a principled strategy for integrative analysis across images. The lack of such a strategy hampers the precise and consistent identification of spatial features and the tracking of their dynamics over disease progression. To overcome these challenges, we introduce SpaTopic, a spatial topic model designed to decode high-level spatial architecture across multiplexed tissue images. The algorithm integrates cell type and spatial information within a topic modeling framework, an approach originally developed for natural language processing and later adapted for computer vision. Spatial information is incorporated through a flexible design of documents, which represent densely overlapping regions in images. The model is fitted with an efficient collapsed Gibbs sampling algorithm. We benchmarked SpaTopic against five state-of-the-art algorithms in case studies spanning different single-cell spatial transcriptomic and proteomic imaging platforms and different tissue types. Our findings demonstrate that SpaTopic consistently identifies biologically and clinically significant spatial topics, such as tertiary lymphoid structures (TLSs), and tracks dynamic changes in spatial features over disease progression. Its computational efficiency and broad applicability across molecular imaging platforms will enhance the analysis of large-scale tissue imaging datasets.
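To make the idea of a topic model over overlapping image regions concrete, the following is a minimal sketch, not the SpaTopic implementation. It assumes that each document is a fixed window of the k nearest cells around a randomly chosen anchor cell, that each cell's type plays the role of a word, and that topics are inferred with the standard collapsed Gibbs sampler for latent Dirichlet allocation. The helper names (`build_documents`, `collapsed_gibbs_lda`), the synthetic two-region tissue, and all hyperparameter values are hypothetical choices for illustration only.

```python
import numpy as np

def build_documents(coords, n_anchors, k, rng):
    """Hypothetical helper: one document = the k cells nearest to a random anchor cell.
    Windows around nearby anchors overlap, so a cell can belong to several documents."""
    anchors = rng.choice(len(coords), size=n_anchors, replace=False)
    docs = []
    for a in anchors:
        sq_dist = np.sum((coords - coords[a]) ** 2, axis=1)
        docs.append(np.argsort(sq_dist)[:k])
    return docs

def collapsed_gibbs_lda(docs, cell_types, n_types, n_topics, n_iter, alpha, beta, rng):
    """Standard collapsed Gibbs sampling for LDA, with cell types as words."""
    doc_ids = np.concatenate([np.full(len(d), i) for i, d in enumerate(docs)])
    words = np.concatenate([cell_types[d] for d in docs])
    z = rng.integers(n_topics, size=len(words))  # random initial topic per token

    # Count tables: document-topic, topic-word, and per-topic totals.
    n_dk = np.zeros((len(docs), n_topics))
    n_kw = np.zeros((n_topics, n_types))
    n_k = np.zeros(n_topics)
    np.add.at(n_dk, (doc_ids, z), 1)
    np.add.at(n_kw, (z, words), 1)
    np.add.at(n_k, z, 1)

    for _ in range(n_iter):
        for i in range(len(words)):
            d, w, k_old = doc_ids[i], words[i], z[i]
            # Remove the token's current assignment from the counts.
            n_dk[d, k_old] -= 1
            n_kw[k_old, w] -= 1
            n_k[k_old] -= 1
            # Conditional topic probabilities with parameters integrated out.
            p = (n_dk[d] + alpha) * (n_kw[:, w] + beta) / (n_k + n_types * beta)
            k_new = rng.choice(n_topics, p=p / p.sum())
            z[i] = k_new
            n_dk[d, k_new] += 1
            n_kw[k_new, w] += 1
            n_k[k_new] += 1

    # Posterior mean estimates from the final counts.
    topic_type = (n_kw + beta) / (n_k[:, None] + n_types * beta)
    doc_topic = (n_dk + alpha) / (n_dk.sum(axis=1, keepdims=True) + n_topics * alpha)
    return topic_type, doc_topic

# Synthetic tissue (illustrative only): cell types 0/1 on the left, 2/3 on the right.
rng = np.random.default_rng(0)
coords = rng.uniform(0.0, 1.0, size=(400, 2))
left = coords[:, 0] < 0.5
cell_types = np.where(left, rng.integers(0, 2, size=400), rng.integers(2, 4, size=400))

docs = build_documents(coords, n_anchors=40, k=20, rng=rng)
topic_type, doc_topic = collapsed_gibbs_lda(
    docs, cell_types, n_types=4, n_topics=2, n_iter=50, alpha=0.5, beta=0.1, rng=rng
)
print(np.round(topic_type, 2))
```

Each row of `topic_type` is a distribution over cell types for one topic, and each row of `doc_topic` gives the topic mixture of one spatial window. Fixing the windows in advance keeps the sketch short; a model in which a cell's membership in a region is itself inferred would need an additional sampling step that is not shown here.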

Decoding Spatial Tissue Architecture: A Scalable Bayesian Topic Model for Multiplexed Imaging Analysis
