DiffuST: a latent diffusion model for spatial transcriptomics denoising

Shaoqing Jiao,Dazhi Lu,Xi Zeng,Tao Wang,Yongtian Wang,Yunwei Dong,Jiajie Peng
DOI: https://doi.org/10.1101/2024.06.19.599672
2024-06-23
Abstract:Spatial transcriptomics technologies have enabled comprehensive measurements of gene expression profiles while retaining spatial information and matched pathology images. However, noise resulting from low RNA capture efficiency and experimental steps needed to keep spatial information may corrupt the biological signals and obstruct analyses. Here, we develop a latent diffusion model DiffuST to denoise spatial transcriptomics. DiffuST employs a graph autoencoder and a pre-trained model to extract different scale features from spatial information and pathology images. Then, a latent diffusion model is leveraged to map different scales of features to the same space for denoising. The evaluation based on various spatial transcriptomics datasets showed the superiority of DiffuST over existing denoising methods. Furthermore, the results demonstrated that DiffuST can enhance downstream analysis of spatial transcriptomics and yield significant biological insights.
Genomics
What problem does this paper attempt to address?
The paper attempts to address the issue of noise removal in spatial transcriptomics data to improve the accuracy of downstream analyses. Specifically, denoising becomes a necessary step because low RNA capture efficiency and noise in experimental procedures can disrupt biological signals and hinder analysis. The paper proposes a new method called DiffuST, which is a latent diffusion model designed to integrate spatial information and pathological image features to remove noise from spatial transcriptomics data. DiffuST achieves this goal through the following means: 1. **Utilizing graph autoencoders**: Extracting gene expression profiles and spatial location information and compressing them into latent representations. 2. **Utilizing pre-trained models**: Extracting high-level semantic features from pathological images. 3. **Latent diffusion model**: Mapping features of different scales into the same space for denoising. The paper validates the superiority of DiffuST through multiple spatial transcriptomics datasets and demonstrates its advantages in downstream tasks such as tissue structure recognition, cell-cell communication inference, and cell type deconvolution. Overall, DiffuST can significantly enhance the denoising effect of spatial transcriptomics data and provide valuable biological insights.