High-Resolution Spatial Transcriptomics from Histology Images using HisToSGE

Zhiceng Shi,Shuailin Xue,Fangfang Zhu,Wenwen Min
2024-07-30
Abstract:Spatial transcriptomics (ST) is a groundbreaking genomic technology that enables spatial localization analysis of gene expression within tissue sections. However, it is significantly limited by high costs and sparse spatial resolution. An alternative, more cost-effective strategy is to use deep learning methods to predict high-density gene expression profiles from histological images. However, existing methods struggle to capture rich image features effectively or rely on low-dimensional positional coordinates, making it difficult to accurately predict high-resolution gene expression profiles. To address these limitations, we developed HisToSGE, a method that employs a Pathology Image Large Model (PILM) to extract rich image features from histological images and utilizes a feature learning module to robustly generate high-resolution gene expression profiles. We evaluated HisToSGE on four ST datasets, comparing its performance with five state-of-the-art baseline methods. The results demonstrate that HisToSGE excels in generating high-resolution gene expression profiles and performing downstream tasks such as spatial domain identification. All code and public datasets used in this paper are available at <a class="link-external link-https" href="https://github.com/wenwenmin/HisToSGE" rel="external noopener nofollow">this https URL</a> and <a class="link-external link-https" href="https://zenodo.org/records/12792163" rel="external noopener nofollow">this https URL</a>.
Image and Video Processing,Artificial Intelligence,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the incomplete or missing gene expression information in Spatial Transcriptomics (ST) due to high cost and low spatial resolution. Specifically, although the existing ST techniques can achieve spatial localization analysis of gene expression, their high cost and sparse spatial resolution limit their wide application. In addition, the existing deep - learning - based methods have difficulties in predicting high - density gene expression profiles from histological images, because these methods are difficult to effectively capture rich image features or rely on low - dimensional position coordinates, thus it is difficult to accurately predict high - resolution gene expression profiles. To overcome these problems, the authors developed a new method named HisToSGE. This method uses the Pathology Image Large Model (PILM) to extract rich image features from histological images and generates high - resolution gene expression profiles through the feature learning module. Through this method, HisToSGE aims to improve the resolution and accuracy of gene expression prediction while reducing the experimental cost. In the paper, the authors verified the effectiveness of HisToSGE through experiments on multiple ST datasets and compared it with existing methods, showing its superior performance in generating high - resolution gene expression profiles.