Deep Learning in Spatially Resolved Transcriptomics: A Comprehensive Technical View

Roxana Zahedi Nasab,Mohammad Reza Eftekhariyan Ghamsari,Ahmadreza Argha,Callum Macphillamy,Amin Beheshti,Roohallah Alizadehsani,Nigel H. Lovell,Mohammad Lotfollahi,Hamid Alinejad-Rokny
DOI: https://doi.org/10.48550/arXiv.2210.04453
IF: 4.31
2022-10-10
Genomics
Abstract:Spatially resolved transcriptomics (SRT) has evolved rapidly through various technologies, enabling scientists to investigate both morphological contexts and gene expression profiling at single-cell resolution in parallel. SRT data are complex and multi-modal, comprising gene expression matrices, spatial information, and often high-resolution histology images. Because of this complexity and multi-modality, sophisticated computational algorithms are required to accurately analyze SRT data. Most efforts in this domain have been made to utilize conventional machine learning and statistical approaches exhibiting sub-optimal results due to the complicated nature of SRT datasets. To address these shortcomings, researchers have recently employed deep learning algorithms including various state-of-the-art methods mainly in spatial clustering, spatially variable gene identification, and alignment. In this paper, we provide an extensive methodological review of these deep learning methods and discuss their strengths and limitations. We also discuss the new frontiers, current challenges, limitations, and open questions in this field. Although researchers have put a great effort into developing deep learning-based models to analyze SRT data, some modifications are still needed to have more biologically aware models such as phylogeny-aware clustering, or treating small patches of histology images together. Collectively, appropriate strategies are still needed for batch effect removal, transformation and normalization, overdispersion and zero inflation patterns of gene expression in the analysis of SRT data with deep learning techniques. Also, we provided a comprehensive list of all available SRT databases that can be used as an extensive resource for future studies.
What problem does this paper attempt to address?