Abstract:Brain tumor is a fatal central nervous system disease that occurs in around 250,000 people each year globally and it is the second cause of cancer in children. It has been widely acknowledged that genetic factor is one of the significant risk factors for brain cancer. Thus, accurate descriptions of the locations of where the relative genes are active and how these genes express are critical for understanding the pathogenesis of brain tumor and for early detection. The Allen Developing Mouse Brain Atlas is a project on gene expression over the course of mouse brain development stages. Utilizing mouse models allows us to use a relatively homogeneous system to reveal the genetic risk factor of brain cancer. In the Allen atlas, about 435,000 high-resolution spatiotemporal in situ hybridization images have been generated for approximately 2,100 genes and currently the expression patterns over specific brain regions are manually annotated by experts, which does not scale with the continuously expanding collection of images. In this paper, we present an efficient computational approach to perform automated gene expression pattern annotation on brain images. First, the gene expression information in the brain images is captured by invariant features extracted from local image patches. Next, we adopt an augmented sparse coding method, called Stochastic Coordinate Coding, to construct high-level representations. Different pooling methods are then applied to generate gene-level features. To discriminate gene expression patterns at specific brain regions, we employ supervised learning methods to build accurate models for both binary-class and multi-class cases. Random undersampling and majority voting strategies are utilized to deal with the inherently imbalanced class distribution within each annotation task in order to further improve predictive performance. In addition, we propose a novel structure-based multi-label classification approach, which makes use of label hierarchy based on brain ontology during model learning. Extensive experiments have been conducted on the atlas and results show that the proposed approach produces higher annotation accuracy than several baseline methods. Our approach is shown to be robust on both binary-class and multi-class tasks and even with a relatively low training ratio. Our results also show that the use of label hierarchy can significantly improve the annotation accuracy at all brain ontology levels.

Unsupervised pattern identification in spatial gene expression atlas reveals mouse brain regions beyond established ontology

Assessing the replicability of spatial gene expression using atlas data from the adult mouse brain

Data-driven fine-grained region discovery in the mouse brain with transformers

Deep convolutional neural networks for annotating gene expression patterns in the mouse brain

A bayesian multivariate mixture model for high throughput spatial transcriptomics

Whole brain alignment of spatial transcriptomics between humans and mice with BrainAlign

Insights from spatially mapped gene expression in the mouse brain

Principled feature attribution for unsupervised gene expression analysis

Exploring brain transcriptomic patterns: a topological analysis using spatial expression networks

stLearn: integrating spatial location, tissue morphology and gene expression to find cell types, cell-cell interactions and spatial trajectories within undissociated tissues

Analysis of spatial-temporal gene expression patterns reveals dynamics and regionalization in developing mouse brain

SpatialSPM: statistical parametric mapping for the comparison of gene expression pattern images in multiple spatial transcriptomic datasets

Spatial Single-Cell Mapping of Transcriptional Differences Across Genetic Backgrounds in Mouse Brains

Machine Learning for Uncovering Biological Insights in Spatial Transcriptomics Data

A robust statistical approach for finding informative spatially associated pathways

Automated gene expression pattern annotation in the mouse brain

PROST: quantitative identification of spatially variable genes and domain detection in spatial transcriptomics

Transcriptome Architecture of Adult Mouse Brain Revealed by Sparse Coding of Genome-Wide In Situ Hybridization Images

Cell type-specific genes show striking and distinct patterns of spatial expression in the mouse brain

Identifying a ubiquitous gene expression variation pattern in the human brain

SpaNCMG: improving spatial domains identification of spatial transcriptomics using neighborhood-complementary mixed-view graph convolutional network