Revealing Gene Expression Heterogeneity in a Clonal Population of through Single-Cell RNA Sequencing

Hiroki Kojima,Akiko Kashiwagi,Takashi Ikegami
DOI: https://doi.org/10.1101/2023.08.06.551249
2024-03-17
Abstract:We performed single-cell RNA sequencing (scRNA-seq) on a population of 5,000 , using the 10x Genomics 3’ gene expression analysis, to investigate gene expression variability within this clonal population. Initially, we estimated the 3’-untranslated regions (3’ UTRs), which were absent in existing annotation files but are crucial for the 10x Genomics 3’ gene expression analysis, using the peaks2utr method. This allowed us to create a modified annotation file, which was then utilized in our scRNA-seq analysis. Our analysis revealed significant gene expression variability within the population, even after removing the effect of cell phase-related features. This variability predominantly appeared in six distinct clusters. Through gene ontology and KEGG pathway enrichment analyses, we identified that these were primarily associated with ribosomal proteins, proteins specific to mitochondria, proteins involved in peroxisome-specific carbon metabolism, cytoskeletal proteins, motor proteins, and immobilized antigens.
Microbiology
What problem does this paper attempt to address?
The problem this paper attempts to address is revealing the gene expression heterogeneity in clonal populations of *Tetrahymena thermophila* through single-cell RNA sequencing (scRNA-seq). Specifically, the researchers aim to solve this problem through the following steps: 1. **Annotating 3' Untranslated Regions (3' UTRs)**: Since existing annotation files lack information on 3' UTRs, which are crucial for 10x Genomics 3' gene expression analysis, the researchers used the peaks2utr method to estimate 3' UTRs and created a modified annotation file. 2. **Conducting Single-Cell RNA Sequencing**: Using the 10x Genomics 3' gene expression analysis method, the researchers performed single-cell RNA sequencing on approximately 5,000 *Tetrahymena* cells. 3. **Removing Cell Cycle-Related Effects**: To exclude the influence of cell cycle stages on gene expression variation, the researchers performed regression analysis using a set of known cell cycle-dependent genes. 4. **Clustering Analysis and Functional Enrichment Analysis**: By performing clustering analysis on the adjusted data, the researchers identified 6 major gene expression clusters and interpreted the functions of these clusters through Gene Ontology (GO) and KEGG pathway enrichment analysis. Through the above steps, the researchers hope to reveal the gene expression heterogeneity in clonal populations of *Tetrahymena* and explore the biological significance of this heterogeneity. This not only helps in understanding the gene expression regulation mechanisms in *Tetrahymena* but also provides a reference for single-cell level studies of other microorganisms.