Identifying transposable element expression dynamics and heterogeneity during development at the single-cell level with a processing pipeline scTE

Jiangping He,Isaac A. Babarinde,Li Sun,Shuyang Xu,Ruhai Chen,Junjie Shi,Yuanjie Wei,Yuhao Li,Gang Ma,Qiang Zhuang,Andrew P. Hutchins,Jiekai Chen
DOI: https://doi.org/10.1038/s41467-021-21808-x
IF: 16.6
2021-03-05
Nature Communications
Abstract: Transposable elements (TEs) make up a majority of a typical eukaryote’s genome, and contribute to cell heterogeneity in unclear ways. Single-cell sequencing technologies are powerful tools to explore cells; however, analysis is typically gene-centric and TE expression has not been addressed. Here, we develop a single-cell TE processing pipeline, scTE, and report the expression of TEs in single cells in a range of biological contexts. Specific TE types are expressed in subpopulations of embryonic stem cells and are dynamically regulated during pluripotency reprogramming, differentiation, and embryogenesis. Unexpectedly, TEs are expressed in somatic cells, including human disease-specific TEs that are undetectable in bulk analyses. Finally, we apply scTE to single-cell ATAC-seq data, and demonstrate that scTE can discriminate cell type using chromatin accessibility of TEs alone. Overall, our results classify the dynamic patterns of TEs in single cells and their contributions to cell heterogeneity.
multidisciplinary sciences
What problem does this paper attempt to address?
The problem this paper addresses is: **identifying the expression dynamics and heterogeneity of transposable elements (TEs) at the single-cell level**. Specifically, the authors developed a single-cell transposable element processing pipeline, scTE, to quantify and analyze TE expression patterns in single cells.

### Main problems and background

1. **The role and challenges of transposable elements (TEs)**:
   - TEs account for a large part of the typical eukaryotic genome and contribute to cell heterogeneity, but their mechanism of action remains unclear.
   - Although single-cell sequencing technology is powerful, analysis is usually gene-centric and leaves TE expression unexamined.
2. **Limitations of existing methods**:
   - Because TEs are highly repetitive and degenerate, traditional single-cell RNA-seq analysis methods cannot accurately quantify TE expression.
   - Most studies have overlooked the potential role of TEs in cell fate regulation.

### Goals of the paper

- **Develop a new tool (scTE)**: Quantify TE expression in single-cell sequencing data, solving the problem of TE quantification at the single-cell level.
- **Reveal the dynamic changes of TE expression**: Explore the expression patterns of TEs in different biological processes, such as pluripotency reprogramming, differentiation, and embryogenesis.
- **Analyze the contribution of TEs to cell heterogeneity**: Use single-cell data to reveal how TEs relate to cell-type-specific expression and cell fate determination.

### Specific problems

1. **Differences in TE expression in different cell types**: For example, in embryonic stem cells, certain TE types are expressed specifically in subpopulations of cells and are dynamically regulated during pluripotency reprogramming.
2. **TE expression in somatic cells**: TEs were found to be expressed not only in embryonic cells but also in somatic cells, including human disease-specific TEs that are undetectable in bulk analyses.
3. **The role of TEs in chromatin accessibility**: Applying scTE to single-cell ATAC-seq data showed that the chromatin accessibility of TEs alone can distinguish cell types.
4. **The role of TEs in the reprogramming process**: Study the expression patterns of TEs during the formation of induced pluripotent stem cells (iPSCs) and reveal their dynamic changes in different reprogramming pathways.

Through these studies, the authors aim to provide new perspectives and tools for understanding the role of TEs in cell heterogeneity and development.
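
To make the goal of quantifying TE expression in single cells more concrete, here is a toy sketch of building a per-cell count of reads by TE family. It is not scTE's implementation or API. It assumes reads are already aligned and reduced to `(cell_barcode, chrom, position)` tuples, that the TE annotation is a small list of non-overlapping half-open intervals, and that counts are collapsed to the family level. All names (`build_index`, `family_at`, `count_te_per_cell`) and the example data are hypothetical.

```python
from bisect import bisect_right
from collections import defaultdict


def build_index(annotation):
    """Map chrom -> (sorted starts, sorted intervals).

    Assumption: intervals are half-open [start, end) and do not overlap.
    """
    by_chrom = defaultdict(list)
    for chrom, start, end, family in annotation:
        by_chrom[chrom].append((start, end, family))
    index = {}
    for chrom, intervals in by_chrom.items():
        intervals.sort()
        index[chrom] = ([s for s, _, _ in intervals], intervals)
    return index


def family_at(index, chrom, pos):
    """Return the TE family whose interval covers pos, or None."""
    if chrom not in index:
        return None
    starts, intervals = index[chrom]
    i = bisect_right(starts, pos) - 1  # last interval starting at or before pos
    if i < 0:
        return None
    start, end, family = intervals[i]
    return family if start <= pos < end else None


def count_te_per_cell(reads, annotation):
    """Count reads per (cell barcode, TE family); reads outside TEs are skipped."""
    index = build_index(annotation)
    counts = defaultdict(lambda: defaultdict(int))
    for barcode, chrom, pos in reads:
        family = family_at(index, chrom, pos)
        if family is not None:
            counts[barcode][family] += 1
    return {bc: dict(fams) for bc, fams in counts.items()}


if __name__ == "__main__":
    # Hypothetical annotation and reads, for illustration only.
    annotation = [
        ("chr1", 100, 200, "L1"),
        ("chr1", 500, 800, "ERVK"),
        ("chr2", 50, 150, "L1"),
    ]
    reads = [
        ("AAAC", "chr1", 150),
        ("AAAC", "chr2", 60),
        ("AAAC", "chr1", 300),  # falls outside any TE
        ("TTTG", "chr1", 799),
    ]
    print(count_te_per_cell(reads, annotation))
```

Collapsing to the family level is one simple way to sidestep the ambiguity of assigning reads to individual repetitive copies. A real pipeline would also need to handle multi-mapped reads, UMIs, and overlap with gene annotations, none of which this sketch attempts.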