Deciphering the cis-regulatory landscape of natural yeast Transcript Leaders

Christina Akirtava,Gemma May,Joel Mcmanus
DOI: https://doi.org/10.1101/2024.07.03.601937
2024-07-06
Abstract:Protein synthesis is a vital process that is highly regulated at the initiation step of translation. Eukaryotic 5' transcript leaders (TLs) contain a variety of cis-regulatory features that influence translation and mRNA stability. However, the relative influences of these features in natural TLs are poorly characterized. To address this, we used massively parallel reporter assays (MPRAs) to quantify RNA levels, ribosome loading, and protein levels from 11,027 natural yeast TLs in vivo and systematically compared the relative impacts of their sequence features on gene expression. We found that yeast TLs influence gene expression over two orders of magnitude. While a leaky scanning model using Kozak contexts and uAUGs explained half of the variance in expression across transcript leaders, the addition of other features explained ~70% of gene expression variation. Our analyses detected key cis-acting sequence features, quantified their effects in vivo, and compared their roles to motifs reported from an in vitro study of ribosome recruitment. In addition, our work quantitated the effects of alternative transcription start site usage on gene expression in yeast. Thus, our study provides new quantitative insights into the roles of TL cis-acting sequences in regulating gene expression.
Genomics
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to understand how cis - regulatory features in natural yeast transcriptional initiators (5' TLs) affect gene expression. Specifically, the authors used the massively parallel reporter assay (MPRA) technology to quantify the RNA levels, ribosome loading, and protein levels of 11,027 natural yeast 5' TLs in vivo and systematically compared the relative effects of these sequence features on gene expression. ### Main research questions: 1. **The influence of 5' TLs on gene expression**: The authors hope to understand how various cis - regulatory features in 5' TLs (such as Kozak sequences, upstream open reading frames (uORFs), RNA structures, etc.) affect gene expression. 2. **The role of Kozak sequences**: Evaluate the importance of Kozak sequences in translation initiation, especially in the presence of multiple upstream AUGs (uAUGs). 3. **The influence of RNA structures**: Study how RNA secondary structures (such as g - quadruplexes) affect the efficiency of translation initiation. 4. **The influence of alternative transcription start sites**: Explore how the selection of different transcription start sites affects gene expression. ### Research methods: - **FACS - seq**: Used to assess protein levels. - **RNA - seq**: Used to assess mRNA levels. - **PoLib - seq**: Used to assess ribosome loading. - **Leaky Scanning Model (LSM)**: Combines the strength of Kozak sequences and the influence of uAUGs to predict gene expression. ### Main findings: - **The range of influence of 5' TLs on gene expression**: Natural yeast 5' TLs can affect gene expression by more than two orders of magnitude. - **The importance of Kozak sequences**: The strength of Kozak sequences explains approximately 50% of the variation in gene expression, and this proportion increases to approximately 70% when combined with other features. - **The influence of RNA structures**: RNA secondary structures (especially g - quadruplexes) significantly reduce protein expression. - **Alternative transcription start sites**: The selection of different transcription start sites has a significant impact on gene expression. Through these studies, the authors provide new quantitative insights, revealing the important role of cis - acting sequences in 5' TLs in regulating gene expression.