Genomic Survey on Distribution of G-quadruplex Motifs and Their Functional Implications in Bombyx mori

Feng Wu,Hui Xiang,Qili Feng
DOI: https://doi.org/10.13441/j.cnki.cykx.2018.01.004
2018-01-01
Abstract:G-quadruplex (G4) is a kind of special four-strand structure different from the double helix structure of DNA.It is formed from guanine-rich sequences,which is stabled by monovalent cation.In mammals,G4 motif has been proven to be an epigenetic element with important biological functions.Quadparser program was run to predict G4 motifs in whole genome of the Lepidoptera model insect silkworm (Bombyx mori).Furthermore,distribution features of G4 motifs and regulatory effects of G4 motifs on target gene expression and function were preliminarily analyzed.Totally 6 278 G4 motifs were identified in silkworm whole genome.63.5% of them are located in transposable element regions,and 35.3% of them are distributed in coding gene regions.There are relatively enriched G4 structures near the 5′ flanking region of transcription initiation site and the 3′ transcriptional termination site,suggesting that G4 structures may play roles in regulation of gene expression.Compared to the genomic background,genes harboring G4 motifs at the 5′ flanking regions tend to have shorter coding regions while those with G4 motifs at 3′ flanking regions tend to have longer coding regions.Furthermore,genes with 5′ flanking sequence bearing G-quadruplex are enriched in molecular function of nucleic acid binding especially of transcription factor activity.These genes are mainly involved in regulation of nucleic acid metabolism related processes.Their G4 structures are mainly located on coding strand.Genes with 3′ flanking sequence bearing G-quadruplex are mainly enriched in molecular function of kinase,transferase and receptor activities.These genes are mainly involved in protein processing,phosphorylation modification and signal transduction.Their G4 structures are mainly located on template strand.The above results suggest that G4 structures located upstream or downstream of genes have different regulatory roles on target genes,and their functioning mechanisms may also be different.Combined analysis with the microarray data of silkworm,we found that genes with G4 motifs didn't show obvious tissue expression specificity,indicating that G4 motif regulated genes are involved in wide range of biological processes.This preliminary investigation provides important clues and references for further study on the biological function of this epigenetic genetic structure in silkworm.
What problem does this paper attempt to address?