A Map of the Cis-Regulatory Sequences in the Mouse Genome

Yin Shen,Feng Yue,David F. McCleary,Zhen Ye,Lee Edsall,Samantha Kuan,Ulrich Wagner,Jesse Dixon,Leonard Lee,Victor V. Lobanenkov,Bing Ren
DOI: https://doi.org/10.1038/nature11243
IF: 64.8
2012-01-01
Nature
Abstract:A genomic map of nearly 300,000 potential cis-regulatory sequences determined from diverse mouse tissues and cell types reveals active promoters, enhancers and CCCTC-binding factor sites encompassing 11% of the mouse genome and significantly expands annotation of mammalian regulatory sequences. The identification of cis-regulatory sequences in the mouse genome has lagged behind that of other model organisms. Here, a genomic map of nearly 300,000 potential cis-regulatory sequences has been experimentally determined from diverse mouse tissues and cell types. The map reveals active promoters, enhancers and CTCF (CCCTC-binding factor) sites in nearly 11% of the mouse genome and significantly expands the annotation of mammalian regulatory sequences. The laboratory mouse is the most widely used mammalian model organism in biomedical research. The 2.6 × 109 bases of the mouse genome possess a high degree of conservation with the human genome1, so a thorough annotation of the mouse genome will be of significant value to understanding the function of the human genome. So far, most of the functional sequences in the mouse genome have yet to be found, and the cis-regulatory sequences in particular are still poorly annotated. Comparative genomics has been a powerful tool for the discovery of these sequences2, but on its own it cannot resolve their temporal and spatial functions. Recently, ChIP-Seq has been developed to identify cis-regulatory elements in the genomes of several organisms including humans, Drosophila melanogaster and Caenorhabditis elegans3,4,5. Here we apply the same experimental approach to a diverse set of 19 tissues and cell types in the mouse to produce a map of nearly 300,000 murine cis-regulatory sequences. The annotated sequences add up to 11% of the mouse genome, and include more than 70% of conserved non-coding sequences. We define tissue-specific enhancers and identify potential transcription factors regulating gene expression in each tissue or cell type. Finally, we show that much of the mouse genome is organized into domains of coordinately regulated enhancers and promoters. Our results provide a resource for the annotation of functional elements in the mammalian genome and for the study of mechanisms regulating tissue-specific gene expression.
What problem does this paper attempt to address?