Perspectives on ENCODE

Michael P. Snyder,Thomas R. Gingeras,Jill E. Moore,Zhiping Weng,Mark B. Gerstein,Bing Ren,Ross C. Hardison,John A. Stamatoyannopoulos,Brenton R. Graveley,Elise A. Feingold,Michael J. Pazin,Michael Pagan,Daniel A. Gilchrist,Benjamin C. Hitz,J. Michael Cherry,Bradley E. Bernstein,Eric M. Mendenhall,Daniel R. Zerbino,Adam Frankish,Paul Flicek,Richard M. Myers,,Federico Abascal,Reyes Acosta,Nicholas J. Addleman,Jessika Adrian,Veena Afzal,Bronwen Aken,Rizi Ai,Jennifer A. Akiyama,Omar Al Jammal,Henry Amrhein,Stacie M. Anderson,Gregory R. Andrews,Igor Antoshechkin,Kristin G. Ardlie,Joel Armstrong,Matthew Astley,Budhaditya Banerjee,Amira A. Barkal,If H. A. Barnes,Iros Barozzi,Daniel Barrell,Gemma Barson,Daniel Bates,Ulugbek K. Baymuradov,Cassandra Bazile,Michael A. Beer,Samantha Beik,M. A. Bender,Ruth Bennett,Louis Philip Benoit Bouvrette,Andrew Berry,Anand Bhaskar,Alexandra Bignell,Steven M. Blue,David M. Bodine,Carles Boix,Nathan Boley,Tyler Borrman,Beatrice Borsari,Alan P. Boyle,Laurel A. Brandsmeier,Alessandra Breschi,Emery H. Bresnick,Jason A. Brooks,Michael Buckley,Christopher B. Burge,Rachel Byron,Eileen Cahill,Lingling Cai,Lulu Cao,Mark Carty,Rosa G. Castanon,Andres Castillo,Hassan Chaib,Esther T. Chan,Daniel R. Chee,Sora Chee,Hao Chen,Huaming Chen,Jia-Yu Chen,Songjie Chen,Surya B. Chhetri,Jyoti S. Choudhary,Jacqueline Chrast,Dongjun Chung,Declan Clarke,Neal A. L. Cody,Candice J. Coppola,Julie Coursen,Anthony M. D’Ippolito,Stephen Dalton,Cassidy Danyko,Claire Davidson,Jose Davila-Velderrain,Carrie A. Davis,Job Dekker,Alden Deran,Gilberto DeSalvo,Gloria Despacio-Reyes,Colin N. Dewey,Diane E. Dickel,Morgan Diegel,Mark Diekhans,Vishnu Dileep,Bo Ding,Sarah Djebali,Alexander Dobin,Daniel Dominguez,Sarah Donaldson,Jorg Drenkow,Timothy R. Dreszer,Yotam Drier,Michael O. Duff,Douglass Dunn,Catharine Eastman,Joseph R. Ecker,Matthew D. Edwards,Nicole El-Ali,Shaimae I. Elhajjajy,Keri Elkins,Andrew Emili,Charles B. Epstein,Rachel C. Evans,Iakes Ezkurdia,Kaili Fan,Peggy J. Farnham,Nina Farrell,Anne-Maud Ferreira,Katherine Fisher-Aylor,Stephen Fitzgerald,Chuan Sheng Foo,Kevin Fortier,Peter Freese,Shaliu Fu,Xiang-Dong Fu,Yu Fu,Yoko Fukuda-Yuzawa,Mariateresa Fulciniti,Alister P. W. Funnell,Idan Gabdank,Timur Galeev,Mingshi Gao,Carlos Garcia Giron,Tyler H. Garvin,Chelsea Anne Gelboin-Burkhart,Grigorios Georgolopoulos,Belinda M. Giardine,David K. Gifford,David M. Gilbert,Shawn Gillespie,Peng Gong,Alvaro Gonzalez,Jose M. Gonzalez,Peter Good,Alon Goren,David U. Gorkin,Michael Gray,Jack F. Greenblatt,Ed Griffiths,Mark T. Groudine,Fabian Grubert,Mengting Gu,Roderic Guigó,Hongbo Guo,Yu Guo,Yuchun Guo,Gamze Gursoy,Maria Gutierrez-Arcelus,Jessica Halow,Matthew Hardy,Manoj Hariharan,Arif Harmanci,Anne Harrington,Jennifer L. Harrow,Tatsunori B. Hashimoto,Richard D. Hasz,Meital Hatan,Eric Haugen,James E. Hayes,Peng He,Yupeng He,Nastaran Heidari,David Hendrickson,Elisabeth F. Heuston,Jason A. Hilton,Abigail Hochman,Cory Holgren,Lei Hou,Shuyu Hou,Yun-Hua E. Hsiao,Shanna Hsu,Hui Huang,Tim J. Hubbard,Jack Huey,Timothy R. Hughes,Toby Hunt,Sean Ibarrientos,Robbyn Issner,Mineo Iwata,Osagie Izuogu,Tommi Jaakkola,Nader Jameel,Camden Jansen,Lixia Jiang,Peng Jiang,Audra Johnson,Rory Johnson,Irwin Jungreis,Madhura Kadaba,Maya Kasowski,Mary Kasparian,Momoe Kato,Rajinder Kaul,Trupti Kawli,Michael Kay,Judith C. Keen,Sunduz Keles,Cheryl A. Keller,David Kelley,Manolis Kellis,Pouya Kheradpour,Daniel Sunwook Kim,Anthony Kirilusha,Robert J. Klein,Birgit Knoechel,Samantha Kuan,Michael J. Kulik,Sushant Kumar,Anshul Kundaje,Tanya Kutyavin,Julien Lagarde,Bryan R. Lajoie,Nicole J. Lambert,John Lazar,Ah Young Lee,Donghoon Lee,Elizabeth Lee,Jin Wook Lee,Kristen Lee,Christina S. Leslie,Shawn Levy,Bin Li,Hairi Li,Nan Li,Shantao Li,Xiangrui Li,Yang I. Li,Ying Li,Yining Li,Yue Li,Jin Lian,Maxwell W. Libbrecht,Shin Lin,Yiing Lin,Dianbo Liu,Jason Liu,Peng Liu,Tingting Liu,X. Shirley Liu,Yan Liu,Yaping Liu,Maria Long,Shaoke Lou,Jane Loveland,Aiping Lu,Yuheng Lu,Eric Lécuyer,Lijia Ma,Mark Mackiewicz,Brandon J. Mannion,Michael Mannstadt,Deepa Manthravadi,Georgi K. Marinov,Fergal J. Martin,Eugenio Mattei,Kenneth McCue,Megan McEown,Graham McVicker,Sarah K. Meadows,Alex Meissner,Christopher L. Messer,Wouter Meuleman,Clifford Meyer,Steve Miller,Matthew G. Milton,Tejaswini Mishra,Dianna E. Moore,Helen M. Moore,Samuel H. Moore,Jennifer Moran,Ali Mortazavi,Jonathan M. Mudge,Nikhil Munshi,Rabi Murad,Vivek Nandakumar,Preetha Nandi,Anil M. Narasimha,Aditi K. Narayanan,Hannah Naughton,Fabio C. P. Navarro,Patrick Navas,Jurijs Nazarovs,Jemma Nelson,Shane Neph,Fidencio Jun Neri,Joseph R. Nery,Amy R. Nesmith,J. Scott Newberry,Kimberly M. Newberry,Vu Ngo,Rosy Nguyen,Thai B. Nguyen,Tung Nguyen,Andrew Nishida,William S. Noble,Catherine S. Novak,Eva Maria Novoa,Briana Nuñez,Charles W. O’Donnell,Sara Olson,Kathrina C. Onate,Ericka Otterman,Hakan Ozadam,Tsultrim Palden,Xinghua Pan,Yongjin Park,E. Christopher Partridge,Benedict Paten,Florencia Pauli-Behn,Baikang Pei,Len A. Pennacchio,Alexander R. Perez,Emily H. Perry,Dmitri D. Pervouchine,Nishigandha N. Phalke,Quan Pham,Doug H. Phanstiel,Ingrid Plajzer-Frick,Gabriel A. Pratt,Henry E. Pratt,Sebastian Preissl,Jonathan K. Pritchard,Yuri Pritykin,Michael J. Purcaro,Qian Qin,Giovanni Quinones-Valdez,Ines Rabano,Ernest Radovani,Anil Raj,Nisha Rajagopal,Oren Ram,Lucia Ramirez,Ricardo N. Ramirez,Dylan Rausch,Soumya Raychaudhuri,Joseph Raymond,Rozita Razavi,Timothy E. Reddy,Thomas M. Reimonn,Alexandre Reymond,Alex Reynolds,Suhn K. Rhie,John Rinn,Miguel Rivera,Juan Carlos Rivera-Mulia,Brian Roberts,Jose Manuel Rodriguez,Joel Rozowsky,Russell Ryan,Eric Rynes,Denis N. Salins,Richard Sandstrom,Takayo Sasaki,Shashank Sathe,Daniel Savic,Alexandra Scavelli,Jonathan Scheiman,Christoph Schlaffner,Jeffery A. Schloss,Frank W. Schmitges,Lei Hoon See,Anurag Sethi,Manu Setty,Anthony Shafer,Shuo Shan,Eilon Sharon,Quan Shen,Yin Shen,Richard I. Sherwood,Minyi Shi,Sunyoung Shin,Noam Shoresh,Kyle Siebenthall,Cristina Sisu,Teri Slifer,Cricket A. Sloan,Anna Smith,Valentina Snetkova,Damek V. Spacek,Sharanya Srinivasan,Rohith Srivas,George Stamatoyannopoulos,Rebecca Stanton,Dave Steffan,Sandra Stehling-Sun,J. Seth Strattan,Amanda Su,Balaji Sundararaman,Marie-Marthe Suner,Tahin Syed,Matt Szynkarek,Forrest Y. Tanaka,Danielle Tenen,Mingxiang Teng,Jeffrey A. Thomas,Dave Toffey,Michael L. Tress,Diane E. Trout,Gosia Trynka,Junko Tsuji,Sean A. Upchurch,Oana Ursu,Barbara Uszczynska-Ratajczak,Mia C. Uziel,Alfonso Valencia,Benjamin Van Biber,Arjan G. van der Velde,Eric L. Van Nostrand,Yekaterina Vaydylevich,Jesus Vazquez,Alec Victorsen,Jost Vielmetter,Jeff Vierstra,Axel Visel,Anna Vlasova,Christopher M. Vockley,Simona Volpi,Shinny Vong,Hao Wang,Mengchi Wang,Qin Wang,Ruth Wang,Tao Wang,Wei Wang,Xiaofeng Wang,Yanli Wang,Nathaniel K. Watson,Xintao Wei,Zhijie Wei,Hendrik Weisser,Sherman M. Weissman,Rene Welch,Robert E. Welikson,Harm-Jan Westra,John W. Whitaker,Collin White,Kevin P. White,Andre Wildberg,Brian A. Williams,David Wine,Heather N. Witt,Barbara Wold,Maxim Wolf,James Wright,Rui Xiao,Xinshu Xiao,Jie Xu,Jinrui Xu,Koon-Kiu Yan,Yongqi Yan,Hongbo Yang,Xinqiong Yang,Yi-Wen Yang,Galip Gürkan Yardımcı,Brian A. Yee,Gene W. Yeo,Taylor Young,Tianxiong Yu,Feng Yue,Chris Zaleski,Chongzhi Zang,Haoyang Zeng,Weihua Zeng,Jie Zhai,Lijun Zhan,Ye Zhan,Bo Zhang,Jialing Zhang,Jing Zhang,Kai Zhang,Lijun Zhang,Peng Zhang,Qi Zhang,Xiao-Ou Zhang,Yanxiao Zhang,Zhizhuo Zhang,Yuan Zhao,Ye Zheng,Guoqing Zhong,Xiao-Qiao Zhou,Yun Zhu,Jared Zimmerman
DOI: https://doi.org/10.1038/s41586-020-2449-8
IF: 64.8
2020-07-29
Nature
Abstract:The Encylopedia of DNA Elements (ENCODE) Project launched in 2003 with the long-term goal of developing a comprehensive map of functional elements in the human genome. These included genes, biochemical regions associated with gene regulation (for example, transcription factor binding sites, open chromatin, and histone marks) and transcript isoforms. The marks serve as sites for candidate <i>cis</i>-regulatory elements (cCREs) that may serve functional roles in regulating gene expression<sup><a href="/articles/s41586-020-2449-8#ref-CR1">1</a></sup>. The project has been extended to model organisms, particularly the mouse. In the third phase of ENCODE, nearly a million and more than 300,000 cCRE annotations have been generated for human and mouse, respectively, and these have provided a valuable resource for the scientific community.
multidisciplinary sciences
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the comprehensive annotation of functional elements in the human genome. Specifically, since its launch in 2003, the ENCODE project has aimed to develop a comprehensive map of functional elements, which include genes, biochemical regions related to gene regulation (such as transcription factor binding sites, open chromatin regions, and histone marks), and transcriptional isoforms. These problems were particularly prominent at the beginning of the project because the understanding of the human genome was very limited at that time, especially with very little knowledge of specific elements in non - coding genes and regulatory regions. ### Main Objectives 1. **Comprehensively annotate functional elements in the human genome**: - Including genes, control elements, and transcriptional isoforms. - Extend to genome annotation in model organisms (such as mice). 2. **Identify candidate cis - regulatory elements (cCREs)**: - These elements may play a functional role in regulating gene expression. - Through a variety of high - throughput sequencing techniques (such as ChIP - seq, RNA - seq, ATAC - seq, etc.), a large number of cCRE annotations were generated on a large scale. 3. **Improve the understanding of the human genome**: - From the initial annotation to a richer, higher - resolution view. - Identify more transcriptional isoforms, long non - coding RNAs (lncRNAs), and potential regulatory regions. ### Specific Problems - **Insufficient understanding of the human genome in the early days**: - Although it is known that 5% of the genome is under purifying selection in placental mammals, knowledge of specific elements, especially non - coding genes and regulatory regions, is still limited. - **The need for technological development**: - New technologies and standards need to be developed to ensure the reproducibility and high quality of data. - **Data integration and standardization**: - Ensure that data from different laboratories and projects can be interoperable and can use unified standards and processes together. ### Solutions - **Multi - stage project implementation**: - Phase 1 (2003 - 2007): Evaluate emerging technologies and study 1% of the human genome. - Phase 2 (2007 - 2012): Introduce sequencing - based technologies to study the entire genome and transcriptome. - Phase 3 (2012 - 2017): Expand production, add new types of experiments, and reveal the landscape of RNA binding and chromatin 3D organization. - **High - quality data standards and processing procedures**: - Establish independent replicate experiments, use high - quality reagents, and formulate strict quality control standards. - Develop a unified data processing pipeline to ensure data consistency and reproducibility. - **Extensive cooperation and sharing**: - Cooperate with other large international projects, share data standards and processing methods, and improve data interoperability and value. Through these efforts, the ENCODE project has greatly enriched our understanding of the human genome and provided valuable resources to support a wide range of scientific research.