NCBI GEO: archive for high-throughput functional genomic data
Tanya Barrett, Dennis B. Troup, Stephen E. Wilhite, Pierre Ledoux, Dmitry Rudnev, Carlos Evangelista, Irene F. Kim, Alexandra Soboleva, Maxim Tomashevsky, Kimberly A. Marshall, Katherine H. Phillippy, Patti M. Sherman, Rolf N. Muertter, Ron Edgar
DOI: https://doi.org/10.1093/nar/gkn764
IF: 14.9
2009-01-01
Nucleic Acids Research
Abstract: The Gene Expression Omnibus (GEO) at the National Center for Biotechnology Information (NCBI) is the largest public repository for high-throughput gene expression data. Additionally, GEO hosts other categories of high-throughput functional genomic data, including those that examine genome copy number variations, chromatin structure, methylation status and transcription factor binding. These data are generated by the research community using high-throughput technologies like microarrays and, more recently, next-generation sequencing. The database has a flexible infrastructure that can capture fully annotated raw and processed data, enabling compliance with major community-derived scientific reporting standards such as 'Minimum Information About a Microarray Experiment' (MIAME). In addition to serving as a centralized data storage hub, GEO offers many tools and features that allow users to effectively explore, analyze and download expression data from both gene-centric and experiment-centric perspectives. This article summarizes the GEO repository structure, content and operating procedures, as well as recently introduced data mining features. GEO is freely accessible at http://www.ncbi.nlm.nih.gov/geo/.
biochemistry & molecular biology