Systematic Evaluation of Variability in ChIP-chip Experiments Using Predefined DNA Targets

David S. Johnson,Wei Li,D. Benjamin Gordon,Arindam Bhattacharjee,Bo Curry,Jayati Ghosh,Leonardo Brizuela,Jason S. Carroll,Myles Brown,Paul Flicek,Christoph M. Koch,Ian Dunham,Mark Bieda,Xiaoqin Xu,Peggy J. Farnham,Philipp Kapranov,David A. Nix,Thomas R. Gingeras,Xinmin Zhang,Heather Holster,Nan Jiang,Roland D. Green,Jun S. Song,Scott A. Mccuine,Elizabeth Anton,Loan Nguyen,Nathan D. Trinklein,Zhen Ye,Keith Ching,David Hawkins,Bing Ren,Peter C. Scacheri,Joel Rozowsky,Alexander Karpikov,Ghia Euskirchen,Sherman Weissman,Mark Gerstein,Michael Snyder,Annie Yang,Zarmik Moqtaderi,Heather Hirsch,Hennady P. Shulha,Yutao Fu,Zhiping Weng,Kevin Struhl,Richard M. Myers,Jason D. Lieb,X. Shirley Liu
DOI: https://doi.org/10.1101/gr.7080508
2008-01-01
Abstract:The most widely used method for detecting genome-wide protein-DNA interactions is chromatin immunoprecipitation on tiling microarrays, commonly known as ChIP-chip. Here, we conducted the first objective analysis of tiling array platforms, amplification procedures, and signal detection algorithms in a simulated ChIP-chip experiment. Mixtures of human genomic DNA and "spike-ins" comprised of nearly 100 human sequences at various concentrations were hybridized to four tiling array platforms by eight independent groups. Blind to the number of spike-ins, their locations, and the range of concentrations, each group made predictions of the spike-in locations. We found that microarray platform choice is not the primary determinant of overall performance. In fact, variation in performance between labs, protocols, and algorithms within the same array platform was greater than the variation in performance between array platforms. However, each array platform had unique performance characteristics that varied with tiling resolution and the number of replicates, which have implications for cost versus detection power. Long oligonucleotide arrays were slightly more sensitive at detecting very low enrichment. On all platforms, simple sequence repeats and genome redundancy tended to result in false positives. LM-PCR and WGA, the most popular sample amplification techniques, reproduced relative enrichment levels with high fidelity. Performance among signal detection algorithms was heavily dependent on array platform. The spike-in DNA samples and the data presented here provide a stable benchmark against which future ChIP platforms, protocol improvements, and analysis methods can be evaluated.
What problem does this paper attempt to address?