Whole-Genome Bisulfite Sequencing Protocol for the Analysis of Genome-Wide DNA Methylation and Hydroxymethylation Patterns at Single-Nucleotide Resolution

David Derbala,Abel Garnier,Eric Bonnet,Jean-François Deleuze,Jörg Tost
DOI: https://doi.org/10.1007/978-1-0716-4051-7_18
Abstract:The analysis of genome-wide epigenomic alterations including DNA methylation and hydroxymethylation has become a subject of intensive research for many biological and clinical questions. DNA methylation analysis bears the particular promise to supplement or replace biochemical and imaging-based tests for the next generation of personalized medicine. Whole-genome bisulfite sequencing (WGBS) using next-generation sequencing technologies is currently considered the gold standard for a comprehensive and quantitative analysis of DNA methylation throughout the genome. However, bisulfite conversion does not allow distinguishing between cytosine methylation and hydroxymethylation requiring an additional chemical or enzymatic step to identify hydroxymethylated cytosines. Here, we provide a detailed protocol based on a commercial kit for the preparation of sequencing libraries for the comprehensive whole-genome analysis of DNA methylation and/or hydroxymethylation. The protocol is based on the construction of sequencing libraries from limited amounts of input DNA by ligation of methylated adaptors to the fragmented DNA prior to bisulfite conversion. For analyses requiring a quantitative distinction between 5-methylcytosine and 5-hydroxymethylcytosines levels, an oxidation step is included in the same workflow to perform oxidative bisulfite sequencing (OxBs-Seq). In this case, two sequencing libraries will be generated and sequenced: a classic methylome following bisulfite conversion and analyzing modified cytosines (not distinguishing between methylated and hydroxymethylated cytosines) and a methylome analyzing only methylated cytosines, respectively. Hydroxymethylation levels are deduced from the differences between the two reactions. We also provide a step-by-step description of the data analysis using publicly available bioinformatic tools. The described protocol has been successfully applied to different human and plant samples and yields robust and reproducible results.
What problem does this paper attempt to address?