Characterization of DNA Methylation in Short Tandem Repeats in Human Genome

Lifang Hou*,Zhou Zhang,Yinan Zheng,Wei Zhang,Xu Zhang,Xiao Zhang,Andrea Baccarelli,Tao Gao,Jane Hoppin
DOI: https://doi.org/10.1289/isee.2014.p2-520
2014-01-01
ISEE Conference Abstracts
Abstract:Background: DNA methylation plays a critical role in the DNA regulation, which occurs mainly at the C5 position of CpG dinucleotides, and can be affected by environmental pollutants. Short Tandem Repeats (STRs) are repeating sequences of 2-6 base pairs of DNA in human genome. DNA methylation in certain STRs has been shown to contribute to the development of various diseases, including cancer. However, the characterization of DNA methylation in STRs throughout the human genome remains largely unknown. Aims: To characterize STRs cross the human genome using the Illumina HumanMethylation450 BeadChip (450K array), and estimate the correlation of DNA methylation with STRs. Methods: To locate the STRs associated with 450K probes, we adopted the 450K array to examine DNA methylation level throughout the genome using data from 80 healthy pesticide applicators, and retrieved STRs in the 1050 bps DNA region which included both 500 bps upstream and downstream DNA sequences of each probe. For each probe, we obtained the STRs factors, including counts of CG and GC, STRs repeats size, copy number, and distance to the probe. Based on their repeat size (2 to 6), these probes were divided into 5 groups, and tested with Wilcoxon Rank-Sum Test to examine the methylation level of each group against the rest. For each group, we utilized linear regression analyses to determine the association of DNA methylation level with these STRs factors. Results: We found 29,528 probes with STRs, and 27,446 of them were kept for analysis. We observed significantly differences in methylation level of each repeat size group against the rest (P-values were 2.2E-16, 2.2E-16, 2.2E-16, 3.803E-05, and 2.2E-16, respectively) and DNA methylation level was associated with the CG and GC counts, distance between STRs and probes, copy number and the repeat size. Conclusion: The CG and GC counts, copy number, repeat size and the distance to probes of STRs may determine their DNA methylation level.
What problem does this paper attempt to address?