Decoding topologically associating domains with ultra-low resolution Hi-C data by graph structural entropy

Angsheng Li, Xianchen Yin, Bingxiang Xu, Danyang Wang, Jimin Han, Yi Wei, Yun Deng, Ying Xiong, Zhihua Zhang
DOI: https://doi.org/10.1038/s41467-018-05691-7
IF: 16.6
2018-08-15
Nature Communications
Abstract: Submegabase-size topologically associating domains (TADs) have been observed in high-throughput chromatin interaction data (Hi-C). However, accurate detection of TADs depends on ultra-deep sequencing and sophisticated normalization procedures. Here we propose a fast and normalization-free method to decode the domains of chromosomes (deDoc) that utilizes structural information theory. By treating the Hi-C contact matrix as a representation of a graph, deDoc partitions the graph into segments with minimal structural entropy. We show that structural entropy can also be used to determine the proper bin size of the Hi-C data. By applying deDoc to pooled Hi-C data from 10 single cells, we detect megabase-size TAD-like domains. This result implies that the modular structure of the genome spatial organization may be fundamental to even a small cohort of single cells. Our algorithms may facilitate systematic investigations of chromosomal domains on a larger scale than has hitherto been possible.
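The idea of scoring a partition of the contact graph by its structural entropy can be illustrated with a small sketch. The code below is not deDoc's implementation, and the abstract does not describe how deDoc searches for the minimizing partition. It only evaluates the two-dimensional structural entropy from structural information theory for a given split of bins into contiguous segments. The function name `structural_entropy`, the `boundaries` argument, the toy matrix, and the choice to ignore self-contacts on the diagonal are all assumptions made for illustration.

```python
import numpy as np


def structural_entropy(contacts, boundaries):
    """Two-dimensional structural entropy of a contiguous partition.

    contacts   : symmetric matrix of contact counts (treated as edge weights)
    boundaries : segment boundaries, e.g. [0, 3, 6] means bins 0-2 and 3-5
    """
    w = np.array(contacts, dtype=float)
    np.fill_diagonal(w, 0.0)          # assumption: self-contacts are ignored
    degree = w.sum(axis=1)            # weighted degree of each bin
    total = degree.sum()              # total volume of the graph (2m)

    entropy = 0.0
    for start, end in zip(boundaries[:-1], boundaries[1:]):
        d = degree[start:end]
        volume = d.sum()              # volume of this segment
        if volume == 0:
            continue                  # empty segment contributes nothing
        # Matrix sum inside the segment counts each internal edge twice,
        # so volume minus that sum is the weight of edges leaving the segment.
        cut = volume - w[start:end, start:end].sum()
        nz = d[d > 0]
        entropy -= np.sum(nz / total * np.log2(nz / volume))
        entropy -= cut / total * np.log2(volume / total)
    return entropy


# Hypothetical toy matrix: two dense 3-bin blocks with weak contacts between them.
toy = np.full((6, 6), 1.0)
toy[:3, :3] = 10.0
toy[3:, 3:] = 10.0

for name, bounds in [("true blocks", [0, 3, 6]),
                     ("shifted boundary", [0, 2, 6]),
                     ("single segment", [0, 6])]:
    print(name, round(structural_entropy(toy, bounds), 4))
```

On this toy matrix, the partition that matches the two dense blocks gets a lower score than either a shifted boundary or no boundary at all, which is the behavior a minimum-entropy segmentation relies on. Note that the sketch works directly on raw counts and applies no normalization step.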
multidisciplinary sciences