Read Mapping for Hi-C Analysis

Simon Thomas Kelly,Kugui Tanaka,Chiaki Hosaka,Satoshi Yuhara
DOI: https://doi.org/10.1007/978-1-0716-4136-1_3
Abstract:Hi-C is a popular ligation-based technique to detect 3D physical chromosome structure within the nucleus using cross-linking and next-generation sequencing. As an unbiased genome-wide assay based on chromosome conformation capture, it provides rich insights into chromosome structure, dynamic chromosome folding and interactions, and the regulatory state of a cell. Bioinformatics analyses of Hi-C data require dedicated protocols as most genome alignment tools assume that both paired-end reads will map to the same chromosome, resulting in large two-dimensional matrices as processed data. Here, we outline the necessary steps to generate high-quality aligned Hi-C data by separately mapping each read while correcting for biases from restriction enzyme digests. We introduce our own custom open-source pipeline, which enables users to select an aligner of their choosing with high accuracy and performance. This enables users to generate high-resolution datasets with fast turnaround and fewer unmapped reads. Finally, we discuss recent innovations in experimental techniques, bioinformatics techniques, and their applications in clinical testing for diagnostics.
What problem does this paper attempt to address?