Exploring the H2H Genes in 3D View

Mingming Zhao,Jianguo Zhou,Zifeng Wu,Wenyu Peng,Wei Zhou,Yu Liang
DOI: https://doi.org/10.1088/1755-1315/440/4/042079
2020-01-01
Abstract:It is estimated to contain hundreds of head-to-head (H2H) gene in the eukaryotic genomes, often with two genes in an H2H pair been prone to co-express and o-function. Therefore, h2h plays a crucial role in human disease control and there have been many studies on H2H gene and its related bidirectional promoters. Recent chromosome conformation capture techniques, such as Hi-C, and ChIA-PET have provided us with new opportunities to study H2H in 3D view. This paper proposes a powerful machine learning algorithm LightGBM to predict h2h cluster. Two sets of features, namely protein features and sequence features, are extracted. Then these two sets of features are used to train a classifier to predict h2h cluster. Experimental results show that this method can effectively predict h2h cluster. Our results show a large fraction of long-range H2H (TSS >1 kb) exist and help us to understand the H2H at 3D level.
What problem does this paper attempt to address?