LoopNetica: Predicting Chromatin Loops Using Convolutional Neural Networks and Attention Mechanisms

Yang Lei,Li Tang,Hanyu Luo,WenJie Huang,Min Li
DOI: https://doi.org/10.1007/978-981-97-5087-0_2
2024-01-01
Abstract:Within the cell nucleus, chromatin folds to form loop structures that bring distant genomic regions into close proximity, a foundational mechanism in gene regulation. These loop structures facilitate interactions among enhancers, promoters, and other regulatory elements, fundamentally influencing gene expression patterns. With the advent of high-throughput technologies such as Hi-C and ChIA-PET, researchers have begun to peel back the layers of the genome’s complex three-dimensional organization, identifying thousands of looping interactions that vary across cell types and are conserved across species. However, these experimental techniques often require extremely high costs and complex experimental workflows, and their resolution is often low, while existing computational methods do not take into account the extremely imbalanced challenges of samples and often require additional epigenetic data, which are not always available. To overcome this problem, we propose a new deep learning computational tool called LoopNetica by utilizing a combination of one-dimensional convolutional neural networks and a multi-head attention mechanism. It can accurately predict the formation of chromatin loops using only sequences. Its accuracy is higher than existing methods, and LoopNetica can still maintain its accuracy even when the sample distribution is extremely imbalanced. With a simple and exquisite architecture, LoopNetica has high performance and very fast training speed. LoopNetica not only marks a major leap in computational exploration of genome structure, but also lays the foundation for a deeper understanding of the regulatory environment that drives gene expression and disease.
What problem does this paper attempt to address?