ChromaFold predicts the 3D contact map from single-cell chromatin accessibility
Vianne R. Gao,Rui Yang,Arnav Das,Renhe Luo,Hanzhi Luo,Dylan R. McNally,Ioannis Karagiannidis,Martin A. Rivas,Zhong-Min Wang,Darko Barisic,Alireza Karbalayghareh,Wilfred Wong,Yingqian A. Zhan,Christopher R. Chin,William S. Noble,Jeff A. Bilmes,Effie Apostolou,Michael G. Kharas,Wendy Béguelin,Aaron D. Viny,Danwei Huangfu,Alexander Y. Rudensky,Ari M. Melnick,Christina S. Leslie
DOI: https://doi.org/10.1038/s41467-024-53628-0
IF: 16.6
2024-11-02
Nature Communications
Abstract:Identifying cell-type-specific 3D chromatin interactions between regulatory elements can help decipher gene regulation and interpret disease-associated non-coding variants. However, achieving this resolution with current 3D genomics technologies is often infeasible given limited input cell numbers. We therefore present ChromaFold, a deep learning model that predicts 3D contact maps, including regulatory interactions, from single-cell ATAC sequencing (scATAC-seq) data alone. ChromaFold uses pseudobulk chromatin accessibility, co-accessibility across metacells, and a CTCF motif track as inputs and employs a lightweight architecture to train on standard GPUs. Trained on paired scATAC-seq and Hi-C data in human samples, ChromaFold accurately predicts the 3D contact map and peak-level interactions across diverse human and mouse test cell types. Compared to leading contact map prediction models that use ATAC-seq and CTCF ChIP-seq, ChromaFold achieves state-of-the-art performance using only scATAC-seq. Finally, fine-tuning ChromaFold on paired scATAC-seq and Hi-C in a complex tissue enables deconvolution of chromatin interactions across cell subpopulations.
multidisciplinary sciences