scDART: integrating unmatched scRNA-seq and scATAC-seq data and learning cross-modality relationship simultaneously

Ziqi Zhang,Chengkai Yang and Xiuwei Zhang
DOI: https://doi.org/10.1186/s13059-022-02706-x
IF: 17.906
2022-06-28
Genome Biology
Abstract:It is a challenging task to integrate scRNA-seq and scATAC-seq data obtained from different batches. Existing methods tend to use a pre-defined gene activity matrix to convert the scATAC-seq data into scRNA-seq data. The pre-defined gene activity matrix is often of low quality and does not reflect the dataset-specific relationship between the two data modalities. We propose scDART, a deep learning framework that integrates scRNA-seq and scATAC-seq data and learns cross-modalities relationships simultaneously. Specifically, the design of scDART allows it to preserve cell trajectories in continuous cell populations and can be applied to trajectory inference on integrated data.
genetics & heredity,biotechnology & applied microbiology
What problem does this paper attempt to address?