DORA: an interactive map for the visualization and analysis of ancient human DNA and associated data

Keith D Harris,Gili Greenbaum
DOI: https://doi.org/10.1093/nar/gkae373
IF: 14.9
2024-05-16
Nucleic Acids Research
Abstract:The ability to sequence ancient genomes has revolutionized the way we study evolutionary history by providing access to the most important aspect of evolution—time. Until recently, studying human demography, ecology, biology, and history using population genomic inference relied on contemporary genomic datasets. Over the past decade, the availability of human ancient DNA (aDNA) has increased rapidly, almost doubling every year, opening the way for spatiotemporal studies of ancient human populations. However, the multidimensionality of aDNA, with genotypes having temporal, spatial and genomic coordinates, and integrating multiple sources of data, poses a challenge for developing meta-analyses pipelines. To address this challenge, we developed a publicly-available interactive tool, DORA, which integrates multiple data types, genomic and non-genomic, in a unified interface. This web-based tool enables browsing sample metadata alongside additional layers of information, such as population structure, climatic data, and unpublished samples. Users can perform analyses on genotypes of these samples, or export sample subsets for external analyses. DORA integrates analyses and visualizations in a single intuitive interface, resolving the technical issues of combining datasets from different sources and formats, and allowing researchers to focus on the scientific questions that can be addressed through analysis of aDNA datasets.
biochemistry & molecular biology
What problem does this paper attempt to address?
### Problems Addressed by the Paper The paper introduces DORA (Data Overlays for Research in Archaeogenomics), an interactive tool for exploring and analyzing ancient human DNA (aDNA) data. The paper aims to address the following issues: 1. **Challenges of Multidimensional Data Analysis**: With the rapid increase in ancient human DNA data, effectively integrating and visualizing these multidimensional data (including temporal, spatial, and genomic coordinates) has become a challenge. Existing tools face technical barriers when dealing with these complex data. 2. **Simplification of Meta-Analysis Workflow**: Researchers need a tool that can easily integrate datasets from different sources and allow for analysis and visualization within a unified interface. This can reduce cumbersome data preparation steps, allowing researchers to focus more on the scientific research itself. By developing the DORA tool, the authors address the above issues, enabling users to browse sample metadata within the same intuitive interface, combine it with other information layers (such as population structure, climate data, and unpublished samples), and analyze or export subsets of these samples for external analysis. DORA not only simplifies the analysis workflow but also allows users to seamlessly combine published resources with their unpublished genomic samples or other data layers.