A roadmap for fair reuse of public microbiome data

Laura A Hug, Roland Hatzenpichler, Cristina Moraru, Andre Soares, Folker Meyer, Anke Heyder, Alexander J Probst
DOI: https://doi.org/10.1101/2024.06.21.599698
2024-06-24
Abstract: Science benefits from rapid, open data sharing, but samples for sequencing data are expensive for data creators to acquire and process. Current guidelines for data reuse were established two decades ago, when databases were several million times smaller, and are due for an update. This article presents a roadmap to establish best practices for sequence data reuse, developed in consultation with a data consortium of 167 microbiome scientists. It introduces a Data Reuse Information (DRI) tag for public sequencing data, which will be associated with at least one Open Researcher and Contributor ID (ORCID) account. The machine-readable DRI tag indicates that the data creators prefer to be contacted prior to data reuse and provides data consumers with a mechanism to get in touch with them. Ideally, the DRI will facilitate and foster collaborations and serve as a guideline that can be expanded to other data types.
Microbiology