ReUseData: an R/Bioconductor tool for reusable and reproducible genomic data management

Qian Liu,Qiang Hu,Song Liu,Alan Hutson,Martin Morgan
DOI: https://doi.org/10.1186/s12859-023-05626-0
IF: 3.307
2024-01-05
BMC Bioinformatics
Abstract:The increasing volume and complexity of genomic data pose significant challenges for effective data management and reuse. Public genomic data often undergo similar preprocessing across projects, leading to redundant or inconsistent datasets and inefficient use of computing resources. This is especially pertinent for bioinformaticians engaged in multiple projects. Tools have been created to address challenges in managing and accessing curated genomic datasets, however, the practical utility of such tools becomes especially beneficial for users who seek to work with specific types of data or are technically inclined toward a particular programming language. Currently, there exists a gap in the availability of an R-specific solution for efficient data management and versatile data reuse.
biochemical research methods,biotechnology & applied microbiology,mathematical & computational biology
What problem does this paper attempt to address?