Informatics Management of Tumor Specimens in the Era of Big Data: Challenges and Solutions

Pei-Fen Zhang,Xiao-Hui Zheng,Xi-Zhao Li,Lin Sun,Wei-Hua Jia
DOI: https://doi.org/10.1089/bio.2020.0084
2021-01-01
Biopreservation and Biobanking
Abstract:Biomedical data bear the potential to facilitate personalized diagnosis and precision treatment. In the era of Big Data, high-quality annotation of human specimens has become the primary mission of biobankers, especially for tumor biobanks with large amounts of "omics" and clinical data. However, the lack of agreed-upon standardization and the gap among heterogeneous databases make information application and communication a major challenge. International efforts are underway to develop national projects on informatics management. The aim of this review is to provide references in specimen annotation to regulate and take full advantage of biological and biomedical information. First, critical data categories that are vital for specimen applications, including sample attributes, clinical data, preanalytical variations, and analytical records, are systematically listed for subsequent data mining. Second, current standards and guidelines related to biospecimen information are reviewed, and proper standards for tumor biobanks are recommended. In particular, commonly-used approaches and functionalities of data management are summarized and discussed. This review highlights the importance of informatics management of tumor specimens, defines critical data types, recommends data standards, and presents the methodologies of data harmonization for biobankers to reach high quality annotation of biospecimens.
What problem does this paper attempt to address?