A biological ocean data reformatting effort

Kimberlee Baldry,Robert Johnson,Peter G. Strutton,Philip W. Boyd
DOI: https://doi.org/10.1038/s41597-024-03038-0
2024-02-17
Scientific Data
Abstract:Biological ocean data collected from ships find reuse in aggregations of historical data. These data are heavily relied upon to document long term change, validate satellite algorithms for ocean biology and are useful in assessing the performance of autonomous platforms and biogeochemical models. Existing aggregate products have largely been restricted to the surface ocean, omit physical data or have limited biological data. We present the first version of a BIOlogical ocean data reforMATting Effort (BIO-MATE) to begin to fill a gap in subsurface bio-physical data aggregates in a reproducible way. BIO-MATE uses open-source R software that reformats openly sourced published datasets from oceanographic voyages. These reformatted biological and physical data from underway sensors, profiling sensors, pigments analysis and particulate organic carbon analysis are stored in an interoperable BIO-MATE data product for easy access and use. Specific QA/QC protocols can now be easily applied to the BIO-MATE data product to support a variety of surface and subsurface applications.
multidisciplinary sciences
What problem does this paper attempt to address?
This paper aims to solve the difficult problem of integrating marine biological data and physical data. Specifically, the paper proposes a data restructuring effort named BIO - MATE (BIOlogical ocean data reforMATting Effort), with the following goals: 1. **Fill in the gaps in existing data sets**: Existing aggregated products mainly focus on surface ocean data, either lacking physical data or having limited biological data. BIO - MATE fills these gaps, especially for subsurface bio - physical data, by integrating biological and physical data from different voyages. 2. **Provide reusable data products**: BIO - MATE uses the open - source R software to re - format publicly released oceanographic voyage data and store it in a highly interoperable data product for easy user access and use. 3. **Support multiple applications**: The BIO - MATE data product can be applied to various surface and subsurface applications, such as: - Understanding the response changes of in - situ fluorometers in the Southern Ocean. - Evaluating non - photochemical quenching corrections. - Studying the role of ocean physics in regulating subsurface chlorophyll characteristics. - Verifying satellite observations. - Developing new methods to verify in - situ bio - optical observations collected by autonomous platforms. - Training ocean state estimation. - Supporting the development of bio - physical models. - Using multivariate analysis to understand bio - physical relationships. 4. **Improve data reusability and citation rate**: By providing easily accessible and citable data products, BIO - MATE encourages researchers to publish their data for reuse, thereby increasing the value of these data. In summary, the BIO - MATE project aims to improve the availability and reusability of biological and physical ocean data by integrating and standardizing them, thus supporting a wider range of research applications.