A Beacon in the Dark: Grey Literature Data Mining and Machine Learning Enlightening Historical Plankton Seasonality Dynamics in the Ligurian Sea

Alice Guzzi, Stefano Schiaparelli, Maria Balan, Marco Grillo
DOI: https://doi.org/10.3390/d16030189
2024-03-21
Diversity
Abstract: The Mediterranean Sea, as one of the world's most climate-sensitive regions, faces significant environmental changes due to rising temperatures. Zooplankton communities, particularly copepods, play a vital role in marine ecosystems, yet their distribution dynamics remain poorly understood, especially in the Ligurian Sea. Leveraging open-source software and environmental data, this study adapted a methodology to model copepod distributions from 1985 to 1986 in the Portofino Promontory ecosystem using the Random Forest machine learning algorithm to produce the first abundance and distribution maps of the area. Five copepod genera were studied across different trophic guilds, revealing habitat preferences and ecological fluctuations throughout the seasons. The assessment of model accuracy through symmetric mean absolute percentage error (sMAPE) highlighted the variability in copepod dynamics influenced by environmental factors. While certain genera exhibited higher predictive accuracy during specific seasons, others posed challenges due to ecological complexities. This study underscores the importance of species-specific responses and environmental variability in predictive modeling. Moreover, this study represents the first attempt to model copepod distribution in the Ligurian Sea, shedding light on their ecological niches and historical spatial dynamics. The study adhered to FAIR principles, repurposing historical data to generate three-dimensional predictive maps, enhancing our understanding of copepod biodiversity. Future studies will focus on developing abundance distribution models using machine learning and artificial intelligence to predict copepod standing crop in the Ligurian Sea with greater precision. This integrated approach advances knowledge of copepod ecology in the Mediterranean and sets a precedent for integrating historical data with contemporary methodologies to elucidate marine ecosystem dynamics.
ecology, biodiversity conservation
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve

The paper addresses the seasonal dynamics of zooplankton, particularly copepods, in the Ligurian Sea (Mediterranean). Specifically, it focuses on the following points:

1. **Impact of Climate Change on Zooplankton**: As one of the most climate-sensitive regions in the world, the Mediterranean faces significant environmental changes, especially due to rising temperatures. The paper studies the distribution dynamics of zooplankton, particularly copepods, to help reveal the impact of these changes.
2. **Utilization of Historical Data**: The paper uses historical grey literature data collected from 1985 to 1986 in the Portofino Promontory ecosystem, which had not previously been fully exploited. By digitizing these data, it aims to fill a historical gap in knowledge of zooplankton distribution dynamics in this region.
3. **Establishment of Predictive Models**: The paper uses the Random Forest machine learning algorithm to build predictive distribution models for five copepod genera (Acartia, Oithona, Centropages, Temora, and Corycaeus). These models generate the first abundance and distribution maps for the area, revealing habitat preferences and ecological fluctuations across seasons.
4. **Evaluation of Model Accuracy**: The paper evaluates model accuracy using the symmetric mean absolute percentage error (sMAPE), which shows that predictive performance differs among seasons and copepod genera. This helps to clarify how environmental factors influence copepod dynamics and provides a reference for further research.

### Main Objectives

- **Enhance Understanding of Zooplankton Ecological Dynamics**: Through modeling and analysis, the paper aims to improve understanding of the distribution patterns and ecological responses of copepods in different seasons.
- **Promote Reuse of Historical Data**: The paper follows the FAIR principles (Findable, Accessible, Interoperable, Reusable), digitizing historical data so that they can be widely reused.
- **Provide a Foundation for Future Research**: The paper provides a methodological foundation for future work that uses machine learning and artificial intelligence to predict copepod standing crop in the Ligurian Sea with greater precision.

In summary, by combining historical data with modern machine learning techniques, the paper aims to characterize the seasonal dynamics of zooplankton in the Ligurian Sea and to provide scientific evidence relevant to the challenges posed by climate change.
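
To make the accuracy metric concrete, here is a minimal sketch of sMAPE in plain Python. It assumes one common definition, in which each absolute error is divided by the mean of the absolute observed and predicted values and the result is expressed as a percentage from 0 to 200. The summary does not state which variant the authors used, so the formula, the zero-handling convention, and the function name `smape` are assumptions. The abundance values are invented for illustration and are not data from the paper.

```python
def smape(observed, predicted):
    """Symmetric mean absolute percentage error, in percent (range 0 to 200).

    Assumed variant: mean of |pred - obs| / ((|obs| + |pred|) / 2) * 100.
    """
    if len(observed) != len(predicted):
        raise ValueError("observed and predicted must have the same length")
    if not observed:
        raise ValueError("at least one observation is required")

    total = 0.0
    for obs, pred in zip(observed, predicted):
        denom = (abs(obs) + abs(pred)) / 2.0
        # Assumed convention: when both values are zero the term counts as
        # zero error, which also avoids a division by zero.
        if denom != 0.0:
            total += abs(pred - obs) / denom
    return 100.0 * total / len(observed)


# Hypothetical copepod abundances (individuals per cubic metre).
observed = [120.0, 80.0, 0.0, 40.0]
predicted = [100.0, 80.0, 0.0, 60.0]
print(round(smape(observed, predicted), 2))
```

Because the denominator combines observed and predicted values, the metric stays bounded when observed abundances are near zero, which is one plausible reason to prefer it over plain MAPE for patchy plankton counts.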