Accelerating joint species distribution modeling with Hmsc-HPC: A 1000x faster GPU deployment

Anis Ur Rahman,Gleb Tikhonov,Jari Oksanen,Tuomas Rossi,Otso Ovaskainen
DOI: https://doi.org/10.1101/2024.02.13.580046
2024-02-14
Abstract:Joint Species Distribution Modelling (JSDM) is a powerful and increasingly widely used statistical methodology in biodiversity modelling, enabling researchers to assess and predict the joint distribution of species across space and time. However, JSDM can be computationally intensive and even prohibitive, especially for large datasets and sophisticated model structures. To address computational limitations of JSDM, we expanded one widely used JSDM framework, Hmsc-R, by developing a Graphical Processing Unit (GPU) -compatible implementation of its model fitting algorithm. While our augmented framework retains the original user interface in R, its new computational core is coded in Python and dominantly uses TensorFlow library. This enhancement primarily targets to enable leveraging high-performance computing resources effectively, though it also accelerates model fitting with consumer-level machines. This upgrade is designed to leverage high-performance computing resources more effectively. We evaluated the performance of the proposed implementation across diverse model configurations and dataset sizes. Our results indicate significant model fitting speed-up compared to the existing Hmsc-R package across most models. Notably, for the largest datasets, we achieved 1000 times speed-ups. This GPU-compatible enhancement boosts the scalability of Hmsc-R package by several orders of magnitude, reaching a significantly higher level. It opens promising opportunities for modeling extensive and intricate datasets, enabling better-informed conservation strategies, environmental management, and climate change adaptation planning.
Ecology
What problem does this paper attempt to address?