'cito': an R package for training neural networks using 'torch'

Christian Amesöder,Florian Hartig,Maximilian Pichler
DOI: https://doi.org/10.1111/ecog.07143
IF: 5.9
2024-05-08
Ecography
Abstract:Deep neural networks (DNN) have become a central method in ecology. To build and train DNNs in deep learning (DL) applications, most users rely on one of the major deep learning frameworks, in particular PyTorch or TensorFlow. Using these frameworks, however, requires substantial experience and time. Here, we present 'cito', a user‐friendly R package for DL that allows specifying DNNs in the familiar formula syntax used by many R packages. To fit the models, 'cito' takes advantage of the numerically optimized 'torch' library, including the ability to switch between training models on the CPU or the graphics processing unit (GPU) which allows the efficient training of large DNNs. Moreover, 'cito' includes many user‐friendly functions for model plotting and analysis, including explainable AI (xAI) metrics for effect sizes and variable importance. All xAI metrics as well as predictions can optionally be bootstrapped to generate confidence intervals, including p‐values. To showcase a typical analysis pipeline using 'cito', with its built‐in xAI features, we built a species distribution model of the African elephant. We hope that by providing a user‐friendly R framework to specify, deploy and interpret DNNs, 'cito' will make this interesting class of models more accessible to ecological data analysis. A stable version of 'cito' can be installed from the comprehensive R archive network (CRAN).
ecology,biodiversity conservation
What problem does this paper attempt to address?