Abstract:Predicting species distributions and entire communities is crucial for ecologists, to enhance our understanding of the drivers behind species distributions and community assembly and to provide quantitative data for conservation efforts. Popular species distribution models use statistical and machine learning methods but face limitations with multi‐species predictions at the community level, hindered by scalability and data imbalance sensitivity. This paper explores the potential of deep learning methods to overcome these challenges and provide more accurate multi‐species predictions. Specifically, we introduced four distinct deep learning models that use site × species community data but differ in their internal structure or on the input environmental data structure: (1) a multi‐layer perceptron (MLP) model for tabular data (e.g. in‐situ/raster climate or soil data), (2) a convolutional neural network (CNN) and (3) a vision transformer (ViT) models tailored for image data (e.g. aerial ortho‐photographs, satellite imagery), and a multimodal model that integrates both tabular and image data. We also show how adapted loss functions can address imbalance issues. We applied these deep learning models to a plant community dataset comprising 130,582 vegetation surveys encompassing 2522 species located in the French Alps. The tabular environmental data consisted of climate, terrain and soil information, while the images were derived from aerial photographs. All models achieved approximately 70% true skill statistics on hold‐out data, demonstrating high predictive capacity for community data, the multimodal model being the best performing one. Additionally, we showcased how interpretability tools can illuminate community structure as seen by deep learning models. Deep learning models offer a broad array of features for predicting entire species communities. They handle imbalance issues and accommodate various data types, from tabular datasets to images, while also being equipped with insightful interpretation tools. The versatility extends to tabular datasets and images, with no clear superiority between the two. The last hidden layers can provide valuable features for modelling other species, and the trained models can be used to support transfer learning to related tasks. The field of ecology now possesses an additional, potent tool in its arsenal that can foster basic and fundamental research. Résumé La prédiction de la distribution des espèces et des communautés est essentielle pour les écologistes, afin d'améliorer notre compréhension des facteurs qui sous‐tendent la répartition des espèces et l'assemblage des communautés et de fournir des données quantitatives pour les efforts de conservation. Les modèles de distribution des espèces utilisent généralement des méthodes statistiques et d'apprentissage automatique, mais se heurtent à des limites pour les prédictions multi‐espèces au niveau de la communauté, entravées par l'extensibilité et la sensibilité au déséquilibre des données. Cet article explore le potentiel des méthodes d'apprentissage profond pour surmonter ces défis et fournir des prédictions multi‐espèces plus réalistes. Plus précisément, nous introduisons quatre différents modèles d'apprentissage profond qui utilisent des données sur les communautés d'espèces (sites × espèces) mais qui diffèrent par leur structure interne ou par la structure des données environnementales d'entrée: (1) un modèle de perceptron multicouche (MLP) pour les données tabulaires (par exemple, des données climatiques ou pédologiques mesurées in situ ou sous forme de raster), (2) un réseau de neurones convolutif (CNN) et (3) un vision transforer (ViT) adaptés aux données sous forme d'image (par exemple, orthophotos aériennes, images satellitaires), et un modèle multimodal qui intègre à la fois les données tabulaires et les données d'image. Nous montrons également comment des fonctions de perte adaptées peuvent résoudre les problèmes de déséquilibre. Nous avons appliqué ces modèles d'apprentissage profond sur des données de communautés végétales comprenant 130,582 relevés de végétation et 2522 d'espèces situées dans les Alpes françaises. Les données environnementales tabulaires comprenaient des informations sur le climat, le terrain et le sol, tandis que les images provenaient de photographies aériennes. Tous les modèles ont atteint environ une True Skill Statistics de 70% sur les données d'évaluation , démontrant une grande capacité de prédiction pour les données de communautées, le modèle multimodal étant le plus performant. En outre, nous montrons comment les outils d'interprétabilité peuvent éclairer la structure de la communauté telle qu'elle est perçue par les modèles d'apprentissage profond. Les modèles d'apprentissage profond offrent donc un large éventail de caractéristiques pour prédire des communautés d'espèces dans leur ensemble. Ils gèrent les problèmes -Abstract Truncated-

Interpretable and predictive models based on high-dimensional data in ecology and evolution

Disentangling Key Species Interactions in Diverse and Heterogeneous Communities: A Bayesian Sparse Modelling Approach.

Shrinkage-based Bayesian variable selection for species distribution modelling in complex environments: An application to urban biodiversity

Scientific machine learning in ecological systems: A study on the predator-prey dynamics

Using Full Models, Stepwise Regression and Model Selection in Ecological Data Sets: Monte Carlo Simulations

A practical guide to selecting models for exploration, inference, and prediction in ecology

High-dimensional regression in practice: an empirical study of finite-sample prediction, variable selection and ranking

Sparse modeling for climate variable selection across trophic levels

Fast and More Powerful Selective Inference for Sparse High-Order Interaction Model

Variable Selection and Minimax Prediction in High-dimensional Functional Linear Model

72 Developing Machine Learning Models When Data is Limiting

A big data-model integration approach for predicting epizootics and population recovery in a keystone species

Inferring the Effect of Species Interactions on Trait Evolution

Predictive habitat distribution models in ecology

Sparse Variable Selection on High Dimensional Heterogeneous Data With Tree Structured Responses

Machine learning and deep learning—A review for ecologists

Joint hierarchical models for sparsely sampled high-dimensional LiDAR and forest variables

Exploring the Dynamics of Lotka-Volterra Systems: Efficiency, Extinction Order, and Predictive Machine Learning

High-dimensional prediction for count response via sparse exponential weights

Introduction to deep learning methods for multi‐species predictions

A brief introduction to mixed effects modelling and multi-model inference in ecology