Abstract:Deep learning methods have been applied when working to enhance the prediction accuracy of traditional statistical methods in the field of plant breeding. Although deep learning seems to be a promising approach for genomic prediction, it has proven to have some limitations, since its conventional methods fail to leverage all available information. Multimodal deep learning methods aim to improve the predictive power of their unimodal counterparts by introducing several modalities (sources) of input information. In this review, we introduce some theoretical basic concepts of multimodal deep learning and provide a list of the most widely used neural network architectures in deep learning, as well as the available strategies to fuse data from different modalities. We mention some of the available computational resources for the practical implementation of multimodal deep learning problems. We finally performed a review of applications of multimodal deep learning to genomic selection in plant breeding and other related fields. We present a meta-picture of the practical performance of multimodal deep learning methods to highlight how these tools can help address complex problems in the field of plant breeding. We discussed some relevant considerations that researchers should keep in mind when applying multimodal deep learning methods. Multimodal deep learning holds significant potential for various fields, including genomic selection. While multimodal deep learning displays enhanced prediction capabilities over unimodal deep learning and other machine learning methods, it demands more computational resources. Multimodal deep learning effectively captures intermodal interactions, especially when integrating data from different sources. To apply multimodal deep learning in genomic selection, suitable architectures and fusion strategies must be chosen. It is relevant to keep in mind that multimodal deep learning, like unimodal deep learning, is a powerful tool but should be carefully applied. Given its predictive edge over traditional methods, multimodal deep learning is valuable in addressing challenges in plant breeding and food security amid a growing global population.

Machine learning algorithms translate big data into predictive breeding accuracy

Applications and Trends of Machine Learning in Genomics and Phenomics for Next-Generation Breeding

Plant Genotype to Phenotype Prediction Using Machine Learning

Deep learning methods improve genomic prediction of wheat breeding

A review of machine learning models applied to genomic prediction in animal breeding

A review of deep learning applications for genomic selection

Machine learning approaches for crop improvement: Leveraging phenotypic and genotypic big data

Smart breeding driven by big data, artificial intelligence, and integrated genomic-enviromic prediction

Prediction and importance of predictors in approaches based on computational intelligence and machine learning

Genomic prediction using machine learning: a comparison of the performance of regularized regression, ensemble, instance-based and deep learning methods on synthetic and empirical data

A review of multimodal deep learning methods for genomic-enabled prediction in plant breeding

Using machine learning to integrate genetic and environmental data to model genotype-by-environment interactions

Using machine learning to improve the accuracy of genomic prediction of reproduction traits in pigs

Multimodal deep learning methods enhance genomic prediction of wheat breeding

Smart breeding driven by advances in sequencing technology

Genomic prediction in plants: opportunities for ensemble machine learning based approaches

Genome-Wide Prediction of Complex Traits in Two Outcrossing Plant Species Through Deep Learning and Bayesian Regularized Neural Network

A Benchmarking Between Deep Learning, Support Vector Machine and Bayesian Threshold Best Linear Unbiased Prediction for Predicting Ordinal Traits in Plant Breeding

Digitalizing breeding in plants: A new trend of next-generation breeding based on genomic prediction

Using machine learning to combine genetic and environmental data for maize grain yield predictions across multi-environment trials

What makes a good prediction? Feature importance and beginning to open the black box of machine learning in genetics