The Interpretable Multimodal Machine Learning (IMML) framework reveals pathological signatures of distal sensorimotor polyneuropathy

Phong BH Nguyen,Daniel Garger,Diyuan Lu,Haifa Maalmi,Holger Prokisch,Barbara Thorand,Jerzy Adamski,Gabi Kastenmueller,Melanie Waldenberger,Christian Gieger,Annette Peters,Karsten Suhre,Gidon J Boenhof,Wolfgang Rathmann,Michael Roden,Harald Grallert,Dan Ziegler,Christian Herder,Michael Menden
DOI: https://doi.org/10.1101/2024.01.04.574164
2024-09-04
Abstract:Distal sensorimotor polyneuropathy (DSPN) is a common neurological disorder in elderly adults and people with obesity, prediabetes and diabetes and is associated with high morbidity and premature mortality. DSPN is a multifactorial disease and not fully understood yet. Here, we developed the Interpretable Multimodal Machine Learning (IMML) framework for predicting DSPN prevalence and incidence based on sparse multimodal data. Exploiting IMMLs interpretability further empowered biomarker identification. We leveraged the population-based KORA F4/FF4 cohort including 1,091 participants and their deep multimodal characterisation, i.e. clinical data, genomics, methylomics, transcriptomics, proteomics, inflammatory proteins and metabolomics. Clinical data alone is sufficient to stratify individuals with and without DSPN (AUROC = 0.752), whilst predicting DSPN incidence 6.5 +- 0.2 years later strongly benefits from clinical data complemented with two or more molecular modalities (improved AUROC >0.1, achieved AUROC of 0.714). Important and interpretable features of incident DSPN prediction include up-regulation of proinflammatory cytokines, down-regulation of SUMOylation pathway and essential fatty acids, thus yielding novel insights in the disease pathophysiology. These may become biomarkers for incident DSPN, guide prevention strategies and serve as proof of concept for the utility of IMML in studying complex diseases.
Bioinformatics
What problem does this paper attempt to address?
The problem this paper attempts to address is the prediction and elucidation of the pathogenesis of Distal Sensorimotor Polyneuropathy (DSPN). Specifically, the goals of the paper include: 1. **Predicting the prevalence and incidence of DSPN**: By developing an Interpretable Multimodal Machine Learning (IMML) framework, utilizing sparse multimodal data (such as clinical data, genomics, methylomics, transcriptomics, proteomics, inflammatory proteins, and metabolomics) to predict the prevalence and incidence of DSPN over the next 6.5 years. 2. **Identifying important biomarkers**: Through the interpretability of the IMML framework, identifying biomarkers associated with DSPN, which may provide important clues for the prevention and treatment of DSPN. 3. **Understanding the pathophysiological mechanisms of DSPN**: By analyzing important features in the predictive model, revealing the pathophysiological mechanisms of DSPN, particularly the upregulation of inflammatory cytokines, downregulation of the SUMOylation pathway, and reduction of essential fatty acids. The paper demonstrates the effectiveness of the IMML framework in predicting the prevalence and incidence of DSPN using data from 1,091 participants in the KORA F4/FF4 cohort, and identifies multiple potential biomarkers, providing new directions for future disease prevention and treatment.