Contribution of MALDI‐TOF mass spectrometry and machine learning including deep learning techniques for the detection of virulence factors of Clostridioides difficile strains

Alexandre Godmer,Quentin Giai Gianetto,Killian Le Neindre,Valentine Latapy,Mathilda Bastide,Muriel Ehmig,Valérie Lalande,Nicolas Veziris,Alexandra Aubry,Frédéric Barbut,Catherine Eckert
DOI: https://doi.org/10.1111/1751-7915.14478
2024-06-10
Microbial Biotechnology
Abstract:This study assessed the use of MALDI‐TOF Mass Spectrometry combined with Machine Learning including Deep Learning techniques to identify toxigenic and hypervirulent strains of Clostridioides difficile. The method demonstrated high accuracy, particularly in excluding non‐toxigenic strains with a negative predictive value of 0.95 and consistently high specificity (>0.96) for detecting binary producing strains. The findings suggest that MALDI‐TOF MS enhanced by ML techniques is a rapid, cost‐effective tool for detecting virulence factors in CD strains, potentially improving outbreak management. Clostridioides difficile (CD) infections are defined by toxins A (TcdA) and B (TcdB) along with the binary toxin (CDT). The emergence of the 'hypervirulent' (Hv) strain PR 027, along with PR 176 and 181, two decades ago, reshaped CD infection epidemiology in Europe. This study assessed MALDI‐TOF mass spectrometry (MALDI‐TOF MS) combined with machine learning (ML) and Deep Learning (DL) to identify toxigenic strains (producing TcdA, TcdB with or without CDT) and Hv strains. In total, 201 CD strains were analysed, comprising 151 toxigenic (24 ToxA+B+CDT+, 22 ToxA+B+CDT+ Hv+ and 105 ToxA+B+CDT−) and 50 non‐toxigenic (ToxA−B−) strains. The DL‐based classifier exhibited a 0.95 negative predictive value for excluding ToxA−B− strains, showcasing accuracy in identifying this strain category. Sensitivity in correctly identifying ToxA+B+CDT− strains ranged from 0.68 to 0.91. Additionally, all classifiers consistently demonstrated high specificity (>0.96) in detecting ToxA+B+CDT+ strains. The classifiers' performances for Hv strain detection were linked to high specificity (≥0.96). This study highlights MALDI‐TOF MS enhanced by ML techniques as a rapid and cost‐effective tool for identifying CD strain virulence factors. Our results brought a proof‐of‐concept concerning the ability of MALDI‐TOF MS coupled with ML techniques to detect virulence factor and potentially improve the outbreak's management.
biotechnology & applied microbiology,microbiology
What problem does this paper attempt to address?