Predicting Salmonella MIC and Deciphering Genomic Determinants of Antibiotic Resistance and Susceptibility

Moses B. Ayoola, Athish Ram Das, B. Santhana Krishnan, David R. Smith, Bindu Nanduri, Mahalingam Ramkumar
DOI: https://doi.org/10.3390/microorganisms12010134
IF: 4.5
2024-01-10
Microorganisms
Abstract: Salmonella spp., a leading cause of foodborne illness, is a formidable global menace due to escalating antimicrobial resistance (AMR). The evaluation of minimum inhibitory concentration (MIC) for antimicrobials is critical for characterizing AMR. The current whole genome sequencing (WGS)-based approaches for predicting MIC are hindered by both computational and feature identification constraints. We propose an innovative methodology called the “Genome Feature Extractor Pipeline” that integrates traditional machine learning (random forest, RF) with deep learning models (multilayer perceptron (MLP) and DeepLift) for WGS-based MIC prediction. We used a dataset from the National Antimicrobial Resistance Monitoring System (NARMS), comprising 4500 assembled genomes of nontyphoidal Salmonella, each annotated with MIC metadata for 15 antibiotics. Our pipeline involves the batch downloading of annotated genomes, the determination of feature importance using RF, Gini-index-based selection of crucial 10-mers, and their expansion to 20-mers. This is followed by an MLP network, with four hidden layers of 1024 neurons each, to predict MIC values. Using DeepLift, key 20-mers and associated genes influencing MIC are identified. The 10 most significant 20-mers for each antibiotic are listed, showcasing our ability to discern genomic features affecting Salmonella MIC prediction with enhanced precision. The methodology replaces binary indicators with k-mer counts, offering a more nuanced analysis. The combination of RF and MLP addresses the limitations of the existing WGS approach, providing a robust and efficient method for predicting MIC values in Salmonella that could potentially be applied to other pathogens.
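To make the difference between binary indicators and k-mer counts concrete, here is a minimal sketch in plain Python. It is not the authors' code: the function names, the toy sequence, and the use of k = 4 instead of 10 are assumptions chosen only to keep the example readable.

```python
from collections import Counter

def kmer_counts(sequence: str, k: int = 10) -> Counter:
    """Count every overlapping k-mer that consists only of A, C, G, T."""
    sequence = sequence.upper()
    counts = Counter()
    for i in range(len(sequence) - k + 1):
        kmer = sequence[i:i + k]
        # Skip windows containing ambiguous bases such as N.
        if set(kmer) <= set("ACGT"):
            counts[kmer] += 1
    return counts

def feature_vector(counts: Counter, vocabulary: list, binary: bool = False) -> list:
    """Turn k-mer counts into a fixed-order feature vector.

    binary=True gives presence/absence indicators;
    binary=False keeps the raw counts, as the abstract describes.
    """
    if binary:
        return [1 if counts[kmer] > 0 else 0 for kmer in vocabulary]
    return [counts[kmer] for kmer in vocabulary]

# Hypothetical toy data: a 16-base sequence and a 4-entry vocabulary (k = 4).
toy_genome = "ACGTACGTACGTTTGA"
vocabulary = ["ACGT", "CGTA", "TTGA", "GGGG"]

counts = kmer_counts(toy_genome, k=4)
count_features = feature_vector(counts, vocabulary)
binary_features = feature_vector(counts, vocabulary, binary=True)

print(count_features)   # [3, 2, 1, 0]
print(binary_features)  # [1, 1, 1, 0]
```

The count vector separates a k-mer seen three times from one seen once, which the binary vector cannot do. With k = 10 the vocabulary has 4^10 = 1,048,576 possible entries, which is why a selection step is needed before training a network.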
microbiology
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve

This paper addresses the prediction of the **Minimum Inhibitory Concentration (MIC)** of antibiotics for **Salmonella**. Specifically, it focuses on the following points:

1. **Antimicrobial Resistance (AMR)**: Salmonella is a major pathogen causing foodborne illness, and its increasing antibiotic resistance poses a serious threat to global public health. Traditional laboratory methods (such as agar or broth dilution) can determine MIC values but are time-consuming, which makes them too slow to guide the treatment of severe infections.
2. **Application of Whole Genome Sequencing (WGS)**: Existing WGS-based methods for predicting MIC face challenges such as high computational resource demands and difficulty in feature identification. The paper proposes a new method, the "Genome Feature Extractor Pipeline," which combines traditional machine learning (Random Forest, RF) with deep learning models (Multilayer Perceptron, MLP) to predict MIC values.
3. **Identification of Genomic Features**: Using WGS data, the paper attempts to identify the key genomic features (k-mers) that influence antibiotic resistance and susceptibility. These features matter for understanding the mechanisms of antibiotic resistance in Salmonella.

### Main Contributions

1. **Improving Prediction Accuracy**: The proposed "Genome Feature Extractor Pipeline" reduces a large number of 10-mers (about 1 million) to a much smaller set of 20-mers (about 27,932), thereby improving the accuracy and efficiency of MIC prediction.
2. **Analysis of Genomic Feature Importance**: Using Random Forest and DeepLift, the paper identifies key 20-mers and their corresponding genes related to the MIC values of specific antibiotics, providing new insight into antibiotic resistance.
3. **Potential Application Across Pathogens**: The method is not limited to Salmonella and may also be usable for MIC prediction in other pathogens, aiding the rapid and accurate formulation of personalized antibiotic treatment plans.

### Conclusion

By combining traditional machine learning and deep learning, this paper addresses the computational and feature identification challenges in predicting antibiotic resistance in Salmonella. Its predictions could support more effective and safer antibiotic treatment.
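The Gini-index-based selection of k-mers mentioned above can be illustrated with a small sketch. Random forest importance of this kind is built from the impurity decrease that a feature achieves when it splits the samples, so the code below computes that decrease for a single split. Treating MIC dilution levels as class labels, the toy values, and all function names are assumptions for illustration, not details taken from the paper.

```python
from collections import Counter

def gini(labels: list) -> float:
    """Gini impurity: 1 minus the sum of squared class proportions."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def gini_decrease(feature: list, labels: list, threshold: float) -> float:
    """Impurity decrease from splitting samples at feature <= threshold."""
    left = [y for x, y in zip(feature, labels) if x <= threshold]
    right = [y for x, y in zip(feature, labels) if x > threshold]
    n = len(labels)
    weighted = (len(left) / n) * gini(left) + (len(right) / n) * gini(right)
    return gini(labels) - weighted

# Hypothetical toy data: six genomes, MIC levels used as class labels.
mic_class = ["<=2", "<=2", "<=2", ">=32", ">=32", ">=32"]
kmer_a = [0, 0, 0, 2, 3, 2]  # count of a k-mer that tracks the MIC class
kmer_b = [1, 2, 1, 2, 1, 2]  # count of a k-mer only weakly related to it

score_a = gini_decrease(kmer_a, mic_class, threshold=0)
score_b = gini_decrease(kmer_b, mic_class, threshold=1)

# Rank k-mers by impurity decrease, highest first.
ranking = sorted([("kmer_a", score_a), ("kmer_b", score_b)],
                 key=lambda item: item[1], reverse=True)
print(ranking[0][0])  # kmer_a
```

A random forest averages such decreases over many splits and trees. Keeping only the top-ranked k-mers is one way to shrink the feature space before training the MLP.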