Rapid Detection of Carbapenem-Resistant Klebsiella pneumoniae Using Machine Learning and MALDI-TOF MS Platform
Jinyu Wang,Cuiping Xia,Yue Wu,Xin Tian,Ke Zhang,Zhongxin Wang
DOI: https://doi.org/10.2147/IDR.S367209
2022-07-12
Infection and Drug Resistance
Abstract:Jinyu Wang, &ast Cuiping Xia, &ast Yue Wu, Xin Tian, Ke Zhang, Zhongxin Wang Department of Clinical Laboratory, The First Affiliated Hospital of Anhui Medical University, Hefei, People's Republic of China &astThese authors contributed equally to this work Correspondence: Zhongxin Wang, Department of Clinical Laboratory, The First Affiliated Hospital of Anhui Medical University, Hefei, People's Republic of China, Tel +8613866709500, Fax +5516-5908076, Email Background: Rapid detection of carbapenem-resistant Klebsiella pneumoniae (CRKP) is essential for specific antimicrobial therapy. Machine learning techniques combined with matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) can be used as a rapid, reliable, sensitive, and low-cost species identification method. Methods: Clinically collected K. pneumoniae were subjected to MALDI-TOF MS analysis. A random forest (RF) algorithm and non-linear support vector machine (SVM) were used to construct the RF, SVM, and dimension reduction (SVM-K) models, and their performance was assessed for accuracy, sensitivity, specificity, and area under the subject worker curve (AUC). Results: The RF, SVM and SVM-K models showed good classification performance with 0.88, 0.88, and 0.91 accuracy, 0.82, 0.85, and 0.89 sensitivity, 0.93, 0.92, and 0.94 specificity with an AUC of 0.9013, 0.9298, and 0.9356, respectively. For the SVM-K model, the optimal dimension reduction was 105 to 153, and the average accuracy was > 0.9. The top 10 peak features of significance according to the RF algorithm with 6515 Da appeared in 56.8% of CRKP isolates and 5.3% of CSKP isolates, which indicated the best classification performance. Conclusion: The three RF, SVM, and SVM-K models showed excellent classification performance differentiating the CRKP from CSKP; the SVM-K model was the best. Data analysis with machine learning combined with MALDI-TOF MS can be employed as a rapid and inexpensive alternative to existing detection methods. Keywords: Klebsiella pneumoniae , RF, SVM, SVM-K, MALDI-TOF MS Klebsiella pneumoniae is commonly encountered opportunistic gram-negative bacterium that causes higher morbidity and mortality. 1 With the widespread use of broad-spectrum antimicrobials such as lactamides and aminoglycosides, bacteria are prone to become multi-drug resistant by producing β-lactamases and cephalosporinases. Worldwide, treatment of carbapenem resistance K. pneumoniae (CRKP) has become a serious challenge. The resistance mechanism of CRKP involves the absence of membrane porins OmpK35 and OmpK36 and the production of broad-spectrum lactamases (ESBLs) or carbapenases. 2 The emergence of CRKP greatly limits the selection of antimicrobial therapy, often resulting in poor outcomes. 3 Meanwhile, antimicrobial drug susceptibility tests are time-consuming and expensive. A longer bacterial resistance testing cycle further aggravates the problem of carbapenem resistance. Therefore, developing rapid and reliable detection methods for the pathogenic bacteria resistant to antibiotics is crucial for the accurate and timely treatment. In the past few decades, matrix-assisted laser-resolved ionization time-of-flight mass spectrometry (MALDI-TOF MS), with resistance testing potential, has been widely used for rapid species identification of clinical microbes. It is faster, precise, and cost-effective than conventional microbial identification tests. 4 Previous studies used direct analysis of characteristic hydrolysis peaks to rapidly detect drug resistance and classification in the MALDI-TOF MS mass spectrometry data acquired from enzyme-mediated antimicrobial hydrolysis. 5 However, the method is complicated and requires an additional 3–4 h time. Meanwhile, for species identification, MALDI-TOF MS only depends on a few features, such as m/z and peak height, making it a fast and effective method. Though, the mass spectrum data information yet remains largely unutilized. Related studies discovered that extraneous variables could be removed to change the expression of MS data using machine learning to fully utilize the obscured information in mass spectrum data. By using intelligent data analysis, machine learning can maximize the mining of the information encoded in these mass spectra, exceeding most other methods. Several studies used machine learning algorithms to make full use of MALDI-TOF MS data for species identification and simplified antimicrobial resistance assays. 6 With self-supervised learning and continuously refining processes, machine learning can deeply mine the non-linear correlations in the data. It can swiftly evaluat -Abstract Truncated-
pharmacology & pharmacy,infectious diseases