Skip to main content

ORIGINAL RESEARCH article

Front. Microbiol.
Sec. Systems Microbiology
Volume 15 - 2024 | doi: 10.3389/fmicb.2024.1508030
This article is part of the Research Topic Artificial Intelligence in Pathogenic Microorganism Research View all 14 articles

A clinical prediction model to distinguish between KP colonisation and infection

Provisionally accepted
  • 1 Department of Infectious Diseases, The First Affiliated Hospital of China Medical University, Shenyang, China
  • 2 Department of Infectious Diseases, The First Affiliated Hospital of Xiamen University,Xiamen,China, Xiamen, Fujian Province, China

The final, formatted version of the article will be published soon.

    Objective: To develop a machine learning-based prediction model to assist clinicians in accurately determining whether the detection of Klebsiella pneumoniae (KP) in sputum samples indicates an infection, facilitating timely diagnosis and treatment. Research Methods: A retrospective analysis was conducted 286 cases with sputum cultures yielding only KP were included, comprising 67 cases in the colonization group and 219 cases in the infection group. Antimicrobial susceptibility testing was performed on the included strains, and through univariate logistic regression analysis, 15 key influencing factors were identified. These factors were used to construct the model, which was evaluated using accuracy, precision, recall, F1 score, AUC value, and Brier score. Results: Antimicrobial susceptibility testing indicated that the resistance rates for penicillins, cephalosporins, carbapenems, and quinolones were significantly higher in the infection group compared to the colonization group (all P < 0.05). Six predictive models were constructed using 15 key influencing factors, including Classification and Regression Trees (CART), C5.0, Gradient Boosting Machines (GBM), Support Vector Machines (SVM), Random Forest (RF), and Nomogram. The Random Forest model performed best among all indicators (accuracy 0.93, precision 0.98, Brier Score 0.06, recall 0.72, F1 Score 0.83, AUC 0.99).The importance of each factor was demonstrated using mean decrease in Gini. "Admitted with a diagnosis of respiratory infectious disease" (8.39) was identified as the most important factor in the model, followed by "Hypoalbuminemia" (7.83), then "ESBL" (7.06), "Electrolyte Imbalance" (5.81), "Age > 62 years" (5.24), "The number of Positive Sputum Cultures for KP > 2" (4.77), and being bedridden (4.24). Additionally, invasive procedures (such as history of tracheostomy, use of ventilators for > 96 hours, and craniotomy) were also significant predictive factors. The Nomogram indicated that CRKP, presence of a nasogastric tube, admission to the ICU, and history of tracheostomy were important factors in determining KP colonization. Conclusion: The Random Forest model effectively distinguishes between infection and colonization status of KP, while the Nomogram visually presents the predictive value of various factors, providing clinicians with a reference for formulating treatment plans.

    Keywords: KP, Sputum culture, Risk factors, machine learning, Clinical prediction model

    Received: 08 Oct 2024; Accepted: 30 Dec 2024.

    Copyright: © 2024 Zhang, Zhang, Zhang, Xu, Zhang and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence:
    Jingping Zhang, Department of Infectious Diseases, The First Affiliated Hospital of China Medical University, Shenyang, China
    Xin Zhang, Department of Infectious Diseases, The First Affiliated Hospital of China Medical University, Shenyang, China

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.