- 1The Graduate School, Liberation Army Medical College, Beijing, China
- 2Department of Cardiology, Shaanxi Provincial People’s Hospital, Xian, Shanxi, China
- 3School of Medicine, Nankai University, Tianjin, China
- 4Department of Pulmonary and Critical Care Medicine, Liaocheng People’s Hospital, Liaocheng, Shandong, China
- 5Department of Pulmonary and Critical Care Medicine, The Eighth Medical Centre, Chinese People's Liberation Army (PLA) General Hospital, Beijing, China
Purpose: The aim of this study was to develop a valuable clinical prediction model for pneumonia-associated bloodstream infections (PABSIs).
Patients and methods: The study retrospectively collected clinical data of pneumonia patients at the First Medical Centre of the Chinese People’s Liberation Army General Hospital from 2019 to 2024. Patients who met the definition of pneumonia-associated bloodstream infections (PABSIs) were selected as the main research subjects. A prediction model for the probability of bloodstream infections (BSIs) in pneumonia patients was constructed using a combination of LASSO regression and logistic regression. The performance of the model was verified using several indicators, including receiver operating characteristic (ROC) curve, calibration curve, decision curve analysis (DCA) and cross validation.
Results: A total of 423 patients with confirmed pneumonia were included in the study, in accordance with the Inclusion Criteria and Exclusion Criteria. Of the patients included in the study, 73 developed a related bloodstream infection (BSI). A prediction model was constructed based on six predictors: long-term antibiotic use, invasive mechanical ventilation, glucocorticoids, urinary catheterization, vasoactive drugs, and central venous catheter placement. The areas under the curve (AUC) of the training set and validation set were 0.83 and 0.80, respectively, and the calibration curve demonstrated satisfactory agreement between the two.
Conclusion: This study has successfully constructed a prediction model for bloodstream infections associated with pneumonia cases, which has good stability and predictability and can help guide clinical work.
1 Introduction
Pneumonia is a prevalent acute respiratory infection that primarily affects the alveoli and distal bronchial tree. Pneumonia is classified according to the source of infection, with the distinction between community-acquired pneumonia (CAP) and hospital-acquired pneumonia (HAP). The latter category includes ventilator-associated pneumonia (VAP) (Torres et al., 2021). The most prevalent clinical manifestations of the disease include cough, dyspnea, chest pain, sputum production, and fatigue (Metlay et al., 1997; Lamping et al., 2002). Pneumonia represents a significant public health concern, with high morbidity rates observed across all age groups and a strong association with both short- and long-term mortality. The 2019 Global Burden of Disease (GBD) study revealed that approximately 489 million individuals worldwide had been affected by lower respiratory tract infections (LRTIs), encompassing conditions such as pneumonia and bronchiolitis. The demographic groups most affected by these infections were children below five years of age and adults above 70 years of age (Abbafati et al., 2020). Bloodstream infections (BSIs) are serious systemic infections that can readily lead to sepsis and multiple organ dysfunction syndrome (MODS), and have a high mortality rate. Consequently, bloodstream infections have become a significant challenge to public health globally (Kern and Rieg, 2020). Pneumonia represents the most common cause of BSIs, accounting for approximately half of all cases (Angus and van der Poll, 2013). In the event that patients diagnosed with pneumonia fail to receive timely or appropriate treatment, they may develop bloodstream infections or even sepsis as their condition progresses. The incidence of bloodstream infections in modern respiratory wards has been increasing year on year, and this may be related to a number of factors, including an ageing population, increased use of immunosuppressants and rising antibiotic resistance. Severe complications of pneumonia, such as bacteremia and sepsis, represent a significant threat to patient survival.
The progressive advancement of data measurement technology and analytical methods is facilitating a gradual transition in the field of medicine from the era of evidence-based medicine to the era of precision medicine. As a consequence of this transition, the role of clinical prediction models is becoming increasingly significant within the field of healthcare, thereby facilitating the development of more sophisticated and tailored healthcare services. Logistic regression, a widely employed statistical technique, plays a pivotal role in the development of clinical prediction models. However, due to the relatively small sample sizes of some studies, problems such as overfitting are common in model construction, which limits the generalizability and practical application performance of the model. To address this challenge, LASSO regression (Least Absolute Shrinkage and Selection Operator) is an effective strategy that introduces a penalty term to reduce the number of variables in the model, thereby enhancing the stability, interpretability and predictive performance of the model. Consequently, an increasing number of researchers are employing LASSO in conjunction with logistic regression to develop more precise and dependable clinical prediction models.
Currently, there is a dearth of clinical research on bloodstream infections in patients with pneumonia, and there is no consensus on the definition of pneumonia-associated bloodstream infections. In light of these considerations, this study has been designed with the objective of addressing this clinical challenge with a view to preventing pneumonia patients from developing bloodstream infections, thereby avoiding the deterioration of their condition to sepsis. The ultimate objective is a reduction in mortality. The objective of this study is to facilitate an early diagnosis and timely treatment of pneumonia-associated bloodstream infections, with the aim of improving outcomes.
2 Patients and methods
2.1 Study subjects
This study employed a retrospective research design to collate data on inpatients diagnosed with pneumonia at the PLA General Hospital’s First Medical Centre between May 2019 and May 2024.
The diagnostic criteria for pneumonia-associated bloodstream infections were as follows: (1) the patient presented with pneumonia as the initial symptom and a definitive diagnosis was reached; (2) the results of sputum and blood cultures were consistent with the same pathogen; and (3) all other potential sources of infection were excluded.
2.2 Inclusion criteria
The following inclusion criteria were employed:
1. Age > 18 years;
2. Pneumonia-associated clinical manifestations (any one of the following four manifestations is sufficient): (1) A recent cough with sputum or worsening of pre-existing respiratory symptoms, with or without purulent sputum, chest pain, dyspnea, and hemoptysis; (2) Fever; (3) Evidence of lung consolidation (voice raised, percussion dull) and/or wet rales; (4) Peripheral blood leukocyte count >10×109/L or <4×109/L, with or without nuclear shift to the left;
3. Chest imaging demonstrates the presence of new patchy infiltrates, lobular or segmental consolidation, ground-glass opacities, or interstitial changes, with or without pleural effusion;
4. Sputum and blood samples are obtained for culture.
2.3 Exclusion criteria
The exclusion criteria were as follows:
1. The patient was below the age of 18 years;
2. The diagnosis of pulmonary infection was uncertain;
3. Sputum and blood culture specimens were not obtained.
2.4 Study variables
The study collected patients’ personal information and clinical data by searching the hospital’s electronic medical record system. All baseline characteristics were recorded within a 24-hour period.
Data were collected from each patient for statistical analysis, including:(1) Demographic characteristics: sex, age, body mass index (BMI), date and time of admission; (2) Vital signs: body temperature, heart rate, respiratory rate, blood pressure (systolic and diastolic), and mean arterial pressure (MAP); (3) Personal lifestyle history: smoking and alcohol consumption; (4) History of chronic disease: Hypertension, Diabetes, coronary heart disease (CAD), Congestive Heart Failure, Atrial Fibrillation (AF), Arrhythmia, Hyperlipidemia, Cerebral Infarction, Chronic Respiratory Disease, Alzheimer’s Disease, Parkinson’s Disease, Thyroid Dysfunction, Renal Insufficiency, Benign Prostatic Hyperplasia (BPH), Gastrointestinal Bleeding, Chronic Gastritis, Reflux Esophagitis, Fatty Liver, Cirrhosis, Malignant Tumors, Hematologic Diseases, Lymphoma, Rheumatic Diseases of Immune System; (5) Surgical and treatment history: major surgery(Refers to surgery involving major organs or systems that carries a higher risk, longer recovery time, and may have a significant impact on the patient’s quality of life. This includes, but is not limited to: heart surgery, brain surgery, organ removal, etc.), recent surgery(Refers to surgery that the patient has had within the last 2 weeks), long-term antibiotic treatment(Refers to the use of antibiotics for ≥2 weeks and ≥1 type), radiotherapy, chemotherapy, hemodialysis and organ transplant history; (6) Use of medical devices: nasogastric tube, urinary catheter, central venous catheter placement and drainage tube; (7) Implementation of nutritional support and therapeutic measures: Total parenteral nutrition, invasive mechanical ventilation, sedatives, anticoagulants, vasopressors and glucocorticoids; steroids; (8) Laboratory Indicators: Sputum and blood microbial culture, pH, PaO2, PaCO2, oxygenation index, bicarbonate, lactate, white blood cell count, neutrophil percentage, lymphocyte percentage, hemoglobin, platelet count, C-reactive protein (CRP), interleukin-6 (IL-6), procalcitonin (PCT), serum albumin, total serum albumin, alanine aminotransferase (ALT), aspartate aminotransferase (AST), total bilirubin, direct bilirubin, urea, serum creatinine, serum potassium concentration, serum sodium ion concentration, serum chloride ion concentration, prothrombin time (PT), activated partial thromboplastin time (APTT), prothrombin time activity (PTA), fibrinogen, D-dimer.
2.5 Missing data handling
To address the issue of missing data during the data processing stage, the following measures were implemented. In cases where more than 50% of the data was missing, the decision was taken to delete the entire data set. Secondly, if the proportion of missing values for a specific variable exceeded 21%, the variable was excluded from the analysis. In cases where missing values were present for variables within the remaining case data, multiple imputation techniques were employed to ensure the integrity and accuracy of the data analysis.
2.6 Sample size estimation
This study is a single-center, retrospective cohort design aimed at developing a multifactorial regression model to predict pneumonia-associated bloodstream infections (PABSI). The study population was selected based on the criterion of concordance between sputum and blood culture results. Given the limitations of clinical resources, sample size estimation was conducted with practical feasibility in mind, ensuring that the statistical requirements were met.
2.6.1 Requirements for multifactor regression analysis (Peduzzi guidelines)
A maximum of six outcome-related predictor variables were intended to be included in the model to balance explanatory power and predictive accuracy, while maintaining simplicity. According to the Peduzzi guideline, each predictor variable should correspond to at least 10 outcome events (EPV ≥ 10) to prevent overfitting and ensure the stability and accuracy of model estimates. Therefore, with six predictor variables, a minimum of 60 outcome events (6 × 10 = 60) is required.
2.6.2 Calculation formula and process
To ensure adequate statistical power, sample size calculations were conducted as follows: assuming the incidence of bloodstream infection (BSI) in the exposed group (p1) is 20% and in the unexposed group (p2) is 5%, with a Type I error rate (α) set at 0.05 and a power (1 - β) of 80%. The sample size was calculated using the formula (Figure 1):
where Zα/2 is the critical value for the standard normal distribution (1.96 for α = 0.05), and Zβ is the critical value for statistical power (0.84 for 80% power). The sample size was calculated as n = 84 to ensure the study would meet the statistical requirements.
Based on the above calculations, the minimum sample size required for this study was 146 cases, with 73 cases in each of the exposed and unexposed groups, to achieve 80% power.
2.7 Statistical processing
The statistical analysis was conducted using SPSS 26.0 and R 4.4.1. Quantitative data are expressed as mean ± standard deviation, and for non-normally distributed data, the median and interquartile range are employed. Categorical data are displayed as frequencies or percentages. In order to conduct a comparative analysis of continuous variables, the t-test or Mann-Whitney U test was employed. Similarly, the chi-square test or Fisher’s exact test was used for the comparison of categorical variables. The data was randomly divided into a training set and a validation set in a 1:1 ratio.
In the feature selection phase, a combination of LASSO regression and logistic regression was employed. Initially, all data were subjected to univariate logistic regression analysis to identify potentially pertinent variables with a P-value less than 0.05. Subsequently, LASSO regression was performed, and the 1 standard error (1se) rule was employed to ascertain the optimal penalty coefficient and exclude non-representative variables. Subsequently, a further multivariate logistic regression analysis was conducted to evaluate the independent effects of these variables on the outcome, with the corresponding odds ratios (OR) and 95% confidence intervals (CI) subsequently calculated.
Once the regression equation had been established, a line chart was employed to represent the prediction model. An ROC curve was constructed for the purpose of evaluating the model’s performance, and the AUC value was calculated for the purpose of evaluating its diagnostic ability. The Hosmer-Lemeshow test was employed to evaluate the model’s goodness of fit, while a calibration curve was plotted to assess the calibration effect of the model. Moreover, decision curve analysis (DCA) was employed to determine the clinical utility of the model. In this study, a statistically significant level was set at P < 0.05.
3 Results
3.1 Patient demographics
A total of 423 patients were enrolled in this study, including 211 patients in the training group and 212 patients in the test group. This sample size significantly exceeded the calculated minimum requirement, thereby ensuring the statistical validity of the study. All patient data were complete and suitable for analysis.
A comparison of the baseline data from the two cohorts revealed significant differences between the two groups with respect to partial pressure of arterial carbon dioxide (Paco2; P = 0.036), lymphocyte percentage (P = 0.040), history of diabetes (P = 0.008), and use of drainage tubes (P = 0.025). No significant differences were identified for the remaining indicators (all p-values were greater than 0.05). For further details, please refer to Table 1.
3.2 Factor screening
In this study, a univariate logistic regression analysis was initially performed on all included variables, which encompassed demographic characteristics, vital signs, personal history of lifestyle habits, history of chronic diseases, history of surgery and treatment, use of medical equipment, nutrition and supportive care, use of drugs, and laboratory test indicators.
The results of the univariate logistic regression analysis demonstrated that the following variables were significantly associated with pneumonia-associated bloodstream infections (PABSIs) (P < 0.05): coronary heart disease, long-term use of antibiotics, nasogastric tube use, urinary catheter use, and central venous catheter, invasive mechanical ventilation, and sedative drug use, use of vasoactive drugs, use of glucocorticoids, body temperature, heart rate, diastolic blood pressure, and mean arterial pressure (MAP), oxygenation index, urea. For further details, please refer to Table 2.
The use of LASSO regression analysis identified six possible predictors that were found to be significantly associated with pneumonia-associated bloodstream infections (PABSIs) when the λ value was set to 1 standard error (1-se). The identified predictors were long-term antibiotic use, urinary catheter use, central venous catheterization, invasive mechanical ventilation, vasodilator use and glucocorticoid use. For further details, please refer to Figure 2.

Figure 2. Overview of LASSO coefficients for 15 risk factors (A). Six risk factors were selected using LASSO regression analysis (B).
Further multivariate logistic regression analysis was used to verify the six potential predictors selected by LASSO regression. The results of the analysis showed that the following factors were significantly associated with the risk of pneumonia-associated bloodstream infections (PABSIs): invasive mechanical ventilation, glucocorticoid use, use of long-term antibiotics, use of vasoactive drugs, urinary catheter use, and central venous catheter use. See Table 3 for details.
3.3 Construction of a prediction model
Based on the results of the multifactor logistic regression analysis, a prediction model was constructed to predict the risk of pneumonia-associated bloodstream infections. The model is presented visually in a line graph for ease of clinical application and interpretation. For further details, please refer to Figure 3.

Figure 3. Nomogram for predicting the risk probability of pneumonia-associated bloodstream infections.
3.4 Model evaluation
To comprehensively evaluate the performance of the nomogram we constructed, we used a number of statistical validation methods. These included receiver operating characteristic (ROC) curve analysis, area under the curve (AUC) calculation, calibration curve analysis and decision curve analysis (DCA). Together, these methods provide insight into the predictive accuracy of the nomogram, its calibration and its use in clinical decision making.
3.4.1 ROC curve and AUC analysis
In order to evaluate the discriminatory power of the nomogram that we had constructed, we initially employed a receiver operating characteristic (ROC) curve analysis. The area under the curve (AUC) values of the nomogram in the training and validation sets reached 0.83 and 0.80 (see Figure 4), respectively. These results demonstrate that the model exhibits excellent discriminative power on both the training and validation sets. The high AUC values also indicate that the model has high accuracy in identifying pneumonia-associated bloodstream infections (PABSIs).

Figure 4. ROC curve of nomogram division with binomial verification of the training group (A) and test group (B). The AUC of the training group and the test group was 0.83 and 0.80, respectively.
3.4.2 Calibration curve
In order to ascertain the predictive accuracy of the nomogram, a calibration curve analysis was conducted. The results are presented in Figure 5, which demonstrate that the nomogram exhibits robust predictive performance on both the training and validation sets. The calibration curve is in close proximity to the diagonal, indicating that the probability predicted by the model is highly consistent with the actual observed probability of event occurrence. These findings provide further confirmation of the model’s predictive accuracy.
3.4.3 Decision curve analysis
Decision curve analysis (DCA) further verified the value of the nomogram in clinical applications, as shown in Figure 6. The results of the DCA showed that the model could achieve above average net benefits regardless of the threshold probability setting, indicating that the nomogram has significant advantages in clinical decision making.

Figure 6. Decision Curve Analysis (DCA) curves of the training group (A) and the test group (B). The net benefit of the nomogram was verified by the DCA curves of the two cohorts.
3.4.4 Cross validation
To further validate the model, 5-fold cross-validation was performed. The AUC scores for each fold were as follows: 0.84, 0.77, 0.71, 0.87, and 0.82, with an average AUC of 0.80 ± 0.05. These results indicate that the model showed stable performance across different subsets of the data, with good generalizability. The cross-validation results further support the robustness and reliability of the predictive model.
4 Discussion
This study employed a combination of logistic regression and LASSO regression to develop a clinical prediction model for pneumonia-associated bloodstream infections (PABSIs) with notable success. The model employs clinical laboratory indicators and is trained and evaluated using medical record data. The model is capable of accurately predicting the risk of pneumonia patients developing bloodstream infections. The model demonstrated excellent performance on the training and validation sets, with AUC values of 0.83 and 0.80, respectively, confirming its excellent discrimination and prediction capabilities. Furthermore, the calibration curve and decision curve analysis demonstrate that the model exhibits high accuracy in actual prediction and clinical application, providing a robust foundation for clinical decision-making.
To ensure the adequacy of the sample size, this study performed sample size estimation based on the Peduzzi criteria, ensuring sufficient statistical power for the research. In the total sample, the number of outcome events per predictor variable was 12.17 (73 ÷ 6), which is significantly higher than the recommended minimum of 10 events per variable by the Peduzzi criterion, thereby ensuring the robustness and reliability of the model.
For variable selection, this study employed LASSO regression. By introducing a penalty term, LASSO regression effectively selects and shrinks variables, reducing the risk of over-parameterization and mitigating the overfitting problems commonly encountered with small sample sizes, thereby enhancing the model’s predictive capability.
To further assess the robustness and accuracy of the model, the study used the Hosmer-Lemeshow test and calibration curves. The results indicated that the model exhibited good fit and strong predictive ability in both the training and validation sets. The validation set included 31 patients who developed pneumonia-associated bloodstream infections (PABSI), with an event-to-variable ratio of 5.17:1, which meets the basic requirements for regression analysis. The results from the validation set further reinforced the reliability and stability of the model.
In summary, this study ensured the robustness of the model through LASSO regression, Hosmer-Lemeshow test, and calibration curves, thus effectively assessing the risk of pneumonia-associated bloodstream infections.
The results of the study analysis identified several significant predictors associated with pneumonia-associated bloodstream infections (PABSIs), including long-term antibiotic use, use of urinary and central venous catheters, use of invasive mechanical ventilation, use of vasopressors, use of glucocorticoids, and use of urinary catheters.
In recent years, public health authorities and scientific societies around the world have articulated profound concern regarding the escalating issue of antimicrobial resistance (AMR). Antimicrobial resistance (AMR) has been widely acknowledged as a significant global health challenge of the 21st century (Tabah et al., 2023). In clinical practice, the misuse of antibiotics has been the subject of considerable attention, and their overuse has been identified as a significant contributing factor to the development of drug resistance (Read and Woods, 2014). The long-term use of antibiotics can result in selective changes to the structure of the human microbiota, whereby susceptible strains are suppressed or eliminated, while resistant strains survive and proliferate. This increase in selective pressure has resulted in an increase in the frequency of multidrug-resistant strains (Meng et al., 2021). These resistant strains demonstrate reduced susceptibility to the majority of antimicrobial drugs. As drug resistance increases, the treatment of lung infections becomes more challenging, leading to an increase in the prevalence of pneumonia-associated bloodstream infections. The results of the present study demonstrate a robust correlation between long-term antibiotic utilization and pneumonia-associated bloodstream infections, which is consistent with the findings of prior studies in this area. Such an association may be attributed to a dysbiosis, or disruption, of the patient’s endogenous flora, thereby elevating the risk of bloodstream infections among patients diagnosed with pneumonia. These findings have significant clinical implications, emphasizing the necessity for more precise and efficacious prevention and treatment strategies. The introduction of a defined daily dose (DDD) for each individual patient represents a crucial step in the effective management of those with pneumonia.
Device-associated infections have been shown to remain a significant challenge in clinical practice, primarily due to their high prevalence and potential for mortality (Blot et al., 2005; Li, 2015). In particular, urinary and central venous catheters provide a pathway for bacteria to enter the bloodstream, serving as a common conduit for hospital-acquired infections. In this study, we focused on the cohort of patients with pneumonia and found a notable association between urinary and central venous catheters and pneumonia-associated bloodstream infections after excluding other potential sources of bloodstream infections. This association may be due to the fact that patients with these catheters tend to have poorer overall health, higher disease severity and poorer prognosis. In addition, these patients often require prolonged periods of bed rest, which can affect their hemodynamic status.
Invasive mechanical ventilation is a fundamental aspect of respiratory and critical care medicine. It allows patients with respiratory distress to maintain adequate oxygenation and carbon dioxide elimination, thereby improving oxygenation and reducing the incidence of hypoxemia. In patients with fatigued or weak respiratory muscles, invasive mechanical ventilation can reduce the burden on the respiratory muscles and provide the necessary respiratory support for other treatments. However, the establishment of an artificial airway also deprives the patient of the airway’s natural defense mechanisms, such as the ability to heat, humidify, and clear the inspired air. This increases the risk of airborne pathogens entering the patient’s airways and lungs. In addition, prolonged use may result in tracheal damage (Eggimann and Pittet, 2001). Current research suggests that the presence of an endotracheal tube is the most significant risk factor for the development of ventilator-associated pneumonia (VAP) (Walaszek et al., 2016). The present study also identified invasive mechanical ventilation as a risk factor for the progression of pneumonia patients to pneumonia-associated bloodstream infections. Because the establishment of an artificial airway creates a direct pathway between the lungs and the outside world, increasing the possibility of invasion by pathogenic microorganisms from the outside world, it may further exacerbate pneumonia and thus increase the risk of bloodstream infections.
Vasopressors are of great importance in critical care medicine, particularly in the context of resuscitation and treatment of shock. These drugs can significantly improve the patient’s hemodynamics, control blood pressure, and optimize blood flow to vital organs (Annane et al., 2018). In patients with pneumonia, pathogens of pulmonary infection, including bacteria, viruses or fungi, can enter the bloodstream through various mechanisms and cause bloodstream infections. During infection, pathogens such as viruses or bacteria can directly infect epithelial cells, causing cell death (Kenney et al., 2017). Tissue damage and inflammatory responses can lead to disruption of the alveolar-capillary barrier, allowing pathogens to cross the damaged lung tissue and enter the bloodstream (Long et al., 2022). In addition, the inflammatory response in the infected area may lead to local vasodilation and increased vascular permeability, facilitating pathogen invasion. Once the pathogen enters the blood stream, it can cause a systemic infection, which is a bloodstream infection. Studies have shown that the use of vasoactive drugs is significantly associated with the incidence of pneumonia-associated bloodstream infections. This may facilitate the occurrence of bloodstream infections by affecting the vasodilator and permeability properties of blood vessels due to damage to the alveolar-capillary barrier.
The pro-inflammatory mediator storm that critically ill patients with pulmonary infections may experience represents a complex pathological process in which an imbalance between pro- and anti-inflammatory mediators can lead to a variety of adverse effects (Li et al., 2020). Glucocorticoids help to attenuate the inflammatory response in critically ill patients by inhibiting the release of pro-inflammatory mediators, thereby regulating excessive immune responses and improving clinical symptoms (Patel et al., 2020). However, this treatment strategy may also suppress the patient’s immune function, thereby reducing their ability to resist new infections. This may provide opportunities for invasion and spread of pathogenic microorganisms, and may even lead to immunosuppression and uncontrolled inflammatory responses. The results of this study demonstrate an increased likelihood of pneumonia-associated bloodstream infections in patients treated with glucocorticoids. This may be due to the immunosuppressive effects of glucocorticoids, which may increase the likelihood of such infections.
The particular advantage of this model over existing studies is that it combines the advantages of LASSO regression in feature selection with the strengths of logistic regression in predictive ability, thereby enhancing the stability and predictive accuracy of the model. This is evidenced by the high AUC and exemplary calibration curve performance of this model. In addition, this study focuses on the risk of pneumonia patients developing bloodstream infections, which is a critical aspect of pneumonia patient management. Given the current lack of research on individuals with pneumonia-associated bloodstream infections, this study seeks to assess the risk of such infections by developing a predictive model to facilitate the identification of early-stage pneumonia patients who may subsequently develop bloodstream infections. This will facilitate the implementation of prompt treatment measures and contribute to a reduction in mortality rates.
Although this model has demonstrated considerable efficacy in predicting pneumonia-associated bloodstream infections, it is important to acknowledge the limitations inherent in its evaluation. First, because this study used a retrospective cohort study design, there is a possibility of selection and information bias. Second, the study sample was derived from a single center, which may limit the generalizability of the model. Therefore, it is recommended that future studies use a multicenter prospective study design to further validate and optimize this model. Furthermore, future studies could explore the potential of combining this model with other biomarkers or imaging features to improve the accuracy of prediction and the value of clinical application. At the same time, external validation of the model and its applicability in different populations are important directions for future research.
5 Conclusion
In conclusion, the clinical prediction model for pneumonia-related bloodstream infections developed in this study, based on LASSO regression and logistic regression, provides clinicians with a precise risk assessment tool specifically for the management of pneumonia patients. The model not only has high predictive accuracy, but also has good calibration performance, which allows clinicians to identify high-risk patient groups early. The implementation of timely and effective preventive and treatment measures based on this tool is expected to significantly improve the clinical prognosis of patients.
Data availability statement
Ethical approval has been signed, and the data from this research project will be kept only by the researcher, will not be uploaded to public databases, and will not be shared with other researchers. Requests to access the datasets should be directed to ZZ, purple_zzt@163.com.
Ethics statement
The studies involving humans were approved by Ethics Committee of the First Medical Center of the PLA General Hospital (Approval No. S-603-01). The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and institutional requirements.
Author contributions
ZZ: Data curation, Investigation, Methodology, Software, Validation, Writing – original draft, Formal analysis, Visualization. SL: Investigation, Methodology, Validation, Writing – review & editing, Formal Analysis. FQ: Formal analysis, Methodology, Validation, Writing – review & editing. YW: Investigation, Visualization, Writing – review & editing. MS: Methodology, Validation, Writing – review & editing. XG: Conceptualization, Project administration, Supervision, Writing – review & editing.
Funding
The author(s) declare that no financial support was received for the research and/or publication of this article.
Acknowledgments
The authors would like to express their gratitude to Dr. Changxin Liu for his assistance with the details of data collection. They would also like to acknowledge the valuable suggestions provided by Dr. Junyu Ding and Dr. Huijun Zhao regarding data analysis. Thanks to the Generative Artificial Intelligence Tool (Chat GPT) for assistance in embellishing the English expression. The tool uses GPT-4o sourced from OpenAI. All content edited through this tool has been checked for accuracy and plagiarism to ensure research rigor. Detailed records cannot be provided as the specific prompts for each interaction with GPT-4o were not recorded, but all content using the generated AI strictly followed academic norms, and the prompts were mainly related to text optimization, grammar checking, and language fluency enhancement to ensure compliance with academic standards.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that Generative AI was used in the creation of this manuscript. Thank you for your review and attention to my submission. During the preparation of this manuscript, I utilized the generative AI tool (GPT-4o) to refine and optimize the English expression. It is important to note that I did not rely on this tool for content creation or to generate original ideas. All research data, analysis results, and discussions were independently conducted by me, and I have strictly adhered to academic standards.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Abbafati, C., Abbas, K. M., Abbasi, M., Abbasifard, M., Abbasi-Kangevari, M., Abbastabar, H., et al. (2020). Global burden of 369 diseases and injuries in 204 countries and territories 1990-2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet 396, 1204–1222.
Angus, D. C., van der Poll, T. (2013). Severe sepsis and septic shock. New Engl. J. Med. 369, 840–851. doi: 10.1056/NEJMra1208623
Annane, D., Ouanes-Besbes, L., de Backer, D., Du, B., Gordon, A. C., Hernández, G., et al. (2018). A global perspective on vasoactive agents in shock. Intensive Care Med. 44, 833–846. doi: 10.1007/s00134-018-5242-5
Blot, S. I., Depuydt, P., Annemans, L., Benoit, D., Hoste, E., de Waele, J. J., et al. (2005). Clinical and economic outcomes in critically ill patients with nosocomial catheter-related bloodstream infections. Clin. Infect. Dis. 41, 1591–1598. doi: 10.1086/497833
Eggimann, P., Pittet, D. (2001). Infection control in the ICU. Chest 120, 2059–2093. doi: 10.1378/chest.120.6.2059
Kenney, A. D., Dowdle, J. A., Bozzacco, L., McMichael, T. M., St Gelais, C., Panfil, A. R., et al. (2017). Human genetic determinants of viral diseases.Ann. Rev. Genet. 51, 241–263. doi: 10.1146/annurev-genet-120116-023425
Kern, W. V., Rieg, S. (2020). Burden of bacterial bloodstream infection-a brief update on epidemiology and significance of multidrug-resistant pathogens. Clin. Microbiol. Infection 26, 151–157. doi: 10.1016/j.cmi.2019.10.031
Lamping, D. L., Schroter, S., Marquis, P., Marrel, A., Duprat-Lomon, I., Sagnier, P. P. (2002). The community-acquired pneumonia symptom questionnaire - A new, patient-based outcome measure to evaluate symptoms in patients with community-acquired pneumonia. Chest 122, 920–929. doi: 10.1378/chest.122.3.920
Li, H. M. (2015). Distribution and clinical characteristics of pathogenic bacteria causing catheter-related bloodstream infections. Pakistan J. Pharm. Sci. 28, 1163–1166.
Li, X., Chen, Z., Ji, N., Zhang, M. (2020). Recent research progress of acute respiratory distress syndrome in COVID-19 patients. Int. Respir. J. 40, 1690–1696. doi: 10.3760/cma.j.cn131368-20200320-00195
Long, M. E., Mallampalli, R. K., Horowitz, J. C. (2022). Pathogenesis of pneumonia and acute lung injury. Clin. Sci. 136, 747–769. doi: 10.1042/cs20210879
Meng, X., Fu, J., Zheng, Y., Qin, W., Yang, H., Cao, D., et al. (2021). Ten-year changes in bloodstream infection with acinetobacter baumannii complex in intensive care units in eastern China: A retrospective cohort study. Front. Med. (Lausanne) 8. doi: 10.3389/fmed.2021.715213
Metlay, J. P., Fine, M. J., Schulz, R., Marrie, T. J., Coley, C. M., Kapoor, W. N., et al. (1997). Measuring symptomatic and functional recovery in patients with community-acquired pneumonia. J. Gen. Internal Med. 12, 423–430. doi: 10.1046/j.1525-1497.1997.00074.x
Patel, S. K., Saikumar, G., Rana, J., Dhama, J., Yatoo, M. I., Tiwari, R., et al. (2020). Dexamethasone: A boon for critically ill COVID-19 patients? Travel Med. Infect. Dis. 37. doi: 10.1016/j.tmaid.2020.101844
Read, A. F., Woods, R. J. (2014). Antibiotic resistance management. Evol. Med. Public Health 2014, 147. doi: 10.1093/emph/eou024
Tabah, A., Buetti, N., Staiquly, Q., Ruckly, S., Akova, M., Aslan, A. T., et al. (2023). Epidemiology and outcomes of hospital-acquired bloodstream infections in intensive care unit patients: the EUROBACT-2 international cohort study. Intensive Care Med. 49, 178–190. doi: 10.1007/s00134-022-06944-2
Torres, A., Cilloniz, C., Niederman, M. S., Menéndez, R., Chalmers, J. D., Wunderink, R. G., et al. (2021). Pneumonia. Nat. Rev. Dis. Primers 7. doi: 10.1038/s41572-021-00259-0
Keywords: pneumonia, bloodstream infections, bacteremia, risk factor, early diagnosis, prediction model
Citation: Zhou Z, Liu S, Qu F, Wei Y, Song M and Guan X (2025) Development and validation of a clinical prediction model for pneumonia - associated bloodstream infections. Front. Cell. Infect. Microbiol. 15:1531732. doi: 10.3389/fcimb.2025.1531732
Received: 20 November 2024; Accepted: 03 March 2025;
Published: 18 March 2025.
Edited by:
Sam Ebenezer, Sathyabama Institute of Science and Technology, IndiaReviewed by:
Saranya Somusundaram, Dr. N.G.P. Arts and Science College, IndiaTamilmaran Nagarajan, Saveetha University, India
Copyright © 2025 Zhou, Liu, Qu, Wei, Song and Guan. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Xizhou Guan, gxz301@126.com
‡ORCID: Shangshu Liu, orcid.org/0009-0001-0097-6009, Manya Song, orcid.org/0009-0007-0970-3537