- 1Department of Neurosurgery, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
- 2Wenzhou Medical University, Wenzhou, China
Background: Postoperative pneumonia (POP) is one of the primary complications after aneurysmal subarachnoid hemorrhage (aSAH) and is associated with postoperative mortality, extended hospital stay, and increased medical fee. Early identification of pneumonia and more aggressive treatment can improve patient outcomes. We aimed to develop a model to predict POP in aSAH patients using machine learning (ML) methods.
Methods: This internal cohort study included 706 patients with aSAH undergoing intracranial aneurysm embolization or aneurysm clipping. The cohort was randomly split into a train set (80%) and a testing set (20%). Perioperative information was collected from participants to establish 6 machine learning models for predicting POP after surgical treatment. The area under the receiver operating characteristic curve (AUC), precision-recall curve were used to assess the accuracy, discriminative power, and clinical validity of the predictions. The final model was validated using an external validation set of 97 samples from the Medical Information Mart for Intensive Care IV (MIMIC-IV) database.
Results: In this study, 15.01% of patients in the training set and 12.06% in the testing set with POP after underwent surgery. Multivariate logistic regression analysis showed that mechanical ventilation time (MVT), Glasgow Coma Scale (GCS), Smoking history, albumin level, neutrophil-to-albumin Ratio (NAR), c-reactive protein (CRP)-to-albumin ratio (CAR) were independent predictors of POP. The logistic regression (LR) model presented significantly better predictive performance (AUC: 0.91) than other models and also performed well in the external validation set (AUC: 0.89).
Conclusion: A machine learning model for predicting POP in aSAH patients was successfully developed using a machine learning algorithm based on six perioperative variables, which could guide high-risk POP patients to take appropriate preventive measures.
Introduction
Aneurysmal subarachnoid hemorrhage (aSAH), primarily caused by the rupture of intracranial aneurysms, is a devastating disease with high morbidity and mortality (1, 2). The total occurrence of aSAH is about 9.1 per 100,000 people, accounts for 2–7% of all strokes (3, 4). Despite the advances in the treatment of aneurysmal subarachnoid hemorrhage (aSAH), such as endovascular coil embolization and microsurgery, postoperative complications can significantly impact the prognosis of patients, prolonged the time of hospitalization and lead to increased economic costs (2, 5).
Among these complications, postoperative pneumonia (POP), which occurs in 10–30% aSAH patients after surgical treatment, is considered a critical complication closely associated with the prognosis of aSAH patients (2, 6–10). Our previous study have found a close relationship between smoking history, delayed cerebral ischemia (DCI), mechanical ventilation time (MVT), Glasgow Coma Scale (GCS), Albumin, c-reactive protein (CRP). Using these variables, our team initially constructed a nomogram model to predicat POP for aSAH patients (11). But with the further research, numerous scholars contend many composite indicators were very effective for postoperative complications prediction, such as neutrophil-to-lymphocyte Ratio (NLR), prognostic nutrition index (PNI), neutrophil-to-albumin Ratio (NAR), CRP-to-albumin ratio (CAR), D-Dimer-to-Albumin Ratio (DAR) (2, 5, 12, 13). The aforementioned indicators, however, were not incorporated as variables in our previous study. The evaluation of the validity of these variables was therefore deemed necessary in order to ascertain their potential for enhancing the predictive power of our model.
In addition, although previous studies have suggested that conventional LR can provide a clinical prediction model that is easy to interpret, when conventional LR is used for complex multivariate non-linear relationships, complex transformations are often required owing to low robustness and multicollinearity between variables (14). Therefore, recent work has highlighted the potential of machine learning (ML) algorithms for stroke-related complications in stroke patients (15, 16). Savarraj et al. suggested that ML models significantly outperform conventional LR in predicting functional outcomes and has the potential to improve SAH management (17). The ML models possess the potential to outperform conventional linear or logistic regression models due to their exceptional ability in identifying intricate and nonlinear relationships among a multitude of prognostic variables (16). Thus, in this study, we conducted five ML prediction model, which contains support vector machine (SVM), logistic regression (LR), random forest (RF), multilayer perceptron (MLP), K-nearest neighbor (KNN) and extreme gradient boosting (XGBoost) for the prediction of POP within 30 days in aSAH patients after surgical treatment. The best model will be verified in the MIMIC-IV database. To supply clinical basis for the prevention and therapy of POP (18).
Materials and methods
Study population
Data for aged more than 18 years aSAH patients treated with intracranial aneurysm embolization or aneurysm clipping in the First Affiliated Hospital of Wenzhou Medical University from 1 June 2017 to 4 February 2022 were retrospectively collected. Patients will be included in this study if they meet the following requirements: (1) aged 18 years or older, who suffered their first SAH ever, admitted to our hospital within 24 h of symptom onset; (2) All aSAH patients should be confirmed by computed tomographic (CT), computed tomographic angiography (CTA) and digital subtraction angiography (DSA); (3) endovascular coiling of the aneurysm was performed; and (4) The definition of POP in this study refers to lower respiratory tract infections that occur within 30 days after endovascular coiling procedure in our hospital and the diagnosis of POP should follow modified Centers for Disease Control and Prevention (CDC) criteria, which was characterized by the following criteria: (1) A probable case of POP cannot be diagnosed based solely on the admission or follow-up chest x-ray, and it cannot be attributed to another diagnosis. (2) A proven case of POP is confirmed when there is a documented change in diagnosis observed on at least one chest x-ray image. For the purpose of this study, patients meeting the modified CDC criteria for probable/proven pneumonia were considered as cases, while those with pre-existing pneumonia prior to admission were excluded from analysis (19). A total of 893 patients met the above conditions, among which 187 patients were excluded for the following reasons: (1) 115 patients with other potential causes of SAH (96 patients with arteriovenous malformation, 10 patients with craniocerebral trauma, and 9 patients with hypertensive intracerebral hemorrhage); (2) 14 patient with history of malignant tumors, severe heart, hepatic or renal failure; (3) 7 patient previous use of antibiotics, systemic glucocorticoids, immunosuppressive agents, or immunotherapy within 1 month before admission; (4) proven POP had a confirmed change in diagnosis on at least one image of the chest x-ray, and 35 patients with pneumonia before admission were excluded (2, 19); (5) 14 aSAH patients who died within 24 h after surgery or lack complete case records were excluded. Finally, 706 patients were included in the internal cohort.
The external cohort was extracted from the Medical Information Mart for Intensive Care IV (MIMIC-IV) (version 2.2) database, derived from a large, freely accessible critical care database comprising 299,712 patients who were admitted to the ICU or the emergency department of Beth Israel Deaconess Medical Center between 2008 and 20191 (11). For the external cohort, individuals who met the following criteria were included: (1) those aged 18 years or older and (2) those with a diagnosis matching the International Classification of Diseases (ICD) code associated with aSAH among MIMIC-IV (ICD-9 code 430, and ICD-10 codes I60 to I609). A total of 1,172 patients fulfilled these conditions. Exclusion criteria consisted of: (1) non-first admission cases (n = 33); (2) history of malignant tumors, severe heart, hepatic or renal failure (n = 120); (3) length of stay less than 24 h in hospitalization (n = 84); and (4) patients without clinical data available for analysis (n = 838). Ultimately, a validation set comprising of 97 patients was selected from this group to validate the ML model. Figure 1 provides an overview of this process.
Variable selection
Collected variables included: (1) patients demographics (age, gender, history of smoking, alcoholics); (2) GCS, Hunt-Hess grades, modified Fisher (mFS) grade and WFNS grade on admission; (3) past medical history (hypertension, diabetes mellitus), coronary heart disease (CHD), chronic obstructive pulmonary disease (COPD), Stent assisted endovascular treatment (EVT), with craniotomy; (4) aneurysm location [anterior communicating artery (ACoA), internal carotid artery (ICA), middle cerebral artery (MCA), posterior communicating artery (PCoA), vertebrobasilar artery (VBA)]; (5) laboratory results were obtained within 24 h after admission in the context of a first examination [albumin, hemoglobin, neutrophils, monocytes, lymphocytes, uric acid, total cholesterol, triglycerides, neutrophils, monocytes, mean corpuscular volumec-reactive (MCV), CRP], and the composite index calculated by these laboratory results NLR = Neutrophil counts (*109 /L)/Lymphocyte count (*109 /L), PNI = Albumin (g/L) +10*Lymphocyte count (*109 /L), NAR = Neutrophil counts (*109 /L)/Albumin (g/L), CAR = CRP (mg/L) / Albumin (g/L), D-Dimer/Albumin Ratio (DAR) = D-dimer level (μg/mL)/Albumin (g/L).
The developed ML model of risk factors associated with POP in aSAH patients is relied on statistically significant results obtained through LASSO regression and multivariate logistic regression analysis (p < 0.05) (20).
ML model construction
A total of 706 patients with aSAH from our center were enrolled. We randomly divided the patients as training cohort (N = 565) and validation cohort (N = 141) according to a ratio of 80–20%. The training cohort was utilized to develop Linear models (logistic regression (LR), support vector machine (SVM)) and Nonlinear models [XGBoost, k-nearest neighbor (KNN), random forest (RF), and multilayer perceptron (MLP)] (21–23). XGBoost model was constructed using the xgboost package.2 The remaining five models were established via Scikit-learn package.3 To develop an unbiased assessment of model performance, we performed 5 random shuffles of 5-fold cross-validation, as shown in Figure 1. Each iteration used a different stratified fold for model evaluation, and the remaining folds were used for model training (24). Subsequently, we recorded Area Under The Curve (AUC) to compare each ML models. Data processing and the ML process are summarized in Figure 1.
After the model was established, the SHapley Additive exPlanations (SHAP) package in Python was used to explain the model by analyzing two cases. The SHAP package interpreted the output of the machine learning model using a game-theoretic approach (25). SHAp values quantify the association of a variable with the outcome of a single patient, and the mean absolute SHAP value across all patients is reported as the SHAP value of avariable (26).
Statistical analysis
Continuous variables are represented in terms of mean and standard deviation (SD), while categorical variables are expressed in terms of frequency and percentage. Normally distributed and non-normally distributed variables were presented in the form of mean ± standard deviation and median (interquartile range). LASSO regression model was used to deal with collinearity of candidate variables, and the optimal predictive variable was selected (27). A multivariate logistic regression analysis was generated using predictors selected from the lasso analysis. Te features were represented by odds ratio (OR) and 95% confidence interval (CI). A two-tailed P value <0.05 was considered statistically significant. The stability of the columns in the validation queue is calculated using 1,000 boot replicas and a relative corrected C-index is calculated. All Statistical analyses were performed using R version 3.6.3 and python version 3.7.
Results
Demographic characteristics
Table 1 shows the clinical characteristics of the study population. A total of 706 aSAH patients were included in this study, which were divided into a training cohort (N = 565) and a validation cohort (N = 141). The number of patients with POP were 106 (16%) and 27 (12%) in training and testing cohorts, and men comprised 190 (34%) and 54 (38%) patients in the two groups, respectively. The median age in training cohort and testing cohorts were 55 and 59 years. The baseline data exhibited a high degree of consistency between the two groups (p > 0.05).
Feature selection
As partially relevant or less important features may negative affect performance of machine learning models, we performed feature selection by using LASSO regression. 39 variables with missing values <20% was extracted after interpolation, and 6 potential predictors were finally screened from the LASSO regression analysis (Figure 2). We included these 6 variables in multi-factor logistic regression and found that all 6 variables were statistically significant (Table 2). These 6 features were listed as follows: GCS (OR, 0.72; 95% CI, 0.58–0.73; p < 0.001), MVT (OR, 1.11; 95% CI, 1.03–1.21; p = 0.01), Albumin (OR, 0.90; 95% CI, 0.85–0.96; p < 0.001), NAR (OR, 55.23; 95% CI, 4.74–657.27; p < 0.001), CAR (OR, 2.61; 95% CI, 1.59–4.29; p < 0.001), Smoking history (OR, 8.37; 95% CI, 3.74–18.84; p < 0.001).
Figure 2. Perioperative variable selection using a LASSO logistic regression model. (A) The minimum criteria (lambda.min) and 1 SE of the minimum criteria (lambda. 1se) were used to depict the optimal values with dotted vertical lines. (B) LASSO coefficient profile of 39 variables. The coefficient profile is plotted according to the logarithmic sequence. To determine the optimal predictors of the model, five-fold cross-validation with minimum criteria was used, resulting in seven features with nonzero coefficients.
Table 2. Multivariable Logistic Regression Model for Predicting postoperative pneumonia in aSAH patients.
Machine learning model performance
Using the six features obtained by screening, we developed six machine learning models, including LR, SVM, RF, MLP, XGBoost, and KNN. Supplementary Table S1 and Figure 3 showed the best hyperparameter combination for each model and their AUCs in predicting POP. Their performance for prediction of POP was assessed (Table 3). The AUC values of KNN (0.78) and MLP (0.56) were relatively lower than LR (0.91), SVM (0.89), RF (0.87), XGBoost (0.86). Among them, LR exhibited the best performance for the prediction of POP risk. As the primary metric, the AUC for LR was 0.91 (95% confidence interval: 0.86–0.96). LR also exhibited the best performance based on the average precision of the precision-recall curve (0.65). The average precisions of the precision-recall curve for the remaining models are summarized in Figure 4. The remaining metrics are summarized in Table 3.
Figure 3. ROC curves for prediction of postoperative pneumonia (POP) on the test data set. Greater AUC shows higher discriminative ability of the model.
Figure 4. Precision-recall curve for prediction of postoperative pulmonary risk. BRF, balanced random forest; CI, confidence interval; LR, logistic regression; XGBoost, extreme gradient boosting; RF, random forest; MLP, multilayer perceptron; SVM, support vector machine; KNN, K-nearest neighbor.
Application of the model
The SHAP package conducted a comprehensive analysis of training set, showing the impact of each variable on predicting POP (Figure 5). The preoperative and postoperative information of a patient was input into the model: with mechanical ventilation 5 days, the GCS score at admission was 8, albumin level 37.2 g/L, with smoke history, CAR 0.17, NAR 0.16. The model analyzed that the risk of POP in this patient was 85.0%, indicating that the probability of POP for the patients was high, and the implementation of preventive treatments against POP should be prioritized (Figure 6A). The preoperative and postoperative information of another patient was input into the model: with mechanical ventilation 2 days, the GCS score at admission was 14, albumin level 40.9 g/L, with no smoke history, CAR 0.10, NAR 0.12. The model analyzed that the risk of POP in this patient was 3.3%, which indicated a low probability of POP occurrence in the patients and the occurrence of POP in the patient does not require significant attention (Figure 6B). Furthermore, a website was established for clinicians to use the proposed LR model.4
Figure 5. SHAP analysis of the proposed model on the testing set. This figure described data from the testing set, with each point representing one patient. The color represents the value of the variable; red represents the larger value; blue represents the smaller value. The horizontal coordinates represent a positive or negative correlation with transfusion risk, with a positive value indicating a risk of POP and a negative value indicating no risk for POP. The absolute value of the horizontal coordinate indicates the degree of influence; the greater the absolute value of the horizontal coordinate, the greater the degree of influence.
Figure 6. Examples of website usage. Entering the input value determined the transfusion requirements and displayed how each value contributed to the prediction. (A) Example 1 has a high probability of POP, the probability of POP is 85.0%, and (B) Example 2 has a low probability of POP, the probability of POP is 3.3%.
External data validation
To further confirmed the applicability of our model, we performed external validation using data from 97 patients with aSAH with MIME-IV. Figure 7 showed that the AUC of the model in the external data is 0.89 (95% confidence interval: 0.80–0.98), indicating that the model can still play a very good predictive performance in the external data.
Figure 7. Receiver operating characteristic (ROC) curves for predicting postoperative pneumonia in aneurysmal subarachnoid hemorrhage patients in the external valication set.
Discussion
Early detection of POP is critical for timely interventions to prevent the onset of the complication (21). Numerous studies have developed ML models to predict postoperative pulmonary complications. Jong Ho Kim et al. developed an ML model to predict POP in patients undergoing surgery (28). Peng et al. had successfully created and verified a deep-neural-network model based on combined natural language data and structured data to predict pulmonary complications in geriatric patients (29). However, there is no POP prediction ML model designed explicitly for aSAH patients. Therefore, it is urgent to establish a POP clinical prediction model for aSAH patients, which has good application value in clinical identification and decision-making.
We used ML to develop models for the prediction of POP for aSAH patients. Model training using data from 565 patients was followed by model testing using data from 141 patients. Six algorithms (LR, XGBoost, RF, MLP, SVM, KNN) were used to develop the models, whereas four metrics were used to evaluate their performances. LR exhibited the best overall performance, with a specificity of 78% and a sensitivity of 94% in predicting POP in aSAH patients. Besides, the AUC values of MLP and KNN were relatively lower than XGBoost, RF and SVM, whose accuracy and robustness might be attributed to their nature of integrating multiple base classifiers or learners (21). In addition, two examples were used to visualize how the LR model could predict POP and determine the relative importance of each variable for the clinician. With millions of POP taking place each year, the findings could help surgeons perform the management of postoperative pulmonary infection, such as the consideration of ventilator use time and tracheotomy, the use of prophylactic antibiotics. Finally, In order to reflect the model has a extensive range of applications, LR model was also validated on external data (MIMIC-IV) and showed good predictive performance.
In addition to successfully constructing machine learning models for predicting patient outcomes, the selection of variables should also prioritize the clinical requirements. As widely acknowledge, feature selection is a crucial process in machine learning. Selecting the proper combination of features to achieve a balance between model performance and efficiency is difficult but of great significance (30). Previous ML models required many features to predict Prognosis of patients, which reduces its practicality (28). We identified this issue at an early stage, prompting us to initially conduct LASSO regression on 39 variables. This approach not only mitigated the problem of collinearity among variables, but also resulted in a reduction of included variables in the model to six. Therefore, our ML models incorporated 6 variables of GCS, smoking history, MVT, GCS, NAR, CAR as predictors. These variables can be easily collected in clinical practice and important implications in clinical practice.
Among these variables, CAR was identified for the first time as an independent risk factor for predicting the POP in patients with aSAH. Previously, Dingding Zhang et al. discovered that an elevated CAR was correlated with the WFNS grade and Glasgow Outcome Scale (GOS) after 3 months aSAH (31). Our study further confirmed the importance of CAR on the prognosis of aSAH, and we also found that CAR was a better predictor of POP than other composite indicators such as NLR, NAR, DAR, and PNI (Supplementary Figure S1). Although Xin Zhang and Manman Xu et al. had demonstrated that NAR, PNI, DAR, and NLR are independent risk factors for predicting POP separately (2, 5, 12, 13), if these factors are compared together, the AUC of CAR and NAR was better than that of NLR, DAR and PNI, which also makes CAR and NAR remain as predictors into our model, while other indicators were excluded due to lower AUC or too high correlation coefficient with CAR and NAR. Back to the nature of the CAR, many previous studies had demonstrated the role of CRP and albumin in predicting pneumonia. CRP, an acute-phase protein, is triggered by different cytokines in reaction to infection, ischemia, trauma, and other inflammatory circumstances (32). In a recent study, Ben Gaastra et al. also discovered that CRP serves as an independent prognostic indicator for outcome following aSAH. The incorporation of CRP into prognostic models enhances their predictive accuracy (33). Xinlong Ma et al. found early increase in blood CRP appears to correlate with poor functional outcome after aSAH (34). Such patients exhibit a protracted recovery period for cough, expectoration, and swallowing function, necessitating an extended bed rest duration and presenting an increased susceptibility to pulmonary infections. Among them, the level of albumin played a pivotal role in our model, as it not only functioned as an independent prognostic indicator but also contributed to the calculation of two crucial predictive indicators, CAR and NAR. We posit that the primary factor contributing to this phenomenon is the multifaceted impact of hypoalbuminemia on patients with aSAH. Hypoalbuminemia in patients with aSAH not only reduces the level of immune protein and obstructs the repair of the mucosal barrier, leading to increased susceptibility to infection, but also exacerbates brain edema after subarachnoid hemorrhage, ultimately worsening motor function impairment in patients (32, 35). Some recent experimental findings additionally indicate that albuminemia mediates its neuroprotection through neurovascular remodeling and reducing cerebral lesions (36–38). A pilot study revealed that 1.25 g/kg/day albumin treatment was safe in SAH patients and might produce a better outcome (37). The clinicians should therefore prioritize the monitoring of hypoalbuminemia in aSAH patients, and consider implementing albumin supplementation as a preventive measure against infection. There are few clinical studies in this area, and further research is warranted to establish more advanced clinical diagnostic and treatment protocols.
In addition, we also note that Xiao Jin et al. constructed a nomogram model for predicting POP in aSAH patients (11). All of our models use MIMIC-IV data as our external data to verify model efficacy. Xiao Jin’s nomogram model had a validation performance of AUC 0.85 on external data, slightly lower than our LR model AUC 0.89. This is further evidence that, ML can offer unique perspective on the patient’s condition and can serve as a decision support tool in the management of aSAH. However, clinical judgment is necessary to interpret the ML results and implement a corresponding plan of action (17). For example, in the process of modeling, because our database only had 5 patients with COPD, and these patients had mild intracerebral hemorrhage, and none of them had POP, COPD was not included as an important predictor in the process of modeling. However, according to clinical experience, COPD patients are extremely prone to pulmonary infection. Therefore, the knowledge and experience of doctors are essential for the prediction of POP in aSAH patients with special conditions.
This study had several limitations. First, although the ML models have been validated in another database, the ML models are developed on the basis of a single-center cohort study, and future multi-center study will be needed for external validation. Furthermore, it should be noted that this study was conducted in a retrospective manner, which may have introduced collection and entry biases as well as residual confounding factors.” Third, in the case of external cohort inclusion, determining whether patients have received surgical treatment is challenging due to insufficient data availability. Hence, surgery was not considered as a prerequisite for enrolling an external cohort (11). In addition, it poses a significant challenge to monitor the occurrence of POP due to premature discharge resulting from severe illness or financial constraints, making it arduous to assess the likelihood of these patients developing POP. This limitation is equally applicable when utilizing the MIMIC-IV database, which solely documents the presence or absence of POP during hospitalization. Nonetheless, we believe that any potential error stemming from this constraint remains within manageable boundaries, as demonstrated in Xiao Jin et al.’s study (11). Last but not least, two or more machine learning algorithms can be synthesized to further improve predictive accuracy. Such a process is referred to as ensemble modeling, and it has been used broadly in various industries (39). However, because the authors of this paper were unfamiliar with this method, they did not dare to operate ensemble modeling arbitrarily. In the future, we will pay more attention to improving the prediction performance of clinical models by ensemble learning algorithms.
Conclusion
Our study has successfully established six novel ML models to predict POP among aSAH patients. Of these, the LR model has demonstrated overall best performance. Furthermore, an online prediction tool based on the LR model was developed to identify patients at high risk for POP after aSAH and facilitate timely interventions.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.
Ethics statement
The studies involving humans were approved by the Ethics Committee in Clinical Research (ECCR) of the First Affiliated Hospital of Wenzhou Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and institutional requirements.
Author contributions
XL: Writing – original draft, Writing – review & editing. CZ: Writing – original draft, Writing – review & editing. JW: Formal analysis, Data curation, Methodology, Writing – original draft, Writing – review & editing. CY: Writing – original draft, Writing – review & editing. JZ: —. QicZ: Writing – original draft, Writing – review & editing.
Funding
The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This study received funding from Wenzhou Science and Technology Bureau Major Project ZS2017007 and Key project of International Cooperation of National Natural Science Foundation 81820108011. This work was supported by Extreme Smart Analysis platform (https://www.xsmartanalysis.com/).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fneur.2024.1341252/full#supplementary-material
Footnotes
1. ^https://mimic.physionet.org/
2. ^https://xgboost.readthedocs.io/en/latest/python/index.html
3. ^https://github.com/scikit-learn/scikit-learn
4. ^https://www.xsmartanalysis.com/model/list/predict/model/html?mid=9730&symbol=4170wQvi00hz43760JQ1
References
1. Maher, M, Schweizer, TA, and Macdonald, RL. Treatment of spontaneous subarachnoid hemorrhage: guidelines and gaps. Stroke. (2020) 51:1326–32. doi: 10.1161/STROKEAHA.119.025997
2. Zhang, X, Zhang, S, Wang, C, Liu, R, and Li, A. High neutrophil-to-albumin ratio predicts postoperative pneumonia in aneurysmal subarachnoid hemorrhage. Front Neurol. (2022) 13:840858. doi: 10.3389/fneur.2022.840858
3. Aneurysmal, D’SS, and Hemorrhage, S. Aneurysmal subarachnoid Hemorrhage. J Neurosurg Anesthesiol. (2015) 27:222–40. doi: 10.1097/ANA.0000000000000130
4. Ingall, T, Asplund, K, Mähönen, M, and Bonita, R. A multinational comparison of subarachnoid hemorrhage epidemiology in the WHO MONICA stroke study. Stroke. (2000) 31:1054–61. doi: 10.1161/01.STR.31.5.1054
5. Xu, M, Zhang, L, Wang, J, Cheng, L, Chen, C, Li, S, et al. Pre-operative prognostic nutrition index and post-operative pneumonia in aneurysmal subarachnoid hemorrhage patients. Front Neurol. (2023) 14:1045929. doi: 10.3389/fneur.2023.1045929
6. Wu, MR, Chen, YT, Li, ZX, Gu, HQ, Yang, KX, Xiong, YY, et al. Dysphagia screening and pneumonia after subarachnoid hemorrhage: findings from the Chinese stroke center alliance. CNS Neurosci Ther. (2022) 28:913–21. doi: 10.1111/cns.13822
7. Hop, JW, Rinkel, GJ, Algra, A, and van Gijn, J. Case-fatality rates and functional outcome after subarachnoid hemorrhage: a systematic review. Stroke. (1997) 28:660–4. doi: 10.1161/01.STR.28.3.660
8. Inagawa, T. Trends in incidence and case fatality rates of aneurysmal subarachnoid hemorrhage in Izumo City, Japan, between 1980-1989 and 1990-1998. Stroke. (2001) 32:1499–507. doi: 10.1161/01.STR.32.7.1499
9. Ding, CY, Peng, L, Lin, YX, Yu, LH, Wang, DL, and Kang, DZ. Elevated lactate dehydrogenase level predicts postoperative pneumonia in patients with aneurysmal subarachnoid hemorrhage. World Neurosurg. (2019) 129:e821–30. doi: 10.1016/j.wneu.2019.06.041
10. Li, R, Lin, F, Chen, Y, Lu, J, Han, H, Yan, D, et al. In-hospital complication-related risk factors for discharge and 90-day outcomes in patients with aneurysmal subarachnoid hemorrhage after surgical clipping and endovascular coiling: a propensity score-matched analysis. J Neurosurg. (2021) 137:381–92. doi: 10.3171/2021.10.JNS211484
11. Jin, X, Wang, S, Zhang, C, Yang, S, Lou, L, Xu, S, et al. Development and external validation of a nomogram for predicting postoperative pneumonia in aneurysmal subarachnoid hemorrhage. Front Neurol. (2023) 14:1251570. doi: 10.3389/fneur.2023.1251570
12. Cai, L, Zeng, H, Tan, X, Wu, X, Qian, C, and Chen, G. The role of the blood neutrophil-to-lymphocyte ratio in aneurysmal subarachnoid hemorrhage. Front Neurol. (2021) 12:671098. doi: 10.3389/fneur.2021.671098
13. Wu, W, Liu, X, Zhu, Q, Chen, X, Sheng, B, Zhang, J, et al. The D-dimer/albumin ratio is a prognostic marker for aneurysmal subarachnoid Hemorrhage. Brain Sci. (2022) 12:1700. doi: 10.3390/brainsci12121700
14. Tu, JV. Advantages and disadvantages of using artificial neural networks versus logistic regression for predicting medical outcomes. J Clin Epidemiol. (1996) 49:1225–31. doi: 10.1016/S0895-4356(96)00002-9
15. Alaka, SA, Menon, BK, Brobbey, A, Williamson, T, Goyal, M, Demchuk, AM, et al. Functional outcome prediction in ischemic stroke: a comparison of machine learning algorithms and regression models. Front Neurol. (2020) 11:889. doi: 10.3389/fneur.2020.00889
16. Mutke, MA, Madai, VI, Hilbert, A, Zihni, E, Potreck, A, Weyland, CS, et al. Comparing poor and favorable outcome prediction with machine learning after mechanical thrombectomy in acute ischemic stroke. Front Neurol. (2022) 13:737667. doi: 10.3389/fneur.2022.737667
17. Savarraj, JPJ, Hergenroeder, GW, Zhu, L, Chang, T, Park, S, Megjhani, M, et al. Machine learning to predict delayed cerebral ischemia and outcomes in subarachnoid hemorrhage. Neurology. (2021) 96:e553–62. doi: 10.1212/WNL.0000000000011211
18. Grimes, DA. The nomogram epidemic: resurgence of a medical relic. Ann Intern Med. (2008) 149:273–5. doi: 10.7326/0003-4819-149-4-200808190-00010
19. Smith, CJ, Kishore, AK, Vail, A, Chamorro, A, Garau, J, Hopkins, SJ, et al. Diagnosis of stroke-associated pneumonia: recommendations from the pneumonia in stroke consensus group. Stroke. (2015) 46:2335–40. doi: 10.1161/STROKEAHA.115.009617
20. Wei, X, Min, Y, Yu, J, Wang, Q, Wang, H, Li, S, et al. The value of admission serological indicators for predicting 28-day mortality in intensive care patients with acute heart failure: construction and validation of a nomogram. Front Cardiovasc Med. (2021) 8:741351. doi: 10.3389/fcvm.2021.741351
21. Chen, C, Yang, D, Gao, S, Zhang, Y, Chen, L, Wang, B, et al. Development and performance assessment of novel machine learning models to predict pneumonia after liver transplantation. Respir Res. (2021) 22:94. doi: 10.1186/s12931-021-01690-3
22. Hu, P, Li, Y, Liu, Y, Guo, G, Gao, X, Su, Z, et al. Comparison of conventional logistic regression and machine learning methods for predicting delayed cerebral ischemia after aneurysmal subarachnoid Hemorrhage: a multicentric observational cohort study. Front Aging Neurosci. (2022) 14:857521. doi: 10.3389/fnagi.2022.857521
23. Lee, HC, Yoon, SB, Yang, SM, Kim, WH, Ryu, HG, Jung, CW, et al. Prediction of acute kidney injury after liver transplantation: machine learning approaches vs. logistic regression model. J Clin Med. (2018) 7:428. doi: 10.3390/jcm7110428
24. Xue, B, Li, D, Lu, C, King, CR, Wildes, T, Avidan, MS, et al. Use of machine learning to develop and evaluate models using preoperative and intraoperative data to identify risks of postoperative complications. JAMA Netw Open. (2021) 4:e212240. doi: 10.1001/jamanetworkopen.2021.2240
25. Lundberg, SM, Erion, G, Chen, H, DeGrave, A, Prutkin, JM, Nair, B, et al. From local explanations to global understanding with explainable AI for trees. Nat Mach Intell. (2020) 2:56–67. doi: 10.1038/s42256-019-0138-9
26. van den Bosch, T, Warps, AK, de Nerée tot Babberich, MPM, Stamm, C, Geerts, BF, Vermeulen, L, et al. Predictors of 30-day mortality among Dutch patients undergoing colorectal cancer surgery, 2011-2016. JAMA Netw Open. (2021) 4:e217737. doi: 10.1001/jamanetworkopen.2021.7737
27. Liu, J, Huang, J, Ma, S, and Wang, K. Incorporating group correlations in genome-wide association studies using smoothed group lasso. Biostatistics. (2013) 14:205–19. doi: 10.1093/biostatistics/kxs034
28. Kim, JH, Kim, Y, Yoo, K, Kim, M, Kang, SS, Kwon, YS, et al. Prediction of postoperative pulmonary edema risk using machine learning. J Clin Med. (2023) 12:1804. doi: 10.3390/jcm12051804
29. Peng, X, Zhu, T, Chen, G, Wang, Y, and Hao, X. A multicenter prospective study on postoperative pulmonary complications prediction in geriatric patients with deep neural network model. Front Surg. (2022) 9:976536. doi: 10.3389/fsurg.2022.976536
30. Shafiee, M, Lotfi, FH, and Saleh, H. Supply chain performance evaluation with data envelopment analysis and balanced scorecard approach. Appl Math Model. (2014) 38:5092–112. doi: 10.1016/j.apm.2014.03.023
31. Zhang, D, Yan, H, Wei, Y, Liu, X, Zhuang, Z, Dai, W, et al. C-reactive protein/albumin ratio correlates with disease severity and predicts outcome in patients with aneurysmal subarachnoid Hemorrhage. Front Neurol. (2019) 10:1186. doi: 10.3389/fneur.2019.01186
32. Bozada-Gutiérrez, K, Trejo-Ávila, M, and Moreno-Portillo, M. Postoperative complications and predictors of mortality in patients with COVID-19. Cir Cir. (2023) 91:344–53. doi: 10.24875/CIRU.22000512
33. Gaastra, B, Barron, P, Newitt, L, Chhugani, S, Turner, C, Kirkpatrick, P, et al. CRP (C-reactive protein) in outcome prediction after subarachnoid hemorrhage and the role of machine learning. Stroke. (2021) 52:3276–85. doi: 10.1161/STROKEAHA.120.030950
34. Ma, X, Lan, F, and Zhang, Y. Associations between C-reactive protein and white blood cell count, occurrence of delayed cerebral ischemia and poor outcome following aneurysmal subarachnoid hemorrhage: a systematic review and meta-analysis. Acta Neurol Belg. (2021) 121:1311–24. doi: 10.1007/s13760-020-01496-y
35. Fakler, JK, Grafe, A, Dinger, J, Josten, C, and Aust, G. Perioperative risk factors in patients with a femoral neck fracture - influence of 25-hydroxyvitamin D and C-reactive protein on postoperative medical complications and 1-year mortality. BMC Musculoskelet Disord. (2016) 17:51. doi: 10.1186/s12891-016-0906-1
36. Xie, Y, Liu, W, Zhang, X, Wang, L, Xu, L, Xiong, Y, et al. Human albumin improves long-term behavioral sequelae after subarachnoid hemorrhage through neurovascular remodeling. Crit Care Med. (2015) 43:e440–9. doi: 10.1097/CCM.0000000000001193
37. Suarez, JI, Martin, RH, Calvillo, E, Dillon, C, Bershad, EM, Macdonald, RL, et al. The albumin in subarachnoid Hemorrhage (ALISAH) multicenter pilot clinical trial: safety and neurologic outcomes. Stroke. (2012) 43:683–90. doi: 10.1161/STROKEAHA.111.633958
38. Richter, F, Mehdorn, AS, Fedders, T, Reichert, B, Egberts, JH, Becker, T, et al. C-reactive protein as predictor for infectious complications after robotic and open esophagectomies. J Clin Med. (2022) 11:5654. doi: 10.3390/jcm11195654
Keywords: aneurysmal subarachnoid hemorrhage, postoperative pneumonia, machine learning, endovascular treatment, prediction
Citation: Li X, Zhang C, Wang J, Ye C, Zhu J and Zhuge Q (2024) Development and performance assessment of novel machine learning models for predicting postoperative pneumonia in aneurysmal subarachnoid hemorrhage patients: external validation in MIMIC-IV. Front. Neurol. 15:1341252. doi: 10.3389/fneur.2024.1341252
Edited by:
Tarun Singh, University of Michigan, United StatesReviewed by:
Elisa Gouvêa Bogossian, Université libre de Bruxelles, BelgiumZhongheng Zhang, Sir Run Run Shaw Hospital, China
Copyright © 2024 Li, Zhang, Wang, Ye, Zhu and Zhuge. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Qichuan Zhuge, qc.zhuge@wmu.edu.cn
†These authors have contributed equally to this work