Performance of Machine Learning Algorithms for Predicting Adverse Outcomes in Community-Acquired Pneumonia

Xu, Zhixiao; Guo, Kun; Chu, Weiwei; Lou, Jingwen; Chen, Chengshui

doi:10.3389/fbioe.2022.903426

ORIGINAL RESEARCH article

Front. Bioeng. Biotechnol. , 29 June 2022

Sec. Bionics and Biomimetics

Volume 10 - 2022 | https://doi.org/10.3389/fbioe.2022.903426

Performance of Machine Learning Algorithms for Predicting Adverse Outcomes in Community-Acquired Pneumonia

Zhixiao Xu¹

Kun Guo¹

Weiwei Chu¹

Jingwen Lou¹

Chengshui Chen^1,2*

¹Department of Pulmonary and Critical Care Medicine, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
²The Interventional Pulmonary Key Laboratory of Zhejiang Province, Wenzhou, China

Background: The ability to assess adverse outcomes in patients with community-acquired pneumonia (CAP) could improve clinical decision-making to enhance clinical practice, but the studies remain insufficient, and similarly, few machine learning (ML) models have been developed.

Objective: We aimed to explore the effectiveness of predicting adverse outcomes in CAP through ML models.

Methods: A total of 2,302 adults with CAP who were prospectively recruited between January 2012 and March 2015 across three cities in South America were extracted from DryadData. After a 70:30 training set: test set split of the data, nine ML algorithms were executed and their diagnostic accuracy was measured mainly by the area under the curve (AUC). The nine ML algorithms included decision trees, random forests, extreme gradient boosting (XGBoost), support vector machines, Naïve Bayes, K-nearest neighbors, ridge regression, logistic regression without regularization, and neural networks. The adverse outcomes included hospital admission, mortality, ICU admission, and one-year post-enrollment status.

Results: The XGBoost algorithm had the best performance in predicting hospital admission. Its AUC reached 0.921, and accuracy, precision, recall, and F1-score were better than those of other models. In the prediction of ICU admission, a model trained with the XGBoost algorithm showed the best performance with AUC 0.801. XGBoost algorithm also did a good job at predicting one-year post-enrollment status. The results of AUC, accuracy, precision, recall, and F1-score indicated the algorithm had high accuracy and precision. In addition, the best performance was seen by the neural network algorithm when predicting death (AUC 0.831).

Conclusions: ML algorithms, particularly the XGBoost algorithm, were feasible and effective in predicting adverse outcomes of CAP patients. The ML models based on available common clinical features had great potential to guide individual treatment and subsequent clinical decisions.

1 Introduction

Lower respiratory tract infections are a serious threat to human health, as evidenced by injuring 471.8 million people and causing 2.6 million deaths in 2017 alone (GBD 2017 Causes of Death Collaborators, 2018). Three cities in South America revealed the region’s high illness burden of adult community-acquired pneumonia (CAP), with an incidence rate ranging from 1.76 to 7.03 per 1,000 person-years (Lopardo et al., 2018). Obviously, CAP is still one of the most serious clinical and public health problems in the world, despite advances in technology and perspectives related to CAP (Aliberti et al., 2019). Physicians frequently apply various severity scoring systems to evaluate the prognosis of CAP patients, and the CURB-65 score is one of the preferred scoring systems, especially in predicting short-term mortality (Ito et al., 2017). However, research into the predictive variables of CAP remains a major issue in clinical practice. On the other hand, different projects use different prognostic factors and develop distinct prediction models for various CAP groups. Not surprisingly, multicenter investigations are also required.

Machine learning (ML) is a field of artificial intelligence and has many advantages. First, it enables predictions to be made using a range of approaches. Second, ML-based techniques could often be rigorously validated compared with that traditional statistical methods (Li R. et al., 2020). Third, model development is more data-driven and readily accommodates a large number of variables (Obermeyer and Emanuel, 2016), which allows the identification of previously unnoticed features to improve predictions (Woodman et al., 2021).

Most studies have reported the application of ML algorithms to novel coronavirus disease (COVID-19), but few to CAP (Mondal et al., 2021; Podder et al., 2021). Furthermore, many ML studies have focused on disease diagnosis; For example, ML could be applied to distinguish COVID-19 from CAP (Li L. et al., 2020; Podder et al., 2021; Qi et al., 2021), rather than on the prediction of outcomes. There are few studies that use machine learning to predict the dire outcomes in patients with CAP, especially one-year post-enrollment status. Therefore, in this study, we performed nine machine learning algorithms to evaluate the adverse outcomes of adult CAP patients.

The primary purpose was to develop feasible models based on multicenter data to predict specific adverse outcomes (including hospital admission, mortality, ICU admission, and one-year post-enrollment status), rather than to compare various learning algorithms as a research endpoint. Moreover, we hoped that the ML models based on available and common clinical-related feature variables developed in the study could be simple and accurate when medical practitioners assessed the prognosis of CAP patients.

2 Participants and Methods

2.1 Study Participants and Design

Through the PLOS ONE and DATADRYAD policy, we retrieved the raw data encompassing 2,302 adults with CAP who were prospectively recruited between January 2012 and March 2015 and performed a secondary analysis. (Lopardo et al., 2018) conducted a prospective cohort study of adult individuals across three cities in South America [General Roca (Argentina), Rivera (Uruguay), and Concepción (Paraguay)]. Participants were eligible for inclusion in the study if they 1) age ≥18 years old, 2) exited evidence of an acute lower respiratory infection, and 3) manifestation of radiologically confirmed pneumonia, defined as new or progressive pulmonary infiltrate(s) consistent with pneumonia detected on chest radiography or CT scan, and specific inclusion criteria can be found in the study by Lopardo et al. (Lopardo et al., 2018).

We did not infringe the rights of the authors when we used these data for secondary analysis, due to the original research has been ethically approved, and its authors have relinquished all copyright and related proprietary rights of these data. Therefore, no separate ethical approval was required for this study.

2.2 Outcome Definitions

Hospital admission, mortality, ICU admission, and one-year post-enrollment status were enrolled as outcomes in this study. One-year post-enrollment status indicated whether the CAP patients died at the one-year follow-up.

2.3 Candidate Predictors

34 clinically relevant variables were included in this study. The variables incorporated into our models included clinical signs (3 variables), clinical characteristics (6 variables), laboratory tests (8 variables), and comorbidities (17 variables).

2.4 Machine Learning Algorithms

The first stage in the data preprocessing step was to impute the missing values of continuous features such as clinical characteristics and laboratory testing using the feature’s mean value. Then, the data was divided into two sets at random in the ratio of 70:30, with the training set used to construct a model for each algorithm, and the test set used for a final evaluation of the accuracy of each algorithm. The prediction models were developed utilizing a systematic machine learning-based framework, and nine different machine learning algorithms included decision trees (DTs), random forests (RFs), eXtreme Gradient Boosting (XGBoost), support vector machines (SVM), naïve Bayes, K-nearest neighbors (KNN), ridge regression (logistic regression with L2 regularization), logistic regression without regularization, and neural networks (NNs). All features were employed, but no interactions or higher-order words were created.

A grid search of hyper-parameters was implemented for each algorithm to determine the optimal set of hyper-parameters for training data accuracy, and each grid search was carried out by using 10-fold cross-validation. The process was repeated 10 times, with each fold being used for one of the 10 training steps and for evaluating the model accuracy of the training data. Throughout the grid search, we employed the AUC as the accuracy statistic which is the most commonly used in clinical settings for comparison with other studies. However, for the completeness and accuracy of results, accuracy, precision, recall, and F1-score were also reported.

2.5 Statistical Analysis

All analyses were performed using Python version 3.8.8 and R version 4.1.1. The XGBoost algorithm was built with its own Python package, while other machine learning algorithms were developed using Python’s scikit-learn library.

3 Results

3.1 Characteristics of the Datasets Used for Machine Learning

A total of 2,302 patients were enrolled in the study. In the cohort, hospital admission occurred in 1,565 (68.0%) patients, death in 277 (12.0%) patients, ICU admission occurred in 343 (14.9%) patients, and one-year post-enrollment status in 1,621 (70.4%) patients. For age and clinical signs, there were no missing values. There were some missing values in laboratory tests and comorbidity among the variables utilized for machine learning. The patients’ characteristics were listed in Table 1 before the mean imputation for missing values. Moreover, the characteristics of the patients after mean imputation for missing values were also described according to four specific outcomes. According to hospital admission stratification, there were significant differences between the two groups for all features except for systolic blood pressure, leukocytes values, segmented neutrophils values, platelet values, creatinine, glucose, the history of intravenous drug use, the history of overcrowding, and the history of the received flu shot in the last 12 months (Table 2). And we could get similar results when stratifying based on other outcomes (Supplementary Tables S1–S3).

TABLE 1

TABLE 1. Characteristics of the dataset used for machine learning before imputation.

TABLE 2

TABLE 2. Patient characteristics according to hospital admission stratification.

3.2 Performance of the Machine Learning Model

3.2.1 Outcome 1: Hospital Admission

The test accuracy results were shown in Table 3 and Figure 1. The AUC for XGBoost (AUC = 0.921) was considerably higher than other ML algorithms. The results of the accuracy score, precision score, recall score, and F1-score for the nine ML algorithms were also provided. Overall, the best-performing ML algorithm was XGBoost. As with the AUC, values for accuracy score and precision score were comparable across the various models with a range of 0.716 (Naïve Bayes) to 0.877 (XGBoost) for accuracy score, a range of 0.816 (Ridge) to 0.9 (KNN) for the precision score, a range of 0.662 (Naïve Bayes) to 0.957 (random forest) for recall score and a range of 0.801 (KNN) to 0.911 (XGBoost) for F1-score.

TABLE 3

TABLE 3. Diagnostic accuracy for the nine machine learning algorithms with the test dataset for the prediction of hospital admission and ICU admission.

FIGURE 1

FIGURE 1. Test accuracy of the nine machine learning algorithms for the prediction of hospital admission, ICU admission, death and one-year enrollment status.

3.2.2 Outcome 2: ICU Admission

Although machine learning algorithms did not perform as well in the prediction of ICU admission as they did in the prediction of other outcomes, there were still the most, but not all, of the ML algorithms with AUC values > 0.7 (Table 3 and Figure 1). Furthermore, the AUC for XGBoost (AUC = 0.801) was considerably higher than other ML algorithms. Unfortunately, the recall scores and F1-score did not farewell.

3.2.3 Outcome 3: Clinical Evolution (Death)

For the prediction of death, the neural network had a much higher AUC than other machine learning methods (AUC = 0.831) (Table 4 and Figure 2). Table 4 showed that the precision, recall, and F1-score were all rather low, although the model had a high accuracy score of 0.893. We also noticed that XGBoost displayed the second highest AUC of 0.825.

TABLE 4

TABLE 4. Diagnostic accuracy for the nine machine learning algorithms with the test dataset for the prediction of death and one-year post-enrollment status.

FIGURE 2

FIGURE 2. Feature importance plot for the eXtreme gradient boosting algorithm using test data for the prediction of hospital admission, ICU admission, death and one-year post-enrollment status.

3.2.4 Outcome 4: One-Year Post-Enrollment Status (Death)

Overall, the nine machine learning algorithms did a good job at predicting one-year post-enrollment status. Similarly, the AUC for XGBoost (AUC = 0.837) was much higher than that of other machine learning algorithms. Except for the AUC values, the results of accuracy, precision, recall, and F1-score also indicated that the nine ML algorithms had high accuracy and precision, with values for accuracy and precision being comparable across the various models with a range of 0.736 (Naïve Bayes) to 0.816 (XGBoost) for accuracy score, a range of 0.798 (KNN) to 0.858 (Naïve Bayes) for the precision score, a range of 0.815 (Naïve Bayes) to 0.959 (KNN) for recall and a range of 0.815 (Naïve Bayes) to 0.883 (XGBoost) for F1-score.

3.3 Feature Importance and Distributions

Given the high accuracy and precision of the XGBoost for the prediction of all four outcomes, the feature importance plot of the XGBoost algorithm for the test dataset is also shown. For the prediction of hospital admission, creatinine had the highest feature importance. For the prediction of ICU admission, heart rate had the highest feature importance. For the prediction of death, age, and CURB-65 score had the highest feature importance. For the prediction of one-year post-enrollment status, age and CURB-65 score had the highest feature importance.

4 Discussion

Machine learning has become increasingly applied in medicine because of its computational power and the availability of massive new datasets. Most of the current research about machine learning focuses on distinguishing CAP from other diseases, such as COVID-19 (Li L. et al., 2020), while studies predicting the outcomes of pneumonia based on commonly used clinical test data are limited. In this study, we applied a systematic machine learning framework to data from 2,302 adult CAP patients in three cities in South America to develop and test predictive models that could potentially be used in clinical settings to assist with risk management. And a grid search of the hyperparameter space and 10-fold cross-validation for each algorithm was performed to ensure the reliability of the results.

As a result, our models offered very high accuracy in detecting the adverse outcomes of CAP patients. This study predicted multiple adverse of pneumonia, which was innovative noticeably. In detail, the test data accuracy using the clinical signs and clinical characteristics and laboratory tests and comorbidities reached an AUC of 0.921 using the XGBoost algorithm for the prediction of hospital admission, an AUC of 0.831 using the neural network algorithm for the prediction of death, an AUC of 0.801 using XGBoost algorithm for the prediction of ICU admission, and an AUC of 0.837 using XGBoost algorithm for the prediction of one-year post-enrollment status.

To make the models implementable in clinical practice, the models were trained using variables commonly available to medical practitioners. We included four adverse outcomes and used 34 variables to generate the predictive models for adverse outcomes. Then, the medical team could make a judgment and take immediate action. Potentially important implications of the development of predictive models for optimizing post-hospital monitoring and the quality of care (Brajer et al., 2020; D'Ascenzo et al., 2021).

At present, only a few studies have reported the application of machine learning in pneumonia-related studies. And several studies have employed machine learning methods to predict mortality from pneumonia (Cooper et al., 1997; Wiemken et al., 2017; Kang et al., 2020), while other adverse outcomes have received less attention, especially one-year post-enrollment status. Feng et al. built a three-layer fully connected neural network to classify the prognosis of CAP patients with high accuracy and good generalizability by using ML techniques to predict CAP mortality (Feng et al., 2021). The proposed ML-based models including the CURB-65 score could accurately predict the death within 30 days or initial admission to the ICU from the emergency department with an AUC of 0.844 (Kang et al., 2020). Cooper et al. constructed 11 statistical and machine learning models that predict dire outcomes for CAP patients (such as mortality or severe clinical complications) and discovered an innovative neural network learning method that induced a model using Multitask and Learning along with Rank-prop learning (MTLR) with the largest ROC area (Cooper et al., 2005).

Models that highlight vital signs and laboratory tests may be more valid in alerting healthcare workers to potentially modifiable organ failure than models that rely heavily on comorbidities and demographic characteristics (Jones et al., 2021). Based on this spirit, our study established the accuracy of computerized prediction and revealed that the XGBoost algorithm was reliable. Moreover, the XGBoost algorithm is one of the powerful ML algorithms that may also align more closely with human thought (Fong et al., 2018). We found that many mainstream prognostic features still held important weight. The feature importance plots for the XGBoost algorithm for the prediction of admission, death, and one-year post-enrollment status showed the importance of CURB-65, which all ranked in the top 5. The CURB-65 score is one of the most commonly used predictive models for the classification of patients suffering from fever, dyspnea, and upper/lower respiratory symptoms (e.g., coughing). CURB-65 is much less computationally time-consuming; however, it has a disadvantage in that it contains only five variables (Buising et al., 2006; Capelastegui et al., 2006; Man et al., 2007). But in the cohort of elderly patients hospitalized for pneumonia in Geneva University Hospitals, the CURB-65 score was not predictive of one-year mortality (Malézieux-Picard et al., 2021).

It is incorrect to conclude that a model is a prediction of events. In fact, a model is a summarization of the collective clinical experience of events that happen in patients with similar clinical characteristics (Weick and Roberts, 1993). The value of prediction is that it places the clinical features of a new patient in the context of that clinical experience, providing a common basis for communication among clinicians, especially for those unfamiliar with each other (Jones et al., 2021). Credible models are essential to providing reliable, efficient, and equitable health care, and the models we have built are paving the way for that process.

This study has several limitations. First, despite the fact that patient data were obtained prospectively, missing values were unavoidable, and these missing values would skew the study’s conclusions. Second, in comparison with standard statistical models, ML methods have been found to generally require bigger data sets and, in particular, a higher number of events before the stable measures of prediction performance could be obtained (van der Ploeg et al., 2014). Despite this limitation, we still achieved moderate to high validation accuracy using data sets with a relatively limited number of events. Third, although 34 variables were included in this study, they still did not cover very comprehensively. Moreover, four adverse outcomes were investigated in the present study, but there are still many adverse outcomes that occur in clinical practice that have not been addressed.

The prospective validation of operational data is a critical first step in assessing the real-world performance of machine learning models. In the future, we intend to undertake prospective multicenter large-sample research to further prove the model’s utility. Furthermore, more clinical features including imaging features and more adverse outcomes should be included.

5 Conclusion

In the study, we have developed and tested the machine learning-based model to predict hospital admission, mortality, ICU admission, and one-year post-enrollment status in CAP patients. The results revealed that the ML algorithms (especially the XGBoost algorithm) were feasible and effective. There is potential to improve clinical practice if ML models based on available and common clinical-related feature variables are incorporated into future clinical decision aids when assessing the prognosis of patients with CAP. Furthermore, prospective multicenter large-sample research including more clinical features is required.

Data Availability Statement

Data acquisition from DryadData database (https://doi.org/10.5061/dryad.r282vk6). And other contributions presented in the study are included in the article/Supplementary Material; further inquiries can be directed to the corresponding author.

Ethics Statement

In the original research, written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.

Author Contributions

CC: the design of this study and manuscript revision; ZX: data collection and analysis and manuscript drafting. KG and WC: data analysis and manuscript revision. JL: data curation and manuscript revision.

Funding

This work was financially supported by the Interventional Pulmonary Key Laboratory of Zhejiang Province (2019E10014).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

The authors are very grateful to Lopardo et al. for their contributions to the original study.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fbioe.2022.903426/full#supplementary-material

References

Aliberti, S., Dela Cruz, C. S., Sotgiu, G., and Restrepo, M. I. (2019). Pneumonia Is a Neglected Problem: it Is Now Time to Act. Lancet Respir. Med. 7 (1), 10–11. doi:10.1016/s2213-2600(18)30470-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Brajer, N., Cozzi, B., Gao, M., Nichols, M., Revoir, M., Balu, S., et al. (2020). Prospective and External Evaluation of a Machine Learning Model to Predict In-Hospital Mortality of Adults at Time of Admission. JAMA Netw. Open 3 (2), e1920733. doi:10.1001/jamanetworkopen.2019.20733

PubMed Abstract | CrossRef Full Text | Google Scholar

Buising, K. L., Thursky, K. A., Black, J. F., MacGregor, L., Street, A. C., Kennedy, M. P., et al. (2006). A Prospective Comparison of Severity Scores for Identifying Patients with Severe Community Acquired Pneumonia: Reconsidering what Is Meant by Severe Pneumonia. Thorax 61 (5), 419–424. doi:10.1136/thx.2005.051326

PubMed Abstract | CrossRef Full Text | Google Scholar

Capelastegui, A., España, P. P., Quintana, J. M., Areitio, I., Gorordo, I., Egurrola, M., et al. (2006). Validation of a Predictive Rule for the Management of Community-Acquired Pneumonia. Eur. Respir. J. 27 (1), 151–157. doi:10.1183/09031936.06.00062505

PubMed Abstract | CrossRef Full Text | Google Scholar

Cooper, G. F., Abraham, V., Aliferis, C. F., Aronis, J. M., Buchanan, B. G., Caruana, R., et al. (2005). Predicting Dire Outcomes of Patients with Community Acquired Pneumonia. J. Biomed. Inf. 38 (5), 347–366. doi:10.1016/j.jbi.2005.02.005

CrossRef Full Text | Google Scholar

Cooper, G. F., Aliferis, C. F., Ambrosino, R., Aronis, J., Buchanan, B. G., Caruana, R., et al. (1997). An Evaluation of Machine-Learning Methods for Predicting Pneumonia Mortality. Artif. Intell. Med. 9 (2), 107–138. doi:10.1016/s0933-3657(96)00367-3

PubMed Abstract | CrossRef Full Text | Google Scholar

D'Ascenzo, F., De Filippo, O., Gallone, G., Mittone, G., Deriu, M. A., Iannaccone, M., et al. (2021). Machine Learning-Based Prediction of Adverse Events Following an Acute Coronary Syndrome (PRAISE): a Modelling Study of Pooled Datasets. Lancet 397 (10270), 199–207. doi:10.1016/S0140-6736(20)32519-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Feng, D.-Y., Ren, Y., Zhou, M., Zou, X.-L., Wu, W.-B., Yang, H.-L., et al. (2021). Deep Learning-Based Available and Common Clinical-Related Feature Variables Robustly Predict Survival in Community-Acquired Pneumonia. Rmhp Vol. 14, 3701–3709. doi:10.2147/rmhp.S317735

CrossRef Full Text | Google Scholar

Fong, R. C., Scheirer, W. J., and Cox, D. D. (2018). Using Human Brain Activity to Guide Machine Learning. Sci. Rep. 8 (1), 5397. doi:10.1038/s41598-018-23618-6

PubMed Abstract | CrossRef Full Text | Google Scholar

GBD 2017 Causes of Death Collaborators (2018). Global, Regional, and National Age-sex-specific Mortality for 282 Causes of Death in 195 Countries and Territories, 1980-2017: a Systematic Analysis for the Global Burden of Disease Study 2017. Lancet 392 (10159), 1736–1788. doi:10.1016/s0140-6736(18)32203-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Ito, A., Ishida, T., Tokumasu, H., Washio, Y., Yamazaki, A., Ito, Y., et al. (2017). Prognostic Factors in Hospitalized Community-Acquired Pneumonia: a Retrospective Study of a Prospective Observational Cohort. BMC Pulm. Med. 17 (1), 78. doi:10.1186/s12890-017-0424-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Jones, B. E., Ying, J., Nevers, M., Alba, P. R., He, T., Patterson, O. V., et al. (2021). Computerized Mortality Prediction for Community-Acquired Pneumonia at 117 Veterans Affairs Medical Centers. Ann. ATS 18 (7), 1175–1184. doi:10.1513/AnnalsATS.202011-1372OC

CrossRef Full Text | Google Scholar

Kang, S. Y., Cha, W. C., Yoo, J., Kim, T., Park, J. H., Yoon, H., et al. (2020). Predicting 30-day Mortality of Patients with Pneumonia in an Emergency Department Setting Using Machine-Learning Models. Clin. Exp. Emerg. Med. 7 (3), 197–205. doi:10.15441/ceem.19.052

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, L., Qin, L., Xu, Z., Yin, Y., Wang, X., Kong, B., et al. (2020). Using Artificial Intelligence to Detect COVID-19 and Community-Acquired Pneumonia Based on Pulmonary CT: Evaluation of the Diagnostic Accuracy. Radiology 296 (2), E65–e71. doi:10.1148/radiol.2020200905

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, R., Shinde, A., Liu, A., Glaser, S., Lyou, Y., Yuh, B., et al. (2020). Machine Learning-Based Interpretation and Visualization of Nonlinear Interactions in Prostate Cancer Survival. JCO Clin. Cancer Inf. 4, 637–646. doi:10.1200/cci.20.00002

PubMed Abstract | CrossRef Full Text | Google Scholar

Lopardo, G. D., Fridman, D., Raimondo, E., Albornoz, H., Lopardo, A., Bagnulo, H., et al. (2018). Incidence Rate of Community-Acquired Pneumonia in Adults: a Population-Based Prospective Active Surveillance Study in Three Cities in South America. BMJ Open 8 (4), e019439. doi:10.1136/bmjopen-2017-019439

PubMed Abstract | CrossRef Full Text | Google Scholar

Malézieux-Picard, A., Azurmendi, L., Pagano, S., Vuilleumier, N., Sanchez, J.-C., Zekry, D., et al. (2021). Role of Clinical Characteristics and Biomarkers at Admission to Predict One-Year Mortality in Elderly Patients with Pneumonia. Jcm 11 (1), 105. doi:10.3390/jcm11010105

PubMed Abstract | CrossRef Full Text | Google Scholar

Mondal, M. R. H., Bharati, S., and Podder, P. (2021). Diagnosis of COVID-19 Using Machine Learning and Deep Learning: A Review. Cmir 17 (12), 1403–1418. doi:10.2174/1573405617666210713113439

CrossRef Full Text | Google Scholar

Obermeyer, Z., and Emanuel, E. J. (2016). Predicting the Future - Big Data, Machine Learning, and Clinical Medicine. N. Engl. J. Med. 375 (13), 1216–1219. doi:10.1056/NEJMp1606181

PubMed Abstract | CrossRef Full Text | Google Scholar

Podder, P., Bharati, S., Mondal, M., and Kose, U. (2021). Application of Machine Learning for the Diagnosis of COVID-19. Data Sci. COVID-19, 175–194. doi:10.1016/B978-0-12-824536-1.00008-3

CrossRef Full Text | Google Scholar

Qi, S., Xu, C., Li, C., Tian, B., Xia, S., Ren, J., et al. (2021). DR-MIL: Deep Represented Multiple Instance Learning Distinguishes COVID-19 from Community-Acquired Pneumonia in CT Images. Comput. Methods Programs Biomed. 211, 106406. doi:10.1016/j.cmpb.2021.106406

PubMed Abstract | CrossRef Full Text | Google Scholar

van der Ploeg, T., Austin, P. C., and Steyerberg, E. W. (2014). Modern Modelling Techniques Are Data Hungry: a Simulation Study for Predicting Dichotomous Endpoints. BMC Med. Res. Methodol. 14, 137. doi:10.1186/1471-2288-14-137

PubMed Abstract | CrossRef Full Text | Google Scholar

Weick, K. E., and Roberts, K. H. (1993). Collective Mind in Organizations: Heedful Interrelating on Flight Decks. Adm. Sci. Q. 38 (3), 357–381. doi:10.2307/2393372

CrossRef Full Text | Google Scholar

Wiemken, T. L., Furmanek, S. P., Mattingly, W. A., Guinn, B. E., Cavallazzi, R., Fernandez-Botran, R., et al. (2017). Predicting 30-Day Mortality in Hospitalized Patients with Community-Acquired Pneumonia Using Statistical and Machine Learning Approaches. Univ. Louisv. J. Respir. Infect. 1 (3), 50–56. doi:10.18297/jri/vol1/iss3/10/

CrossRef Full Text | Google Scholar

Woodman, R. J., Bryant, K., Sorich, M. J., Pilotto, A., and Mangoni, A. A. (2021). Use of Multiprognostic Index Domain Scores, Clinical Data, and Machine Learning to Improve 12-Month Mortality Risk Prediction in Older Hospitalized Patients: Prospective Cohort Study. J. Med. Internet Res. 23 (6), e26139. doi:10.2196/26139

PubMed Abstract | CrossRef Full Text | Google Scholar

Yan Man, S., Lee, N., Ip, M., Antonio, G. E., Chau, S. S. L., Mak, P., et al. (2007). Prospective Comparison of Three Predictive Rules for Assessing Severity of Community-Acquired Pneumonia in Hong Kong. Thorax 62 (4), 348–353. doi:10.1136/thx.2006.069740

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: machine learning, community-acquired pneumonia, CAP, adverse outcomes, XGBoost

Citation: Xu Z, Guo K, Chu W, Lou J and Chen C (2022) Performance of Machine Learning Algorithms for Predicting Adverse Outcomes in Community-Acquired Pneumonia. Front. Bioeng. Biotechnol. 10:903426. doi: 10.3389/fbioe.2022.903426

Received: 24 March 2022; Accepted: 16 May 2022;
Published: 29 June 2022.

Edited by:

Rajesh Kumar Tripathy, Birla Institute of Technology and Science, India

Reviewed by:

Subrato Bharati, Bangladesh University of Engineering and Technology, Bangladesh
Amr Ali Alhossary, Nanyang Technological University, Singapore

Copyright © 2022 Xu, Guo, Chu, Lou and Chen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Chengshui Chen, Y2hlbmNoZW5nc2h1aUB3bXUuZWR1LmNu

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Performance of Machine Learning Algorithms for Predicting Adverse Outcomes in Community-Acquired Pneumonia

1 Introduction

2 Participants and Methods

2.1 Study Participants and Design

2.2 Outcome Definitions

2.3 Candidate Predictors

2.4 Machine Learning Algorithms

2.5 Statistical Analysis

3 Results

3.1 Characteristics of the Datasets Used for Machine Learning

3.2 Performance of the Machine Learning Model

3.2.1 Outcome 1: Hospital Admission

3.2.2 Outcome 2: ICU Admission

3.2.3 Outcome 3: Clinical Evolution (Death)

3.2.4 Outcome 4: One-Year Post-Enrollment Status (Death)

3.3 Feature Importance and Distributions

4 Discussion

5 Conclusion

Data Availability Statement

Ethics Statement

Author Contributions

Funding

Conflict of Interest

Publisher’s Note

Acknowledgments

Supplementary Material

References

94% of researchers rate our articles as excellent or good

94% of researchers rate our articles as excellent or good