Skip to main content

ORIGINAL RESEARCH article

Front. Physiol., 12 December 2022
Sec. Clinical and Translational Physiology
This article is part of the Research Topic AI and Data Science in Pulmonary and Critical Care Physiology and Medicine View all 6 articles

Using machine learning for the early prediction of sepsis-associated ARDS in the ICU and identification of clinical phenotypes with differential responses to treatment

Yu Bai,&#x;Yu Bai1,2Jingen XiaJingen Xia2Xu HuangXu Huang2Shengsong Chen,Shengsong Chen1,2Qingyuan Zhan,
Qingyuan Zhan1,2*
  • 1Graduate School, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing, China
  • 2Department of Pulmonary and Critical Care Medicine, Center of Respiratory Medicine, National Center for Respiratory Medicine, China-Japan Friendship Hospital, Beijing, China

Background: An early diagnosis model with clinical phenotype classification is key for the early identification and precise treatment of sepsis-associated acute respiratory distress syndrome (ARDS). This study aimed to: 1) build a machine learning diagnostic model for patients with sepsis-associated ARDS using easily accessible early clinical indicators, 2) conduct rapid classification of clinical phenotypes in this population, and 3) explore the differences in clinical characteristics, outcomes, and treatment responses of different phenotypes.

Methods: This study is based on data from the Telehealth Intensive Care Unit (eICU) and Medical Information Mart for Intensive Care IV (MIMIC-IV). We trained and tested the early diagnostic model of sepsis-associated ARDS patients in the eICU. We used key predictive indicators to cluster sepsis-associated ARDS patients and determine the characteristics and clinical outcomes of different phenotypes, as well to explore the differences of in-hospital mortality among different the positive end-expiratory pressure (PEEP) levels in different phenotypes. These results are verified in MIMIC-IV to evaluate whether they are repeatable.

Results: Among the diagnostic models constructed in 19,249 sepsis patients and 5,947 sepsis-associated ARDS patients, the AdaBoost (Decision Tree) model achieved the best performance with an area under the receiver operating characteristic curve (AUC) of 0.895, which is higher than that of the traditional Logistic Regression model (Z = −2.40,p = 0.013), and the accuracy of 70.06%, sensitivity of 78.11% and specificity of 78.74%. We simultaneously identified three sepsis-associated ARDS phenotypes. Cluster 0 (n = 3,669) had the lowest in-hospital mortality rate (6.51%) and fewer laboratory abnormalities (lower WBC (median:15.000 K/mcL), lower blood glucose (median:158.000 mg/dl), lower creatinine (median:1.200 mg/dl), lower lactic acid (median:3.000 mmol/L); p < 0.001). Cluster 1 (n = 1,554) had the highest in-hospital mortality rate (75.29%) and the most laboratory abnormalities (higher WBC (median:18.300 K/mcL), higher blood glucose (median:188.000 mg/dl), higher creatinine (median:2.300 mg/dl), higher lactic acid (median:3.900 mmol/L); p < 0.001). Cluster 2 (n = 724) had the most complex condition, with a moderate in-hospital mortality rate (29.7%) and the longest intensive care unit stay. In Clusters 0 and 1, patients with high PEEP had higher in-hospital mortality rate than those with low PEEP, but the opposite trend was seen in Cluster 2. These results were repeatable in 11,935 patients with sepsis and 2,699 patients with sepsis-associated ARDS patients in the MIMIC-IV cohort.

Conclusion: A machine learning diagnostic model of sepsis-associated ARDS patients was established. Three phenotypes with different clinical features and outcomes were clustered, and these had different therapeutic responses. These findings are helpful for the early and rapid identification of sepsis-associated ARDS patients and their precise individualized treatment.

Background

Acute respiratory distress syndrome (ARDS), one of the most common respiratory syndromes in intensive care unit (ICU), is characterized by rapidly progressive respiratory failure, pulmonary edema, diffuse alveolar damage, and inflammatory cell infiltration (Thompson et al., 2017). Although many basic and clinical studies in the past 50 years have continuously clarified the pathophysiological mechanisms of ARDS and proposed new treatments (Meyer et al., 2021), the mortality rate of ARDS patients is still as high as 40% (Bellani et al., 2016; Huang et al., 2020), and at least 50% will develop complications of varying degrees (Huang et al., 2020). Sepsis is a common risk factor for ARDS. Studies have shown that sepsis-associated ARDS is more serious than non-sepsis-associated ARDS, because it is more difficult to recover from lung injury caused by the former and its mortality rate is relatively high (Zhao et al., 2020). Improving the diagnosis and phenotypic classification of ARDS, as well as performing precision medicine, remain some of the main research directions in the field of respiratory critical care. In particular, high mortality conditions such as sepsis-associated ARDS with high mortality should be more actively diagnosed to reduce burden. It is important to predict the occurrence of sepsis-related ARDS at an early time, as well as to classify the clinical subgroups in the early stage and improve the effectiveness of interventions for target subgroups.

The use of artificial intelligence in the medical field has recently become more frequent. Using machine learning to build disease prediction models can predict the occurrence of diseases at an early stage. Machine learning can also cluster diseases to aid in making clinical decisions. Artificial intelligence may have a role in guiding clinicians to make important decisions, but there are some challenges as well as obstacles in ICU, such as the weak interpretability of the model, lack of robustness, and ethical concerns (Yoon et al., 2022). At present, the diagnosis and prediction of sepsis-associated ARDS by machine learning has not been studied, and there are no specific studies on clinical subtypes of sepsis-associated ARDS, with most subtype studies focusing on the entire ARDS population. Seymour et al. (2019) identified four clinical phenotypes of sepsis through cluster analysis, which were related to host response patterns and clinical outcomes. Calfee et al. (2014) classified ARDS patients into classic hyperinflammatory and hypoinflammatory types, but the definition of this biological phenotype requires the use of plasma biomarkers as class-defining variables, such as sTNFR-1 and interleukin (ILs), which are not routinely available and cannot be quickly quantified at bedside. Therefore, the clinical applicability of this classification system may be limited. In 2021, Liu et al. (2021) proposed a new division of ARDS patients into three clinical phenotypes using rapidly accessible clinical indicators. They concluded that the clinical phenotypes of ARDS were associated with different treatment responses to randomized interventions, but this study was based on the entire ARDS population. Studies in the past few years have used clustering methods to analyze the entire subtype of ARDS or sepsis but have not identified the clinical subtype of sepsis-associated ARDS. This classification is more detailed and precise, and it can easily be applied for clinical use.

In this study, we used rapid and easy-to-obtain early clinical indicators in a large-scale population of sepsis and sepsis-associated ARDS patients to build an early diagnostic model for these patients. A rapid early clinical subgroup classification was adopted for this population, assuming that these phenotypes have different clinical outcomes and different treatment responses. We validated this model in another large sepsis and sepsis-associated ARDS population.

Methods

Study design

Our study included two large databases and was a retrospective study. First, we trained an early diagnosis model for patients with sepsis-associated ARDS in the Telehealth Intensive Care Unit (eICU) and tested the stability of the model. At the same time, subgroup clustering of sepsis-associated ARDS patients was performed using key predictors (variables that are ultimately included in the models) to determine differences in characteristics and clinical outcomes among different phenotypes, as well as to explore the differences in in-hospital mortality between different phenotypes with different early positive end-expiratory pressure (PEEP) levels. These results were validated in Medical Information Mart for Intensive Care IV (MIMIC-IV) to assess whether they were reproducible. The specific process is shown in Figure 1.

FIGURE 1
www.frontiersin.org

FIGURE 1. Flowchart of this study.

Patient population

Our training and testing data is based on the eICU database, a multicenter ICU database with over 200,000 electronic medical records from 335 units at 208 hospitals across the United States between 2014 and 2015 (Pollard et al., 2018). We used the International Classification of Diseases, Ninth Revision, Clinical Modification (ICD-9-CM) Codes to retrieve sepsis and ARDS patients, and sepsis-associated ARDS patients were obtained by associating them with “patientunitstayid.” The external validation dataset is based on the MIMIC-IV database, which is a large, single-center, open-access database of 76,540 ICU admissions between 2008 and 2019 (Johnson et al., 2021). We used the International Classification of Diseases, Ninth and 10th Revision, Clinical Modification (ICD-9-CM,I CD-10-CM) Codes to retrieve sepsis patients and ARDS patients, and sepsis-associated ARDS patients were obtained by correlation with “stay_id.” We extracted data from the medical ICUs (MICU), surgical ICUs (SICU), and medical-surgical ICUs (Med-Surg ICU) in the two databases. Specialist critical care units such as cardiothoracic and cardio-surgical ICUs were excluded because of their specific patient cohorts with distinct presentations of sepsis. In the MIMIC-IV, sepsis was diagnosed based on the Sepsis-3 criteria, including suspected infection and a SOFA score≥2 (Seymour et al., 2016). In the eICU, sepsis and ARDS was identified according to the admission diagnosis recorded on the Acute Physiology and Chronic Health Evaluation IV (APACHE IV) dataset. We excluded patients who were admitted for less than 24 h or diagnosed with ARDS within 24 h after admission. Patient identities in MIMIC-IV database include subject_id, hadm_id and stay_id. Subject_id is unique to a patient, even if the patient has been admitted multiple times. Hadm_id is unique to a patient hospital stay, each hospital stay has a different hadm_id. Stay_id is unique to a patient ward stay, each ICU stay has a different stay_id. In MIMIC-IV database, we correlated subject_id and stay_id, the subject_id is selected only once, and excluded multiple analyses of the same patient. In eICU, only one of the methods for identifying patients is patientunitstayid, similar to the subject_id in MIMIC database. In the eICU database, we excluded multiple analyses of the same patient by screening out patients corresponding to repeated patientuistayid.

We accessed the eICU and MIMIC-IV after completion of the Protecting Human Research Participants exam (Record ID: 44151052). This study was conducted in accordance with the principles of the Declaration of Helsinki in 2013, and patients provided consent to have their data captured in the two databases. Thus, the ethical approval statement was waived in this study, as the data in the eICU and MIMIC-IV database were unidentifiable.

Predictor variables

We used structured query language programming in Navicat Premium (version 15) to extract clinical data from the eICU and MIMIC-IV database. We collected information related to the patients’ demographic characteristics, history of diseases, laboratory findings, severity scores of illness and outcomes. After evaluating data availability and clinical variable missing rates in both databases, 27 variables were selected as candidate training variables: age, gender, body mass index (BMI), hypertension, diabetes, cancer, acute physiology and chronic health evaluation IV (APACHE IV)/acute physiology score III (APSIII), and the maximum and minimum values of albumin, bicarbnonate (HCO3), lactate, bilirubin, creatinine, glucose, platelet, prothrombin time (PT), blood urea nitrogen (BUN), and white blood cell count (WBC). In order to reflect the early stage and timeliness of the model, we extracted the maximum and minimum values of laboratory indicators within 24 h after admission. Since APACHE IV index was not found in MIMIC-IV database, APSIII was used instead.

Statistical analysis

We cleaned the data and interpolated the missing values using the multiple imputation method, which is recognized as a better approach to deal with missing observations in both outcome and independent variables to ensure data integrity prior to analysis. For single variables, the specific methods of interpolation include linear regression, logistic regression and predicted mean method (pmm). By default, five data sets were generated. We averaged the data of these five data sets. We also standardized data prior to cluster analysis. The normal distribution of continuous variables was determined using the Kolmogorov-Smirnov test. The skewed distributed variables were expressed as the median and interquartile range (IQR), while categorical variables were expressed as number and percentages. Continuous variables between groups were compared by Mann-Whitney test or the Kruskal–Wallis test, as appropriate. Categorical variables between groups were compared by Pearson’s chi-squared test or Fisher’s exact test, as appropriate. SPSS23.0 software was used for statistical analysis.

Model training and testing

In building a sepsis-associated ARDS diagnostic model, we divided the patients in the eICU at a 7:3 ratio into a training set and a test set, respectively. Weight by correlation algorithm was used to screen the key predictors of sepsis-associated ARDS. We have trained five models: Naive Bayes, Logistic Regression, Gradient Boosted Trees, AdaBoost (Decision Tree), and Random Forest. Model performance was measured using the area under the receiver operating characteristic curve (AUROC), accuracy, sensitivity and specificity. We used the Youden index maximum method to determine the optimal working point of the ROC curve and the corresponding sensitivity and specificity. Area under the receiver operating characteristic curves (AUCs) were compared using DeLong’s method. After the best performance model is selected, external verification is carried out in MIMIC-IV. In the construction of sepsis-associated ARDS clinical subgroup classification, we used K-means clustering model through key predictors. After the optimal phenotypes were derived, the results were visualized using a scatter plot and centroid chart. The clinical characteristics and outcomes of different phenotypes were analyzed, and the differences of in-hospital mortality of different PEEP levels in different phenotypes were compared. The same method was used for external validation in MIMIC-IV. The overall model was built using RapidMiner Studio 9.10.001 and python 3.7.

Results

Participants

A total of 19,249 patients with sepsis were included in the eICU database, including 5,947 patients with sepsis-associated ARDS (Table 1). Meanwhile, 11,935 patients with sepsis were included in the MIMIC-IV database, including 2,699 patients with sepsis-associated ARDS (Table 2). Source of infection in septic patients and in septic-associated ARDS from gastrointestinal, cutaneous/soft tissue, pulmonary, gynecologic, renal, urinary tract infection, abdominal infection and unknown. In the eICU database, the overall population of sepsis patients had an in-hospital mortality rate of 16.84% and median ICU stay of 2.3 days, whereas for patients with sepsis-associated ARDS, these values were 27.31% and 3.4 days, respectively. For the MIMIC-IV database, the overall population of sepsis patients had an in-hospital mortality rate of 16.51% and median ICU stay of 2.1 days, whereas for patients with sepsis-associated ARDS, these values were 22.05% and 2.9 days, respectively.

TABLE 1
www.frontiersin.org

TABLE 1. Baseline demographic and clinical characteristics of patients from the eICU database.

TABLE 2
www.frontiersin.org

TABLE 2. Baseline demographic and clinical characteristics of patients from the MIMIC-IV database.

Establishment and verification of sepsis-associated ARDS diagnostic model

A total of 19,249 sepsis patients, including 5,947 patients with sepsis-associated ARDS, were selected from the eICU database. The entire cohort was randomly divided into a training cohort (13,474, 70%) and a test cohort (5,775, 30%). We used the weight by correlation algorithm to select the key predictors among the 27 input variables. We selected 14 key predictors (Figure 2) with high weight (>0.02), which are APACHE IV and minimum and maximum HCO3, lactate, creatinine, albumin, glucose, WBC, and age. Using 14 variables as input factors, five machine learning methods, including Naive Bayes, Logistic Regression, Gradient Boosted Trees, AdaBoost (Decision Tree), and Random Forest were established to predict the occurrence of sepsis-associated ARDS. In the eICU test queue, the AUC and corresponding accuracies, sensitivities and specificities of the five models are as follows: Naive Bayes: 0.644 (95%CI: 0.632–0.656) (68.89% (95%CI:68.69%–69.10%) accuracy, 72.67% (95%CI:72.42%–72.92%) sensitivity, 48.75% (95%CI:45.83%–51.67%) specificity), Logistic Regression: 0.653 (95%CI:0.641–0.667) (71.34% (95%CI:71.20%–71.48%) accuracy, 71.95% (95%CI:70.42%–73.48%) sensitivity, 44.18% (95%CI:43.23%–45.13%) specificity), Gradient Boosted Trees: 0.736 (95%CI:0.697–0.795) (67.19% (95%CI:66.46%–68.00%) accuracy, 71.43% (95%CI:70.12%–72.75%) sensitivity, 65.29% (95%CI:64.10%–66.48%) specificity), AdaBoost (Decision Tree): 0.895 (95%CI:0.834–0.936) (70.06% (95%CI:68.87%–71.28%) accuracy, 78.11% (95%CI:74.23%–81.23%) sensitivity, 78.74% (95%CI:76.13%–81.35%) specificity), and Random Forest: 0.763 (95%CI:0.731–0.820) (69.16% (95%CI:67.11%–71.21%) accuracy, 74.92% (95%CI:70.85%–78.99%) sensitivity, 69.38% (95%CI:66.82%–72.03%) specificity). The performance comparison of the models is shown in Table 3; Figure 3. From the performance index of the models, the AUC value of AdaBoost (DecisionTree) is the highest, which is higher than that of the traditional Logistic Regression model (Z = -2.40,p = 0.013), and the sensitivity, specificity and accuracy are higher. We selected AdaBoost (Decision Tree), with the best performance as the final model. We performed external verification in the MIMIC-IV cohort. A total of 11,935 septic patients were selected in the MIMIC-IV database, of which 2,699 were sepsis-associated ARDS patients. Since there is no APACHE IV in MIMIC-IV, we used APSIII instead. In the verification set, 14 key predictors were inputted into the AdaBoost (Decision Tree) model, and the resulting AUC is the 0.804. The ROC curve in MIMIC-IV external verification set is shown in Figure 4.

FIGURE 2
www.frontiersin.org

FIGURE 2. Feature screening of sepsis-associated ARDS diagnostic model.

TABLE 3
www.frontiersin.org

TABLE 3. Performance of the five machine learning models for predicting sepsis-associated ARDS.

FIGURE 3
www.frontiersin.org

FIGURE 3. Comparison of ROC of five machine learning methods for predicting sepsis-associated ARDS in eICU test set. The AUC of the five models in the eICU test cohort are as follows: Naive Bayes: 0.644, Logistic Regression: 0.653, Gradient Boosted Trees:0.736, AdaBoost (Decision Tree): 0.895, and Random Forest: 0.763.

FIGURE 4
www.frontiersin.org

FIGURE 4. The MIMIC-IV verification set validates the sepsis-associated ARDS ROC of the AdaBoost (Decision Tree) model, AUC = 0.804.

Derivation and validation of sepsis-associated ARDS phenotypes

A total of 5,947 cases of sepsis-associated ARDS were selected in the eICU database for subgroup clustering by the K-means method, and repeatability was observed in the MIMIC-IV cohort. A total of 2,699 sepsis-associated ARDS patients were selected in the MIMIC-IV database. Regarding the clustering model, after evaluating data availability and the rate of missing clinical variables, we adopt the worst value of each variable selected by the prediction model, which is a total of 8 input variables: APACHE IV, maximum HCO3, maximum lactate, maximum creatinine, minimum albumin, maximum glucose, maximum WBC, and age. To evaluate the clustering result, we considered both theoretical and practical factors including: (1) Goodness of fit (2) Adequate large cluster size (3) Salient difference in clinical characteristics between different phenotypes. To determine the optimal number of cluster k, we examined the Gap statistics and the Gap∗ statistics. The optimal number of clustering was determined as 3, which was visualized using a scatter plot (Figure 5) and centroid chart (Figure 6). We also conducted a sensitivity analysis of the clustering strategy. We applied consensus clustering on same data to compare the difference in cluster assignments and phenotype clinical characteristics. We observe a sharp decline in Delta Area of consensus cumulative distribution function from 3 class to 4 class, indicating optimal number of clusters is likely to be 3. Under cluster number = 3, cluster consensus for all cluster was above 0.8 and consensus matrix suggested that goodness of fit was high. The baseline characteristics of the three phenotypes in the eICU derivation cohort are shown in Table 4. The three phenotypes have different clinical characteristics, as follows. Patients in Cluster 0 have lower WBC (median:15.000 K/mcL), lower blood glucose (median:158.000 mg/dl), lower creatinine (median:1.200 mg/dl), lower lactic acid (median:3.000 mmol/L), p < 0.001. The age of patients in Cluster 0 was between Cluster1 and Cluster2, and the proportion of comorbidities was also between Cluster1 and Cluster2. These patients were mainly non-traumatic infections and mainly from MICU. Patients in Cluster 1 have the highest WBC (median:18.300 K/mcL), highest blood glucose (median:188.000 mg/dl), highest creatinine (median:2.300 mg/dl), highest lactic acid (median:3.900 mmol/L), p < 0.001. Cluster 1 also had the oldest age, highest proportion of comorbidities, mainly non-traumatic infections, and mainly from MICU. Lastly, patients in Cluster 2 have the lowest age, with the albumin, lactic acid, blood glucose and WBC between Clusters 0 and 1. Cluster 2 had the lowest proportion of comorbidities, and was dominated by traumatic infections, which were mainly derived from SICU. We evaluated the repeatability of phenotypes in MIMIC-IV and found that the three phenotypes were similar to those in the eICU derivation cohort (Table 5).

FIGURE 5
www.frontiersin.org

FIGURE 5. Scatter plot visualization of phenotype assignments by K-means in sepsis-associated ARDS. (A): Scatter plot visualization of phenotype assignments by K-means in the eICU derivation cohort. There were three phenotypes, Clusters 0, 1, and 2, with 3669, 1554, and 724 patients respectively. (B): Scatter plot visualization of phenotype assignments by K-means in the MIMIC-IV validation cohort. There were three phenotypes, Clusters 0, 1, and 2, with 1857, 518, and 324 patients respectively.

FIGURE 6
www.frontiersin.org

FIGURE 6. Centroid chart visualization of phenotype assignments by K-means in sepsis-associated ARDS. The laboratory indices were standardized, and the median laboratory indices differed between phenotypes. (A): Centroid chart visualization of phenotype assignments by K-means in the eICU derivation cohort. There were three phenotypes, Clusters 0, 1, and 2, with 3669, 1554, and 724 patients respectively. (B): Centroid chart visualization of phenotype assignments by K-means in the MIMIC-IV validation cohort. There were three phenotypes, Clusters 0, 1, and 2, with 1857, 518, and 324 patients respectively.

TABLE 4
www.frontiersin.org

TABLE 4. Clinical characteristics and outcomes of three phenotypes in the eICU derivation cohort.

TABLE 5
www.frontiersin.org

TABLE 5. Clinical characteristics and outcomes of three phenotypes in the MIMIC-IV derivation cohort.

Relationship between phenotype and clinical outcome

The three phenotypes had different clinical outcomes in eICU cohort, as well as in the MIMIC-IV external validation cohort. Cluster 0 had the lowest in-hospital mortality rate (6.51% in eICU and 3.45% in MIMIC-IV), whereas Cluster1 had the highest in-hospital mortality rate (75.29% in eICU and 90.35% in MIMIC-IV). Clusters 0 and 1 had similar median ICU stay length (Cluster0: eICU 2.90 days and MIMIC-IV 2.90 days; Cluster 1: eICU 2.20 days and MIMIC-IV 3.75 days). Meanwhile, Cluster 2 had a moderate in-hospital mortality rate (29.7% in eICU, 19.44% in MIMIC-IV), and the longest ICU stay (15.9 days in eICU, 17 days in MIMIC-IV). Between the two databases, there were significant differences in in-hospital mortality and ICU stay days among the three phenotypes (p < 0.05) (Tables 4, 5).

Therapeutic effects of different PEEP levels in different phenotypes

We divided PEEP within 24 h after admission into high and low PEEP levels, wherein PEEP of >10 cm H2O was defined as a high PEEP level. In order not to bias the results, we excluded patients with PEEP missing in this period. The final eICU included 1,473 patient in Cluster 0, 988 in Cluster 1, and 504 in Cluster 2. Cluster 0 in MIMIC-IV had 723 individuals, Cluster 1 had 395 individuals, and Cluster 2 had 324 individuals. The interaction between phenotype and early PEEP strategy was determined. In the eICU derivation cohort, the in-hospital mortality rates of high PEEP in Clusters 0 and 1 were higher compared to low PEEP (p < 0.05), whereas in Cluster 2, high PEEP had lower hospital mortality than low PEEP (p < 0.05). Similar results were observed in the MIMIC-IV validation cohort (Figure 7). Notably, in the eICU cohort, the mortality rates of low and high PEEP, respectively, were 6.14% (84) and 18.87% (20) in Cluster 0 (p < 0.05), 73.41% (657) and 92.47% (86) in Cluster 1 (p < 0.05), and 29.89% (130) and 13.04% (9) in Cluster 2 (p < 0.05). On the other hand, MIMIC-IV, the mortality rates of low and high PEEP, respectively, were 4.2% (30) and 12.5% (1) in Cluster 0 (p > 0.05), 89.39% (358) and 100% (37) in Cluster 1 (p < 0.05), and 21.11% (61) and 5.71% (2) in Cluster 2 (p < 0.05).

FIGURE 7
www.frontiersin.org

FIGURE 7. Comparison of in-hospital mortality across phenotypes at high versus low PEEP levels in the eICU derived cohort and MIMIC-IV validated cohort. (A): Cluster 0 (eICU): high PEEP had a higher mortality rate than low PEEP (18.87% vs. 6.14%, p < 0.05). (B): Cluster 1 (eICU): high PEEP had a higher mortality rate than low PEEP (92.47% vs. 73.41%, p < 0.05). (C): Cluster 2 (eICU) : high PEEP had a lower mortality rate than low PEEP (13.04% vs. 29.89%, p < 0.05). (D): Cluster 0 (MIMIC-IV): high PEEP had a higher mortality rate than low PEEP (12.50% vs. 4.20%, p > 0.05). (E): Cluster 1 (MIMIC-IV): high PEEP had a higher mortality rate than low PEEP (100.00% vs. 89.39%, p < 0.05). (F): Cluster 2(MIMIC-IV): high PEEP had a lower mortality rate than low PEEP (5.71% vs. 21.22%, p < 0.05).

Discussion

In this study, we developed and validated five machine learning methods using fourteen clinical variables to predict the risk of sepsis-associated ARDS. The AdaBoost (Decision Tree) model exhibited the best performance. We also used rapid key clinical predictors to conduct clinical subgroup clustering for patients with sepsis-associated ARDS. The three phenotypes had different demographics, laboratory tests and outcomes, and these notable responded differently to early high and low PEEP strategies. These results were validated in an additional cohort.

To the best of our knowledge, machine learning for the prediction and clinical subtype classification of sepsis-associated ARDS has not yet been studied. This is the largest study to confirm the prediction and clinical subgroup classification of sepsis-associated ARDS. In this study, whether in the prediction model or in the subgroup clustering, early conventional and easily accessible clinical data are used to ensure the extensibility and clinical transformation of the model, as well as to assist clinicians in the early judgment of patients’ conditions.

The advantage and difference among the development models analyzed in the study are as follow: Naive Bayes is a relatively simple machine learning model, which originates from classical mathematical theory and has a solid mathematical foundation and stable classification efficiency. It has been shown to perform relatively well in the presence of noise, missing data, and irrelevant features (Bi et al., 2019). Logistic Regression is a generalized linear regression analysis model, which is easy to use and explain, but it is sensitive to the multiple collinearity of independent variables in the model. (Edwards et al., 2021). Gradient Boosted Trees is an enhanced integrated model of decision tree, which is flexible and performs well for all kinds of data types (Friedman, 2002). The decision tree model is a easy-to-use classifier model, and the results are easy to explain and robust. Compared with other models, the preparation of data is unnecessary, the requirements for data attributes are not strict, and can achieve good results for large data sources in a relatively short time (Bi et al., 2019). AdaBoost model is a kind of classifier with high precision, which can be used to construct sub-classifiers. Adaboost algorithm provides a framework and rarely appears overfitting phenomenon. However, compared with other weak classifiers, training is more time-consuming (Freund, 1995). Random Forest is a random bagged integrated model of decision tree, which can deal with high latitude data and performs well. The model has strong generalization ability and fast speed, but it is easy to over-fit in some noisy regression or classification problems (Edwards et al., 2021).

Most of the factors involved in previous studies on the diagnosis of sepsis-associated ARDS were inflammatory factors, lung surface proteins or genes, etc. (Wang et al., 2008; Fremont et al., 2010; Ware et al., 2013; Reilly et al., 2018). In our study, clinical indicators within 24 h after admission were used (i.e., APACHE IV/APSIII, maximum HCO3, minimum HCO3, maximum lactate, minimum lactate, maximum creatinine, minimum creatinine, maximum albumin, minimum albumin, maximum glucose, minimum glucose, maximum WBC, minimum WBC, and age) combined with the machine learning model to obtain an accurate prediction of sepsis-associated ARDS. Studies have found that elevated serum lactate levels and clinical prediction scores were independently associated with the development of ARDS in severe sepsis (Mikkelsen et al., 2013). These two indicators were not only related to the development of ARDS, but also with the prognosis of ARDS patients. A recent Chinese study explored the independent risk factors of community-acquired pneumonia (CAP) complicated with ARDS (Mo et al., 2022), and used an artificial neural network model to predict ARDS in CAP patients. Some of the important predictors in their study were similar with ours, such as age, albumin, creatinine, and blood glucose, and they pointed out that these indicators may be risk factors for ARDS in patients with CAP. In clinical practice, elderly sepsis patients with severe infection, poor nutrition, abnormal renal function, and poor blood glucose control are prone to complications and increasing the risk of ARDS. WBCs are considered the most important effector cells involved in acute inflammation during the pathogenesis of ARDS, with increased WBC heralding the occurrence of ARDS. Abnormal HCO3 represents acid-base imbalance, which is seen alongside electrolyte disturbances in the development of ARDS. In conclusion, we believe that several key predictors in this study combined with machine learning models can provide more accurate predictions for the diagnosis of sepsis-associated ARDS patients. Clinical subgroup clustering of sepsis-associated ARDS patients using these key predictors yielded three phenotypes with different clinical characteristics and outcomes.

In 2014, Calfee et al. (2014) divided ARDS patients into two biological phenotypes: hyperinflammatory and hypoinflammatory types. Repeatability verification was conducted in the SAILS cohort (statins for acutely injured lungs from sepsis) in 2018, with approximately 40% of the patients classified as hyperinflammatory phenotypes (Sinha et al., 2018). The team then conducted a series of related studies (Sinha et al., 2020a; Sinha et al., 2020b; Sinha et al., 2022) to validate other ARDS cohorts, continuously simplify the model, and used clinical data to classify biological phenotypes as much as possible. However, the determination of biological phenotype is inseparable from the definition of plasma biomarkers, and thus this is difficult to determine at bedside. In 2021, Liu et al. (2021) used fast and easily available clinical indicators to classify ARDS patients into three clinical phenotypes. These subtypes were not defined by plasma biomarkers and were thus more convenient for rapid clinical application. Type I in their study is similar to Cluster 0 in our study, which included patients with fewer laboratory abnormalities and lowest hospital mortality. Type II is similar to Cluster 2 in our study, which included patients with a more complex condition and moderate mortality. Type III is similar to Cluster 1 in our study, which is closely related to renal insufficiency and acidosis, and had the highest mortality. However, our study is based on subgroup clustering in sepsis-associated ARDS patients, which is more detailed, has less confounding factors, and can explore different clinical characteristics of patients in relatively similar diseases. Alveolar recruitability varies among patients with different severities of ARDS. In patients with mild ARDS, lung tissue recruitability is low, whereas this is high among patients with moderate to severe ARDS (Gattinoni et al., 2006). In our sepsis-associated ARDS patient classification, Cluster 0 patients were the least ill and had poor lung recruitability, which is equivalent to the hypoinflammatory phenotype classification of ARDS by Calfee et al. (2014) in 2014; mortality is lower with low PEEP. In type I ARDS according to the classification of Liu et al. (2021), low PEEP had a lower 60-day mortality rate than high PEEP, which may be because the use of lower PEEP in these patients is sufficient to maintain alveolar inflation and increase functional residual capacity. On the other hand, the patients of Cluster 1 and Cluster 2 have more severe illness, which is equivalent to hyperinflammatory ARDS patients, with a higher mortality rate than Cluster 0. From our classification, these patients can be further subdivided into Clusters 1 and 2. A study in 2017 (Writing Group for the Alveolar Recruitment for Acute Respiratory Distress Syndrome Trial ART Investigators Laranjeira et al., 2017) also pointed out that the use of high PEEP and lung recruitment in patients with moderate to severe ARDS was associated with improved oxygenation but increased mortality. This may be caused by a misclassification of lung morphology in ARDS patients. Therefore, a more specific classification is necessary for moderate to severe ARDS patients. We further classified patients with moderate to severe ARDS into Clusters 1 and 2. Compared to Cluster 0, the overall condition of Cluster 2 is more severe, with higher lung recruitability and younger age, and it is more beneficial to use high PEEP. Cluster 2 patients are mainly from trauma patients, and the clinical benefit is greater when using higher PEEP. From our clinical experience, patients with ARDS from traumatic sources do clinically prefer to use high PEEP compared with other ARDS patients, because these patients have better lung recruitment ability (Burns et al., 2001; Schreiter et al., 2004). For ARDS patients with high alveolar recruitability, a higher PEEP can be selected, which can not only reduce the shear force formed by the periodic collapse of the alveoli, but also avoid ventilator-associated lung injury caused by excessive transpulmonary pressure (Caironi et al., 2010). Cluster 1 is the most severely ill, with an extremely high mortality rate; these patients may not be able to tolerate high PEEP. At this time, the side effects caused by high PEEP outweigh the benefits. Therefore, from our classification of sepsis-associated ARDS patients, lower PEEP is more favorable in Clusters 0 and 1, whereas higher PEEP is better in Cluster 2. In the next step, we will also consider transforming this research into software and other clinical tools to integrate the identification and clustering of sepsis-associated ARDS patients to assist doctors in diagnosis and treatment.

Our study has some limitations. First, many laboratory parameters were removed before model construction due to missing data in over 50% (e.g., pondus hydrogenii, partial pressure of oxygen, and partial pressure of carbon dioxide). Second, the eICU and MIMIC-IV indices of the two databases cannot be completely consistent, thus the APACHE IV and APSIII were interchanged when establishing prediction models and subgroup clustering. Third, missing data was common for some features in the eICU and MIMIC-IV datasets, and thus we performed multiple imputations before statistical analysis. The missing data could result in some bias in our results. Fourth, machine learning cannot avoid the “black box” problem, and it is still a relatively new concept in the field of medicine. Therefore, it is necessary to pilot a prospective implementation study based on system tools in the intensive care environment.

Conclusion

The use of machine learning in the early diagnosis and classification of sepsis-associated ARDS with easily accessible clinical indicators may assist clinicians in making early diagnosis of the disease, as well as further specify ARDS as a heterogeneous disease, promote individualized and precise treatment, and facilitate clinical transformation and application.

Data availability statement

Publicly available datasets were analyzed in this study. This data can be found here: https://mimic.mit.edu/, https://eicu-crd.mit.edu/.

Ethics statement

Ethical review and approval was not required for the study on human participants in accordance with the local legislation and institutional requirements. Written informed consent from the participants and; legal guardian/next of kin was not required to participate in this study in accordance with the national legislation and the institutional requirements.

Author contributions

YB and QZ conceived and designed the study. YB applied for database and collected data. YB and SC drafted the manuscript, cleaned data, generated model and did statistical analyses. JX and XH revised the manuscript for important intellectual content. YB, SC, JX, and XH contributed to data interpretation. YB and QZ are the guarantor of the article, taking responsibility for the integrity of the work as a whole. All authors provided fnal approval of the version to be published.

Funding

JX received funding from the National Key Research and Development Program of China (2022YFC2504401). QZ received funding from the National High Level Hospital Clinical Research Funding (2022-NHLHCRF-LX-01-01). QZ received funding from CAMS Innovation Fund for Medical Sciences (2022-I2M-JB-016). QZ received funding from National Natural Science Foundation of China (81870072). XH received funding from the National Key Research and Development Program of China (2020YFC2003700).

Acknowledgments

We thank the eICU and MIMIC Collaborative Research Database for the data provided. We thank DINGDING translation (www.dingdingfanyi.com) for its assistance during the prepatation of the manuscript.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Abbreviations

ARDS, Acute respiratory distress syndrome; eICU, The Telehealth Intensive Care Unit; MIMIC-IV, Medical Information Mart for Intensive Care IV; PEEP, Positive end-expiratory pressure; APACHE IV, Acute physiology and chronic health evaluation IV; APSIII, Acute physiology score III; HCO3, Bicarbnonate; PT, Prothrombin time; BUN, Blood urea nitrogen; WBC, White blood cell count; IQR, Interquartile range; AUROC, The area under the receiver operating characteristic curve; BMI, Body mass index; LOS, Length of stay; SAILS, Statins for acutely injured lungs from sepsis; CAP, Community-acquired pneumonia; CHF, Congestive heart failure; COPD, Chronic obstructive pulmonary disease; AKI, Acute Kidney Injury; DIC, Disseminated intravascular coagulation; MICU, Medical ICUs; SICU, Surgical ICUs; Med-Surg ICU, Medical-surgical ICUs.

References

Bellani G., Laffey J. G., Pham T., Fan E., Brochard L., Esteban A., et al. (2016). Epidemiology, patterns of care, and mortality for patients with acute respiratory distress syndrome in intensive care units in 50 countries. JAMA 315 (8), 788–800. doi:10.1001/jama.2016.0291

PubMed Abstract | CrossRef Full Text | Google Scholar

Bi Q., Goodman K. E., Kaminsky J., Lessler J. (2019). What is machine learning? A primer for the epidemiologist. Am. J. Epidemiol. 188 (12), 2222–2239. doi:10.1093/aje/kwz189

PubMed Abstract | CrossRef Full Text | Google Scholar

Burns D., West T. A., Hawkins K., O'Keefe G. E. (2001). Immediate effects of positive end-expiratory pressure and low and high tidal volume ventilation upon gas exchange and compliance in patients with acute lung injury. J. Trauma 51 (6), 1177–1181. doi:10.1097/00005373-200112000-00026

PubMed Abstract | CrossRef Full Text | Google Scholar

Caironi P., Cressoni M., Chiumello D., Ranieri M., Quintel M., Russo S. G., et al. (2010). Lung opening and closing during ventilation of acute respiratory distress syndrome. Am. J. Respir. Crit. Care Med. 181 (6), 578–586. doi:10.1164/rccm.200905-0787OC

PubMed Abstract | CrossRef Full Text | Google Scholar

Calfee C. S., Delucchi K., Parsons P. E., Thompson B. T., Ware L. B., Matthay M. A., et al. (2014). Subphenotypes in acute respiratory distress syndrome: Latent class analysis of data from two randomised controlled trials. Lancet. Respir. Med. 2 (8), 611–620. doi:10.1016/S2213-2600(14)70097-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Edwards A. S., Kaplan B., Jie T. (2021). A primer on machine learning. Transplantation 105 (4), 699–703. doi:10.1097/TP.0000000000003316

PubMed Abstract | CrossRef Full Text | Google Scholar

Fremont R. D., Koyama T., Calfee C. S., Wu W., Dossett L. A., Bossert F. R., et al. (2010). Acute lung injury in patients with traumatic injuries: Utility of a panel of biomarkers for diagnosis and pathogenesis. J. Trauma 68 (5), 1121–1127. doi:10.1097/TA.0b013e3181c40728

PubMed Abstract | CrossRef Full Text | Google Scholar

Freund Y. (1995). Boosting a weak learning algorithm by majority. Inf. Comput. 121 (2), 256–285. doi:10.1006/inco.1995.1136

CrossRef Full Text | Google Scholar

Friedman J. H. (2002). Stochastic gradient boosting. Comput. Stat. Data Anal. 38 (4), 367–378. doi:10.1016/s0167-9473(01)00065-2

CrossRef Full Text | Google Scholar

Gattinoni L., Caironi P., Cressoni M., Chiumello D., Ranieri V. M., Quintel M., et al. (2006). Lung recruitment in patients with the acute respiratory distress syndrome. N. Engl. J. Med. 354 (17), 1775–1786. doi:10.1056/NEJMoa052052

PubMed Abstract | CrossRef Full Text | Google Scholar

Huang X., Zhang R., Fan G., Wu D., Lu H., Wang D., et al. CHARDSnet group (2020). Incidence and outcomes of acute respiratory distress syndrome in intensive care units of mainland China: A multicentre prospective longitudinal study. Crit. Care 24 (1), 515. doi:10.1186/s13054-020-03112-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Johnson A., Bulgarelli L., Pollard T., Horng S., Celi L. A., Mark R. (2021). MIMIC-IV (version 1.0). PhysioNet. doi:10.13026/s6n6-xd98

CrossRef Full Text | Google Scholar

Liu X., Jiang Y., Jia X., Ma X., Han C., Guo N., et al. (2021). Identification of distinct clinical phenotypes of acute respiratory distress syndrome with differential responses to treatment. Crit. Care 25 (1), 320. doi:10.1186/s13054-021-03734-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Meyer N. J., Gattinoni L., Calfee C. S. (2021). Acute respiratory distress syndrome. Lancet 398 (10300), 622–637. doi:10.1016/S0140-6736(21)00439-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Mikkelsen M. E., Shah C. V., Meyer N. J., Gaieski D. F., Lyon S., Miltiades A. N., et al. (2013). The epidemiology of acute respiratory distress syndrome in patients presenting to the emergency department with severe sepsis. Shock 40 (5), 375–381. doi:10.1097/SHK.0b013e3182a64682

PubMed Abstract | CrossRef Full Text | Google Scholar

Mo J. P., Jia Z. Z., Tang Y., Yang M., Qin H. (2022). Establishing prediction model of community-acquired pneumonia complicated with acute respiratory distress syndrome based on artificial neural network. Zhonghua Wei Zhong Bing Ji Jiu Yi Xue 34, 367–372. doi:10.3760/cma.j.cn121430-20210927-01406

PubMed Abstract | CrossRef Full Text | Google Scholar

Pollard T. J., Johnson A. E. W., Raffa J. D., Celi L. A., Mark R. G., Badawi O. (2018). The eICU collaborative research database, a freely available multi-center database for critical care research. Sci. Data 5, 180178. doi:10.1038/sdata.2018.178

PubMed Abstract | CrossRef Full Text | Google Scholar

Reilly J. P., Wang F., Jones T. K., Palakshappa J. A., Anderson B. J., Shashaty M. G. S., et al. (2018). Plasma angiopoietin-2 as a potential causal marker in sepsis-associated ARDS development: Evidence from mendelian randomization and mediation analysis. Intensive Care Med. 44 (11), 1849–1858. doi:10.1007/s00134-018-5328-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Schreiter D., Reske A., Stichert B., Seiwerts M., Bohm S. H., Kloeppel R., et al. (2004). Alveolar recruitment in combination with sufficient positive end-expiratory pressure increases oxygenation and lung aeration in patients with severe chest trauma. Crit. Care Med. 32 (4), 968–975. doi:10.1097/01.ccm.0000120050.85798.38

PubMed Abstract | CrossRef Full Text | Google Scholar

Seymour C. W., Kennedy J. N., Wang S., Chang C. C. H., Elliott C. F., Xu Z., et al. (2019). Derivation, validation, and potential treatment implications of novel clinical phenotypes for sepsis. JAMA 321 (20), 2003–2017. doi:10.1001/jama.2019.5791

PubMed Abstract | CrossRef Full Text | Google Scholar

Seymour C. W., Liu V. X., Iwashyna T. J., Brunkhorst F. M., Rea T. D., Scherag A., et al. (2016). Assessment of clinical criteria for sepsis: For the third international consensus definitions for sepsis and septic shock (Sepsis-3). JAMA 315 (8), 762–774. doi:10.1001/jama.2016.0288

PubMed Abstract | CrossRef Full Text | Google Scholar

Sinha P., Churpek M. M., Calfee C. S. (2020). Machine learning classifier models can identify acute respiratory distress syndrome phenotypes using readily available clinical data. Am. J. Respir. Crit. Care Med. 202 (7), 996–1004. doi:10.1164/rccm.202002-0347OC

PubMed Abstract | CrossRef Full Text | Google Scholar

Sinha P., Delucchi K. L., McAuley D. F., O'Kane C. M., Matthay M. A., Calfee C. S. (2020). Development and validation of parsimonious algorithms to classify acute respiratory distress syndrome phenotypes: A secondary analysis of randomised controlled trials. Lancet. Respir. Med. 8 (3), 247–257. doi:10.1016/S2213-2600(19)30369-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Sinha P., Delucchi K. L., Thompson B. T., McAuley D. F., Matthay M. A., Calfee C. S., et al. (2018). Latent class analysis of ARDS subphenotypes: A secondary analysis of the statins for acutely injured lungs from sepsis (SAILS) study. Intensive Care Med. 44 (11), 1859–1869. doi:10.1007/s00134-018-5378-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Sinha P., Delucchi K. L., Chen Y., Zhuo H., Abbott J., Wang C., et al. (2022). Latent class analysis-derived subphenotypes are generalisable to observational cohorts of acute respiratory distress syndrome: A prospective study. Thorax 77 (1), 13–21. doi:10.1136/thoraxjnl-2021-217158

PubMed Abstract | CrossRef Full Text | Google Scholar

Thompson B. T., Chambers R. C., Liu K. D. (2017). Acute respiratory distress syndrome. N. Engl. J. Med. 377 (6), 562–572. doi:10.1056/NEJMra1608077

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang Z., Beach D., Su L., Zhai R., Christiani D. C. (2008). A genome-wide expression analysis in blood identifies pre-elafin as a biomarker in ARDS. Am. J. Respir. Cell Mol. Biol. 38 (6), 724–732. doi:10.1165/rcmb.2007-0354OC

PubMed Abstract | CrossRef Full Text | Google Scholar

Ware L. B., Koyama T., Zhao Z., Janz D. R., Wickersham N., Bernard G. R., et al. (2013). Biomarkers of lung epithelial injury and inflammation distinguish severe sepsis patients with acute respiratory distress syndrome. Crit. Care 17 (5), R253. doi:10.1186/cc13080

PubMed Abstract | CrossRef Full Text | Google Scholar

Writing Group for the Alveolar Recruitment for Acute Respiratory Distress Syndrome Trial ART Investigators Cavalcanti A. B., Suzumura É. A., Laranjeira L. N., Paisani D. M., Damiani L. P., Guimarães H. P., et al. (2017). Effect of lung recruitment and titrated positive end-expiratory pressure (PEEP) vs low PEEP on mortality in patients with acute respiratory distress syndrome: A randomized clinical trial. JAMA 318 (14), 1335–1345. doi:10.1001/jama.2017.14171

PubMed Abstract | CrossRef Full Text | Google Scholar

Yoon J. H., Pinsky M. R., Clermont G. (2022). Artificial intelligence in critical care medicine. Crit. Care 26 (1), 75. doi:10.1186/s13054-022-03915-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhao J., Tan Y., Wang L., Shi Y. (2020). Discriminatory ability and prognostic evaluation of presepsin for sepsis-related acute respiratory distress syndrome. Sci. Rep. 10 (1), 9114. doi:10.1038/s41598-020-66121-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: acute respiratory distress syndrome, sepsis, phenotype, cluster analysis, machine learning

Citation: Bai Y, Xia J, Huang X, Chen S and Zhan Q (2022) Using machine learning for the early prediction of sepsis-associated ARDS in the ICU and identification of clinical phenotypes with differential responses to treatment. Front. Physiol. 13:1050849. doi: 10.3389/fphys.2022.1050849

Received: 22 September 2022; Accepted: 28 November 2022;
Published: 12 December 2022.

Edited by:

Yi-Ju Tseng, National Yang Ming Chiao Tung University, Taiwan

Reviewed by:

Chang Su, Temple University, United States
Holger A. Lindner, University of Heidelberg, Germany

Copyright © 2022 Bai, Xia, Huang, Chen and Zhan. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Qingyuan Zhan, ZHJ6aGFucXlAMTYzLmNvbQ==

These authors share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.