Prediction of postoperative cardiopulmonary complications after lung resection in a Chinese population: A machine learning-based study

Huang, Guanghua; Liu, Lei; Wang, Luyi; Li, Shanqing

doi:10.3389/fonc.2022.1003722

ORIGINAL RESEARCH article

Front. Oncol., 23 September 2022

Sec. Thoracic Oncology

Volume 12 - 2022 | https://doi.org/10.3389/fonc.2022.1003722

Prediction of postoperative cardiopulmonary complications after lung resection in a Chinese population: A machine learning-based study

Guanghua Huang

Lei Liu

Luyi Wang

Shanqing Li^*

Department of Thoracic Surgery, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China

Background: Approximately 20% of patients with lung cancer would experience postoperative cardiopulmonary complications after anatomic lung resection. Current prediction models for postoperative complications were not suitable for Chinese patients. This study aimed to develop and validate novel prediction models based on machine learning algorithms in a Chinese population.

Methods: Patients with lung cancer receiving anatomic lung resection and no neoadjuvant therapies from September 1, 2018 to August 31, 2019 were enrolled. The dataset was split into two cohorts at a 7:3 ratio. The logistic regression, random forest, and extreme gradient boosting were applied to construct models in the derivation cohort with 5-fold cross validation. The validation cohort accessed the model performance. The area under the curves measured the model discrimination, while the Spiegelhalter z test evaluated the model calibration.

Results: A total of 1085 patients were included, and 760 were assigned to the derivation cohort. 8.4% and 8.0% of patients experienced postoperative cardiopulmonary complications in the two cohorts. All baseline characteristics were balanced. The values of the area under the curve were 0.728, 0.721, and 0.767 for the logistic, random forest and extreme gradient boosting models, respectively. No significant differences existed among them. They all showed good calibration (p > 0.05). The logistic model consisted of male, arrhythmia, cerebrovascular disease, the percentage of predicted postoperative forced expiratory volume in one second, and the ratio of forced expiratory volume in one second to forced vital capacity. The last two variables, the percentage of forced vital capacity and age ranked in the top five important variables for novel machine learning models. A nomogram was plotted for the logistic model.

Conclusion: Three models were developed and validated for predicting postoperative cardiopulmonary complications among Chinese patients with lung cancer. They all exerted good discrimination and calibration. The percentage of predicted postoperative forced expiratory volume in one second and the ratio of forced expiratory volume in one second to forced vital capacity might be the most important variables. Further validation in different scenarios is still warranted.

Introduction

Lung cancer is the most common cancer in China, accounting for 23.9% of new cancer cases and 18.1% of cancer deaths (1). Surgery is a mainstay in the treatment of lung cancer. Postoperative cardiopulmonary complications may occur in approximately 20% of patients (2, 3). They are associated with higher risks of readmission, chronicity of these complications, and cancer recurrence (2–5). Therefore, reducing the incidence of complications is of vital importance for clinicians. Some strategies have been developed, such as prehabilitation and standardized enhanced recovery after surgery programs (6, 7). Among them, preoperational screening has the highest cost-effectiveness. An accurate prediction model is critical for screening and can enhance shared preoperative decision-making and medical care quality monitoring.

Several prediction models have been established over the past few years, including the Brunelli, Eurolung, and parsimonious Eurolung models (8–10). Some comorbidity risk calculators, such as the age-adjusted Charlson Comorbidity Index (ACCI), have also shown potential predictive efficacy (11). Most of these models were built based on the European population. However, these models did not perform satisfactory discrimination among the Chinese population due to patient characteristics discrepancies, with the values of the area under the curve (AUC) less than 0.7 (12). Predictive studies based on Chinese populations have mainly focused on a certain type of complication, a subgroup of patients, or a particular predictor (13–18). For example, Li et al. developed two prediction models for pneumonia and arrhythmia, but they did not pay attention to prolonged air leak, atelectasis, and other complications (16). Currently, no generalized models have been established, which could predict the overall incidence of complications and be suitable for the broad Chinese population. Recent advances in machine learning enhance the development of prediction models. The random forest and extreme gradient boosting (XGBoost) algorithms show promising performance and often outperform the logistic model (19). However, only a few studies applied machine learning to develop models for postoperative cardiopulmonary complications (20).

Therefore, this study aimed to develop and validate generalized prediction models for postoperative cardiopulmonary complications based on a Chinese population. It would be the first to address the needs of Chinese patients while applying machine learning. This article was presented based on the transparent reporting of a multivariable prediction model for individual prognosis or diagnosis reporting checklist (21).

Materials and methods

Patient selection

This retrospective study collected information on patients who underwent lung surgeries at our center from September 1, 2018 to August 31, 2019. Patients were eligible if they were > 18 years old, had undergone anatomic lung surgeries, and had no prior neoadjuvant therapies. Patients who lacked lung function metrics or were confirmed to have non-lung cancer by pathological reports were excluded. The study protocol was reviewed and approved by the Institutional Review Board of Peking Union Medical College Hospital (No. K2038). The requirement for informed consent was waived due to the study’s retrospective nature.

Variables and outcomes

Information on sex, age, body mass index (BMI), history of smoking and alcohol intake, comorbidities, forced expiratory volume in one second (FEV1), forced vital capacity (FVC), surgical procedure, and extended resection was collected. The Charlson Comorbidity Indices (CCI) and FEV1/FVC were calculated. The percentage of predicted postoperative forced expiratory volume in one second (ppoFEV1%) was calculated as follows: (FEV1/predicted FEV1) x (1-a/b), where a was the number of removed segments and b was the number of total segments (22). The predicted FEV1 and FVC were estimated using formulas initially developed from Chinese populations rather than Caucasian populations (23).

Outcome variables included prolonged air leakage, pneumonia, pulmonary edema, atelectasis, arrhythmia, acute myocardial infarction, and other complications listed at length in our previous study (12). Their definitions were based on the instructions of the Society of Thoracic Surgeons and the European Society of Thoracic Surgeons (24).

Model development and validation

Pearson or Spearman correlation analysis was first performed using the ‘stats’ R package for age, ACCI, CCI, and lung function metrics, according to their normality. Those pairs with correlation coefficients > 0.7 were carefully screened based on clinical experiences. The remaining variables were used to develop predictive models through three algorithms: logistic regression, random forest, and XGBoost. The entire dataset was randomly split into a derivation cohort and a validation cohort at a 7:3 ratio using the ‘caret’ R package. As for the logistic model performed using the ‘stats’ R package, all variables were screened by univariate analysis first, and then those with p values < 0.05 were further underwent backward stepwise selection. Akaike’s information criterion was implemented during stepwise selection, simplifying the model and maintaining its efficacy. A nomogram was plotted using the ‘rms’ package based on the logistic model to facilitate model interpretation. Random forest, performed with the ‘randomForest’ package, is a bagging-based machine learning method that reduces the risk of overfitting, determines feature importance, and has high flexibility. XGBoost, performed using the ‘xgboost’ package, is a boosting-based method designed to be highly efficient, flexible, and portable. Random searches and 5-fold cross-validation performed with the ‘mlr’ package were employed for hyperparameter tuning of the machine learning models. The random forest and XGBoost models were interpreted based on the mean decreased Gini index and total gain, respectively.

The models were constructed and internally validated in the derivation cohort. Five-fold cross-validation is a well-accepted method for internal validations. The validation cohort was used solely to measure the model performance. Model discrimination and calibration must be reported, whereas sensitivity, specificity, and accuracy are optional. The AUC assessed discrimination. An AUC > 0.7 was regarded as good discrimination, while AUC > 0.6 meant acceptable discrimination. DeLong’s test was used to compare the differences between two AUCs. Calibration curves were plotted after correcting for bias using 1000 bootstrap iterations. A perfect curve is closely fitted to the diagonal line. In addition, the Spiegelhalter z test assessed the calibration accuracy, and a non-significant p value indicated good calibration (25, 26). The AUC and DeLong’s test were performed with the ‘pROC’ package, while calibration curves and z test were performed with the ‘rms’ package. Numeric variables with normal distribution were described as means and standard deviations, and non-normally distributed variables were expressed as medians and interquartile ranges. Categorical variables were presented as counts and percentages. Group differences of numeric variables were tested using the t-test or the median test, and those of categorical variables were compared using the chi-square test or Fisher’s exact test, according to their distributions. Statistical significance was set at p < 0.05. All analyses were performed using R version 4.1.2 (RRID: SCR_001905).

Results

The flow chart of patient selection is presented in Figure 1. After selection, 1085 patients were included in the final analysis, of whom 760 patients (70%) were randomly assigned to the derivation cohort. This was a complete case analysis. No missing data needed to be handled. The correlation coefficients among the aforementioned variables are listed in Supplementary Table 1. CCI not ACCI, ppoFEV1% not FEV1 or FEV1%, FVC% not FVC, and FEV1/FVC were selected. The demographic, clinical, and surgical characteristics of the two cohorts are summarized in Table 1. All baseline characteristics were balanced. The ppoFEV1%, FVC%, and FEV1/FVC for both cohorts were 76.8 vs. 75.2 (p = 0.094), 89.8 vs. 88.5 (p = 0.192), and 76.0 vs. 75.9 (p = 0.865), respectively. Sixty-four (8.4%) and 26 (8.0%) patients experienced postoperative cardiopulmonary complications in the derivation and validation cohorts, respectively. Details of postoperative cardiopulmonary complications were described in our previous study (12).

FIGURE 1

Figure 1 The flow chart of patient selection.

TABLE 1

Table 1 Characteristics of the derivation cohort and the validation cohort.

Logistic model

The results of the univariate logistic analysis are summarized in Supplementary Table 2. Nine variables were statistically significant: male sex, age, smoking status, alcohol use, chronic obstructive pulmonary disease, arrhythmia, cerebrovascular disease, ppoFEV1%, and FEV1/FVC ratio. They were further screened using a backward stepwise selection, which yielded five variables. The coefficients and odds ratios (OR) are listed in Table 2. Male sex (OR 1.986, 95% confidence interval [CI] 1.142-3.454, p = 0.015), arrhythmia (OR 3.606, 95%CI 1.095-11.880, p = 0.035), and cerebrovascular disease (OR 5.415, 95%CI 1.852-15.832, p = 0.002) were independent risk factors, while FEV1/FVC (OR 0.020, 95%CI 0.001-0.810, p = 0.038) was an independent protective factor. The logistic equation was as follows: 1.430 + 0.686 × male (yes = 1) + 1.283 × arrhythmia (yes = 1) + 1.689 × cerebrovascular disease (yes = 1) - 1.859 × ppoFEV1% - 3.894 × FEV1/FVC. The mean value of the AUC in the 5-fold cross-validation was 0.722, and its mean accuracy reached 0.787, indicating a qualified model performance for further validation. Table 3 summarizes the metrics of model performance. The validation cohort also displayed good discrimination (AUC 0.728, 95% CI 0.619-0.836, Figure 2) and good calibration (p = 0.656 > 0.05). A nomogram was also plotted to facilitate clinical use (Figure 3). Moreover, an online calculator of the nomogram can be found at https://onlinepresentation.shinyapps.io/complication.

TABLE 2

Table 2 Risk factors and their parameters of the logistic model.

TABLE 3

Table 3 Model performance of the logistic model, random forest model and XGBoost model.

FIGURE 2

Figure 2 Performance of three models. (A) shows the receiver operating characteristic curves. (B) shows the calibration curves. The blue line indicates the logistic model The red line indicates the random forest model. The yellow line indicates the XGBoost model.

FIGURE 3

Figure 3 The nomogram of the logistic model. CVD, cerebrovascular disease; ppoFEV1%, the percentage of predicted postoperative forced expiratory volume in one second; FEV1/FVC, the ratio of forced expiratory volume in one second to forced vital capacity.

Machine learning models

In the random forest model, the hyperparameters were set as follows: number of trees = 300, node size = 8, maximum nodes = 8, and mtry = 1. The mean AUC was 0.718 in the 5-fold internal cross-validation. In the validation cohort, the AUC reached 0.721 (95% CI 0.614-0.828), and good calibration was obtained (p = 0.628 > 0.05). The calibration curve of the random forest model was the closest to the diagonal line (Figure 2). Sensitivity and specificity were 0.692 and 0.699, respectively. The feature importance is illustrated in Figure 4A. PpoFEV1%, FEV1/FVC, FVC%, age, and cerebrovascular disease were the top five important variables. Male sex ranked seventh, while arrhythmia ranked tenth. The mean decreases in the Gini indices of ppoFEV1% and FEV1/FVC were visually higher than others, indicating their prominent roles in prediction.

FIGURE 4

Figure 4 The feature importance of (A) the random forest model and (B) the XGBoost model. PpoFEV1%, the percentage of predicted postoperative forced expiratory volume in one second; FEV1/FVC, the ratio of forced expiratory volume in one second to forced vital capacity; FVC%, the percentage of forced vital capacity; CVD, cerebrovascular disease; BMI, body mass index; COPD, chronic obstructive pulmonary disease; CCI, the Charlson Comorbidity Index; CAD, coronary artery disease; ILD, Interstitial lung disease; DM, diabetes mellitus; HTN, hypertension; CKD, chronic kidney disease.

In the XGBoost model, the tuned hyperparameters are indicated as follows: booster = gbtree; objective = binary:logistic, nround = 97, max_depth = 13, eta = 0.29, min_child_weight = 15.7, subsample = 0.558, colsample_bytree = 0.659, gamma = 0. It performed well in internal validation, with a mean AUC of 0.727. The XGBoost model also showed good discrimination (AUC 0.767, 95% CI 0.671-0.862) and calibration (p = 0.368 > 0.05) simultaneously in the validation cohort. The sensitivity, specificity, and accuracy were 0.692, 0.749, and 0.745, respectively (Table 3). Figure 4B shows the importance of variables. The top five variables were ppoFEV1%, BMI, FVC%, FEV1/FVC, and age, while male sex ranked sixth.

Although the AUC of the XGBoost model was the highest among the three models, no significant differences were observed after examining by DeLong’s test (logistic vs. random forest, p = 0.801; logistic vs. XGBoost, p = 0.600; random forest vs. XGBoost, p = 0.534).

Discussion

This was the first study that met Chinese patients’ demand of prediction of postoperative cardiopulmonary complications after lung resection. Three models using various algorithms were developed and validated internally, and all of them showed good discrimination and calibration. PpoFEV1% and FEV1/FVC were identified as the most important predictive factors.

Three algorithms were used for model development: logistic regression, random forest, and XGBoost. They all have their strengths and weaknesses. The conventional logistic regression is strong in convenient implementation, clear presentation, and intuitive interpretation, but weak in capturing nonlinear relationships between outcomes and variables. As for novel machine learning algorithms like random forest and XGBoost, although they are difficult to interpret and apply in clinical settings due to technical problems, they can provide more insights when dealing with high-dimensional and nonlinear datasets. Hence, they are widely applied in prediction using radiomics, genomics, and large databases (27–29). Many studies showed the predictive ability of novel machine learning models was better than that of the logistic model, but Choi et al. still suggested the logistic model should serve as a benchmark due to its easy interpretation (19, 30). Our results showed that the AUC of the logistic model (AUC = 0.728) was lower than that of XGBoost model (AUC = 0.767), but higher than that of random forest model (AUC = 0.721). The p values for DeLong’s test and z test were all greater than 0.05. Therefore, we concluded that the logistic model had non-inferior performance to the random forest and XGBoost models. The possible reason was that the nonlinear relationships were weak, concealing the performance gaps among them. After a thorough consideration of model performance and interpretation, we recommend using the logistic model rather than the random forest or XGBoost model in clinical practice.

The three models had several important variables in common: ppoFEV1%, FEV1/FVC, FVC%, age, and cerebrovascular diseases. These contributed to postoperative cardiopulmonary complications in different ways. PpoFEV1%, FEV1/FVC, and FVC% reflect lung function. Patients’ lung function decreases significantly after surgery. The resistance of small airways increases, while the mucociliary clearance ability decreases. Afterward airway obstruction occurs, resulting in atelectasis, pneumonia, and more complications. On the other hand, the reduced pulmonary perfusion and increased circulatory resistance lead to elevated cardiac load and decreased oxygen supply, causing hypoxemia, arrhythmia, and others. PpoFEV1% is regarded as one of the most important indicators of postoperative cardiopulmonary complications and mortality. One reason is that ppoFEV1% is a synthetic parameter adjusted by height, age, sex, and the extent of surgical resections. Hence, it was also included in the Brunelli, Eurolung model, and European Society Objective Scores (8, 9, 13, 31, 32). A lower ppoFEV1% reflects the reduction of lung volume and decreased lung function. This study showed that a lower FEV1/FVC value, indicating weaker lung function, was associated with a higher risk of complications, which was consistent with results of previous studies (33, 34). FVC%, an adjusted ventilatory function indicator, also served as a predictor in the Brunelli model (8). Old age has been identified as a risk factor for postoperative complications consistently (32, 35–37). Elder patients have poor physical performance in terms of respiratory muscle strength and eliminating pathogens. Cerebrovascular diseases may impair a patient’s neurological function and mobility. Therefore, they are not conducive to postoperative recovery, but are prone to develop complications.

An accurate prediction model shows great importance in multiple settings. Regarding preoperative decision-making, the risk of complications can be easily calculated using the logistic model or nomogram, which makes precise management possible. For example, according to our logistic model, the probability of developing complications would be 36.5% for a male patient with a ppoFEV1% of 60%, an FEV1/FVC of 40%, and no arrhythmia or cerebrovascular diseases. The patient could directly know his risk of complications before surgery, which may help him better weigh the risks against the benefits and would improve compliance during subsequent treatment. Moreover, to gain better outcomes, clinicians could advise the patient to use bronchodilators and perform preoperative pulmonary rehabilitation which significantly improves FVC%, FEV1%, and FEV1/FVC (38–40). Clinicians could also consider performing a segmentectomy rather than lobectomy to improve ppoFEV1% if possible. For hospital managers and policymakers, a risk-adjusted model facilitates medical care quality monitoring. For instance, Pompili et al. used the Eurolung models to evaluate the performance of three thoracic medical centers (41). The rationale for this is clear; if the observed morbidity or mortality are lower than the predicted values, it indicates good performance. Risk models could also help audit the performance of a surgeon, a new instrument, or a novel surgical technique. By collecting these quantitative data, managers and policymakers can further identify root causes of problems and take appropriate actions to improve the quality of care. As regards medical education, a good prediction model helps students recognize the most meaningful factors and perform patient assessments quickly.

Nevertheless, our study has several limitations. First, an inevitable selection bias may exist because of the study’s retrospective nature. For instance, very few patients had chronic kidney disease or interstitial lung disease at our center. Therefore, our results cannot be directly applied to medical centers with different patient distributions. Second, some potentially essential variables could not be captured, which is a common phenomenon in model development using large databases. For example, the percent of diffusion capacity for carbon monoxide of the lung (DLco%) and the maximal oxygen consumption (VO₂max), which were strongly associated with postoperative complications, were not included in this study (42, 43). DLco% and VO₂max are not routinely evaluated in many medical centers including ours. Third, the models did not undergo extensive external validation; therefore, their efficacy must be further verified. However, we applied a 5-fold cross-validation for internal validation and tested their performance in a separate cohort, and the models showed consistently good predictive ability.

In conclusion, three models using logistic regression, random forest and XGBoost were developed and validated successfully for the prediction of postoperative cardiopulmonary complications after anatomic lung resection. The models were suitable for Chinese patients. PpoFEV1% and FEV1/FVC may be the most important predictive factors. Extensive external validation is warranted to verify the model’s performance in various clinical scenarios.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving human participants were reviewed and approved by the Institutional Review Board of Peking Union Medical College Hospital. The ethics committee waived the requirement of written informed consent for participation.

Author contributions

GH, LL, and SL contributed to conception and design of the study. GH and LL organized the database. GH performed the statistical analysis. GH wrote the first draft of the manuscript. GH and LW wrote sections of the manuscript. All authors contributed to manuscript revision, read, and approved the submitted version

Funding

This study was funded by the College Student Innovation Training Program of Peking Union Medical College, Beijing (No. 2022zglc06050 to GH).

Acknowledgments

We would like to thank Editage (www.editage.cn) for English language editing.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2022.1003722/full#supplementary-material

References

1. Xia C, Dong X, Li H, Cao M, Sun D, He S, et al. Cancer statistics in China and united states, 2022: Profiles, trends, and determinants. Chin Med J (Engl) (2022) 135(5):584–90. doi: 10.1097/cm9.0000000000002108

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Nojiri T, Inoue M, Takeuchi Y, Maeda H, Shintani Y, Sawabata N, et al. Impact of cardiopulmonary complications of lung cancer surgery on long-term outcomes. Surg Today (2015) 45(6):740–5. doi: 10.1007/s00595-014-1032-z

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Nojiri T, Hamasaki T, Inoue M, Shintani Y, Takeuchi Y, Maeda H, et al. Long-term impact of postoperative complications on cancer recurrence following lung cancer surgery. Ann Surg Oncol (2017) 24(4):1135–42. doi: 10.1245/s10434-016-5655-8

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Brown LM, Thibault DP, Kosinski AS, Cooke DT, Onaitis MW, Gaissert HA, et al. Readmission after lobectomy for lung cancer: Not all complications contribute equally. Ann Surg (2021) 274(1):e70–e9. doi: 10.1097/sla.0000000000003561

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Okada S, Shimomura M, Ishihara S, Ikebe S, Furuya T, Inoue M. Clinical significance of postoperative pulmonary complications in elderly patients with lung cancer. Interact Cardiovasc Thorac Surg (2022) 35(2):ivac153. doi: 10.1093/icvts/ivac153

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Liu Z, Qiu T, Pei L, Zhang Y, Xu L, Cui Y, et al. Two-week multimodal prehabilitation program improves perioperative functional capability in patients undergoing thoracoscopic lobectomy for lung cancer: A randomized controlled trial. Anesth Analg (2020) 131(3):840–9. doi: 10.1213/ane.0000000000004342

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Rogers LJ, Bleetman D, Messenger DE, Joshi NA, Wood L, Rasburn NJ, et al. The impact of enhanced recovery after surgery (Eras) protocol compliance on morbidity from resection for primary lung cancer. J Thorac Cardiovasc Surg (2018) 155(4):1843–52. doi: 10.1016/j.jtcvs.2017.10.151

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Brunelli A, Xiume F, Al Refai M, Salati M, Marasco R, Sabbatini A. Risk-adjusted morbidity, mortality and failure-to-Rescue models for internal provider profiling after major lung resection. Interact Cardiovasc Thorac Surg (2006) 5(2):92–6. doi: 10.1510/icvts.2005.118703

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Brunelli A, Salati M, Rocco G, Varela G, Van Raemdonck D, Decaluwe H, et al. European Risk models for morbidity (Eurolung1) and mortality (Eurolung2) to predict outcome following anatomic lung resections: An analysis from the European society of thoracic surgeons database. Eur J Cardiothorac Surg (2017) 51(3):490–7. doi: 10.1093/ejcts/ezw319

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Brunelli A, Cicconi S, Decaluwe H, Szanto Z, Falcoz PE. Parsimonious eurolung risk models to predict cardiopulmonary morbidity and mortality following anatomic lung resections: An updated analysis from the European society of thoracic surgeons database. Eur J Cardiothorac Surg (2020) 57(3):455–61. doi: 10.1093/ejcts/ezz272

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Maezawa Y, Aoyama T, Kano K, Tamagawa H, Numata M, Hara K, et al. Impact of the age-adjusted charlson comorbidity index on the short- and long-term outcomes of patients undergoing curative gastrectomy for gastric cancer. J Cancer (2019) 10(22):5527–35. doi: 10.7150/jca.35465

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Huang G, Liu L, Wang L, Wang Z, Wang Z, Li S. External validation of five predictive models for postoperative cardiopulmonary morbidity in a Chinese population receiving lung resection. PeerJ (2022) 10:e12936. doi: 10.7717/peerj.12936

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Mao X, Zhang W, Ni YQ, Niu Y, Jiang LY. A prediction model for postoperative pulmonary complication in pulmonary function-impaired patients following lung resection. J Multidiscip Healthc (2021) 14:3187–94. doi: 10.2147/jmdh.S327285

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Chang S, Zhou K, Wang Y, Lai Y, Che G. Prognostic value of preoperative peak expiratory flow to predict postoperative pulmonary complications in surgical lung cancer patients. Front Oncol (2021) 11:782774. doi: 10.3389/fonc.2021.782774

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Yao L, Luo J, Liu L, Wu Q, Zhou R, Li L, et al. Risk factors for postoperative pneumonia and prognosis in lung cancer patients after surgery: A retrospective study. Med (Baltimore) (2021) 100(13):e25295. doi: 10.1097/md.0000000000025295

CrossRef Full Text | Google Scholar

16. Li Y, Ma YL, Gao YY, Wang DD, Chen Q. Analysis of the risk factors of postoperative cardiopulmonary complications and ability to predicate the risk in patients after lung cancer surgery. J Thorac Dis (2017) 9(6):1565–73. doi: 10.21037/jtd.2017.05.42

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Li S, Zhou K, Lai Y, Shen C, Wu Y, Che G. Estimated intraoperative blood loss correlates with postoperative cardiopulmonary complications and length of stay in patients undergoing video-assisted thoracoscopic lung cancer lobectomy: A retrospective cohort study. BMC Surg (2018) 18(1):29. doi: 10.1186/s12893-018-0360-0

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Wang C, Wang S, Li Z, He W. A multiple-center nomogram to predict pneumonectomy complication risk for non-small cell lung cancer patients. Ann Surg Oncol (2022) 29(1):561–9. doi: 10.1245/s10434-021-10504-1

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Bolourani S, Wang P, Patel VM, Manetta F, Lee PC. Predicting respiratory failure after pulmonary lobectomy using machine learning techniques. Surgery (2020) 168(4):743–52. doi: 10.1016/j.surg.2020.05.032

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Salati M, Migliorelli L, Moccia S, Andolfi M, Roncon A, Guiducci GM, et al. A machine learning approach for postoperative outcome prediction: Surgical data science application in a thoracic surgery setting. World J Surg (2021) 45(5):1585–94. doi: 10.1007/s00268-020-05948-7

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Collins GS, Reitsma JB, Altman DG, Moons KG. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (Tripod): The tripod statement. Bmj (2015) 350:g7594. doi: 10.1136/bmj.g7594

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Beckles MA, Spiro SG, Colice GL, Rudd RM. The physiologic evaluation of patients with lung cancer being considered for resectional surgery. Chest (2003) 123(Suppl 1):105s–14s. doi: 10.1378/chest.123.1_suppl.105s

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Jian W, Gao Y, Hao C, Wang N, Ai T, Liu C, et al. Reference values for spirometry in Chinese aged 4-80 years. J Thorac Dis (2017) 9(11):4538–49. doi: 10.21037/jtd.2017.10.110

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Fernandez FG, Falcoz PE, Kozower BD, Salati M, Wright CD, Brunelli A. The society of thoracic surgeons and the European society of thoracic surgeons general thoracic surgery databases: Joint standardization of variable definitions and terminology. Ann Thorac Surg (2015) 99(1):368–76. doi: 10.1016/j.athoracsur.2014.05.104

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Demirjian S, Bashour CA, Shaw A, Schold JD, Simon J, Anthony D, et al. Predictive accuracy of a perioperative laboratory test-based prediction model for moderate to severe acute kidney injury after cardiac surgery. JAMA (2022) 327(10):956–64. doi: 10.1001/jama.2022.1751

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Huang Y, Li W, Macheret F, Gabriel RA, Ohno-Machado L. A tutorial on calibration measurements and calibration models for clinical prediction models. J Am Med Inform Assoc (2020) 27(4):621–33. doi: 10.1093/jamia/ocz228

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Yu Y, He Z, Ouyang J, Tan Y, Chen Y, Gu Y, et al. Magnetic resonance imaging radiomics predicts preoperative axillary lymph node metastasis to support surgical decisions and is associated with tumor microenvironment in invasive breast cancer: A machine learning, multicenter study. EBioMedicine (2021) 69:103460. doi: 10.1016/j.ebiom.2021.103460

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Poirion OB, Jing Z, Chaudhary K, Huang S, Garmire LX. Deepprog: An ensemble of deep-learning and machine-learning models for prognosis prediction using multi-omics data. Genome Med (2021) 13(1):112. doi: 10.1186/s13073-021-00930-x

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Goto T, Camargo CA Jr., Faridi MK, Freishtat RJ, Hasegawa K. Machine learning-based prediction of clinical outcomes for children during emergency department triage. JAMA Netw Open (2019) 2(1):e186937. doi: 10.1001/jamanetworkopen.2018.6937

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Choi Y, Boo Y. Comparing logistic regression models with alternative machine learning methods to predict the risk of drug intoxication mortality. Int J Environ Res Public Health (2020) 17(3):897. doi: 10.3390/ijerph17030897

CrossRef Full Text | Google Scholar

31. Bradley A, Marshall A, Abdelaziz M, Hussain K, Agostini P, Bishay E, et al. Thoracoscore fails to predict complications following elective lung resection. Eur Respir J (2012) 40(6):1496–501. doi: 10.1183/09031936.00218111

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Sezen CB, Gokce A, Kalafat CE, Aker C, Tastepe AI. Risk factors for postoperative complications and long-term survival in elderly lung cancer patients: A single institutional experience in Turkey. Gen Thorac Cardiovasc Surg (2019) 67(5):442–9. doi: 10.1007/s11748-018-1031-x

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Ferguson MK, Siddique J, Karrison T. Modeling major lung resection outcomes using classification trees and multiple imputation techniques. Eur J Cardiothorac Surg (2008) 34(5):1085–9. doi: 10.1016/j.ejcts.2008.07.037

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Roy E, Rheault J, Pigeon MA, Ugalde PA, Racine C, Simard S, et al. Lung cancer resection and postoperative outcomes in copd: A single-center experience. Chron Respir Dis (2020) 17:1479973120925430. doi: 10.1177/1479973120925430

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Motono N, Ishikawa M, Iwai S, Iijima Y, Usuda K, Uramoto H. Individualization of risk factors for postoperative complication after lung cancer surgery: A retrospective study. BMC Surg (2021) 21(1):311. doi: 10.1186/s12893-021-01305-0

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Hino H, Karasaki T, Yoshida Y, Fukami T, Sano A, Tanaka M, et al. Risk factors for postoperative complications and long-term survival in lung cancer patients older than 80 years. Eur J Cardiothorac Surg (2018) 53(5):980–6. doi: 10.1093/ejcts/ezx437

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Motono N, Ishikawa M, Iwai S, Yamagata A, Iijima Y, Uramoto H. Analysis of risk factors for postoperative complications in non-small cell lung cancer: Comparison with the Japanese national clinical database risk calculator. BMC Surg (2022) 22(1):180. doi: 10.1186/s12893-022-01628-6

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Brunelli A, Charloux A, Bolliger CT, Rocco G, Sculier JP, Varela G, et al. Ers/Ests clinical guidelines on fitness for radical therapy in lung cancer patients (Surgery and chemo-radiotherapy). Eur Respir J (2009) 34(1):17–41. doi: 10.1183/09031936.00184308

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Brunelli A, Kim AW, Berger KI, Addrizzo-Harris DJ. Physiologic evaluation of the patient with lung cancer being considered for resectional surgery: Diagnosis and management of lung cancer, 3rd Ed: American college of chest physicians evidence-based clinical practice guidelines. Chest (2013) 143(5 Suppl):e166S–e90S. doi: 10.1378/chest.12-2395

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Wang YQ, Liu X, Jia Y, Xie J. Impact of breathing exercises in subjects with lung cancer undergoing surgical resection: A systematic review and meta-analysis. J Clin Nurs (2019) 28(5-6):717–32. doi: 10.1111/jocn.14696

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Pompili C, Shargall Y, Decaluwe H, Moons J, Chari M, Brunelli A. Risk-adjusted performance evaluation in three academic thoracic surgery units using the eurolung risk models. Eur J Cardiothorac Surg (2018) 54(1):122–6. doi: 10.1093/ejcts/ezx483

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Ohsawa M, Tsutani Y, Fujiwara M, Mimae T, Miyata Y, Okada M. Predicting severe postoperative complication in patients with lung cancer and interstitial pneumonia. Ann Thorac Surg (2020) 109(4):1054–60. doi: 10.1016/j.athoracsur.2019.11.012

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Rocco G, Gatani T, Di Maio M, Meoli I, La Rocca A, Martucci N, et al. The impact of decreasing cutoff values for maximal oxygen consumption (Vo(2)Max) in the decision-making process for candidates to lung cancer surgery. J Thorac Dis (2013) 5(1):12–8. doi: 10.3978/j.issn.2072-1439.2012.12.04

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: lung cancer, prediction model, machine learning, postoperative complication, ppoFEV1%, FEV1/FVC

Citation: Huang G, Liu L, Wang L and Li S (2022) Prediction of postoperative cardiopulmonary complications after lung resection in a Chinese population: A machine learning-based study. Front. Oncol. 12:1003722. doi: 10.3389/fonc.2022.1003722

Received: 26 July 2022; Accepted: 12 September 2022;
Published: 23 September 2022.

Edited by:

Wei Song, Wuhan University, China

Reviewed by:

Xiaohui Chen, Fujian Medical University Cancer Hospital, China
Jinghong Liang, Sun Yat-sen University, China

Copyright © 2022 Huang, Liu, Wang and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Shanqing Li, bGlzaGFucWluZ0BwdW1jaC5jbg==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.