- 1State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Shanghai Institute of Hematology, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
- 2Department of Cardiovascular Surgery, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
- 3Department of Cardiovascular Surgery, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
- 4Department of Cardiovascular Surgery, Shanghai East Hospital, Tongji University School of Medicine, Shanghai, China
Background: Our study aimed to develop machine learning algorithms capable of predicting red blood cell (RBC) transfusion during valve replacement surgery based on a preoperative dataset of the non-anemic cohort.
Methods: A total of 423 patients who underwent valvular replacement surgery from January 2015 to December 2020 were enrolled. A comprehensive database that incorporated demographic characteristics, clinical conditions, and results of preoperative biochemistry tests was used for establishing the models. A range of machine learning algorithms were employed, including decision tree, random forest, extreme gradient boosting (XGBoost), categorical boosting (CatBoost), support vector classifier and logistic regression (LR). Subsequently, the area under the receiver operating characteristic curve (AUC), accuracy, recall, precision, and F1 score were used to determine the predictive capability of the algorithms. Furthermore, we utilized SHapley Additive exPlanation (SHAP) values to explain the optimal prediction model.
Results: The enrolled patients were randomly divided into training set and testing set according to the 8:2 ratio. There were 16 important features identified by Sequential Backward Selection for model establishment. The top 5 most influential features in the RF importance matrix plot were hematocrit, hemoglobin, ALT, fibrinogen, and ferritin. The optimal prediction model was CatBoost algorithm, exhibiting the highest AUC (0.752, 95% CI: 0.662–0.780), which also got relatively high F1 score (0.695). The CatBoost algorithm also showed superior performance over the LR model with the AUC (0.666, 95% CI: 0.534–0.697). The SHAP summary plot and the SHAP dependence plot were used to visually illustrate the positive or negative effects of the selected features attributed to the CatBoost model.
Conclusions: This study established a series of prediction models to enhance risk assessment of intraoperative RBC transfusion during valve replacement in no-anemic patients. The identified important predictors may provide effective preoperative interventions.
1 Introduction
Cardiac surgery patients are prone to suffer from blood transfusion therapy. Firstly, cardiovascular patients have a higher incidence of perioperative anemia (1). Secondly, cardiac surgery easily encounters ongoing blood loss and hemodilution intraoperatively. Thus, allogeneic transfusion is the standard treatment for perioperative anemia in patients undergoing cardiac surgery, and a wide variety of blood products have saved most patients’ lives. Historical cohort studies have consistently shown that allogeneic transfusion, particularly when administered intraoperatively, is related to increased postoperative complications, including infectious disease, renal failure, acute lung injury, and neurological adverse events (2–4). Our previous cohort study also revealed that intraoperative red blood cell (RBC) transfusion significantly increased the risk of postoperative hypoxemia in non-anemic adults undergoing isolated valve replacements (5). Therefore, it is widely accepted that intraoperative transfusion can act as a trigger for adverse events in cardiac surgery.
With the rapid advances in artificial intelligence, data-driven machine learning algorithms are being used to predict diverse clinical outcomes (6–8). Importantly, machine learning models have been widely used for the prediction of events in the cardiovascular field as well (9, 10). Machine learning algorithms provides new tools to overcome challenges for which traditional statistical methods are ill-suited, especially for making classification predictions using very specific sets of input features. The prediction of blood transfusion is a very suitable project for artificial intelligence, which has already been gaining sophistication with the help of various machine learning algorithms (11). Accurate prediction of intraoperative transfusion contributes to patient blood management and the prevention of the potential risks after transfusion. Patient blood management in cardiac surgery will facilitate a reduction in the misuse of allogeneic blood transfusion (12), illustrating the patients who are most likely to undergo transfusion should benefit most from disciplined blood management strategy. Therefore, it is necessary to build predictive models in cardiac surgery patients.
Herein, we attempted to establish machine learning-based algorithms in depth for predicting intraoperative RBC transfusion by incorporating preoperative data of the non-anemic adults undergoing valvular replacements and to evaluate its advantages compared with the traditional linear model, providing a new paradigm for predicting blood transfusion in cardiac surgery patients.
2 Methods
2.1 Study design and ethics statement
We enrolled the consecutive patients aged 18 to 80 years undergoing valvular replacements at Shanghai General Hospital, School of Medicine, Shanghai Jiao Tong University between January 2015 and December 2020. Exclusion criteria for this study included: (1) hemoglobin levels ≤12 g/dl for females and ≤13 g/dl for males, (2) emergency surgery, (3) massive transfusion, (4) re-operation of valvular surgery, (5) combined coronary surgery, aortic surgery, or ablation surgery, and (6) left ventricular ejection fraction (LVEF) <35%.
This retrospective study was approved by the ethics committee of Shanghai General Hospital (Approval number: 2020KY223). All the research procedures were carried out in accordance with relevant guidelines and regulations. As the data involved in this study were recruited from electronic medical records and reported without personal identifiers, the need for informed consent from the patients was waived.
2.2 Transfusion criteria
Intraoperative RBC transfusion was primarily determined by a hemoglobin level below 8.0 g/dl or hematocrit less than 25% after cardiopulmonary bypass, as per the established criterion (13). In our study, we defined massive transfusion as the administration of 5 or more units of RBCs within 4 h (14, 15). This definition is based on evidence indicating that massive hemorrhage is often linked to significant trauma, abnormal anatomy, or surgical techniques (16). The transfusion of other blood components varied among surgeons based on the extent of blood loss and intraoperative coagulation and bleeding.
2.3 Data collection
Preoperative patient characteristics were extracted from medical records, encompassing demographic information (e.g., age, gender, BMI), past medical history, and findings from laboratory tests, electrocardiogram, and echocardiography. These details also included smoking/drinking habits, comorbidities such as hypertension, diabetes, cerebrovascular disease, and atrial fibrillation, New York Heart Association (NYHA) class, LVEF, and a variety of blood tests such as serum ferritin, albumin, and platelet count.
2.4 Construction of machine learning algorithms
The patients included in this study were categorized into two groups based on whether they received intraoperative RBC transfusion (IRT), referred as the IRT group and non-IRT group respectively. The entire dataset was then randomly split into a training set and a testing set using an 8:2 ratio. Basically, the training dataset was used for feature selection. Then, a random forest (RF) algorithm-based feature selection method, Sequential Backward Selection (SBS), was used to filter the variables. In detail, firstly the RF algorithm uses feature importance as a ranking criterion. Secondly, the SBS method removes the least important feature in a full feature subset until the desired number of features is obtained. Subsequently, we measured the performance of each RF model with varying numbers of variables by plotting their accuracy. Finally, we identified the optimal number of features based on the highest model accuracy.
Then, the proposed prediction models for binary classification include the advanced ensemble learning methods, consisting of both nonlinear and linear algorithms. Nonlinear models included decision tree classifier, RF classifier, extreme gradient boosting (XGBoost) classifier, categorical boosting (CatBoost) classifier, support vector classifier (SVC). In contrast, the linear model was logistic regression (LR). Basically, the unit of a RF is the decision tree. RF consists of hundreds or thousands of decision trees and integrates all the categorical voting results, designating the category with the most votes as the final output (17). Unlike the bagging idea of RF, the boosting algorithms improve the prediction power by converting several weak learners to strong learners, such as CatBoost and XGBoost, both of which are based on the Gradient Boosting Decision Tree (18). Different from the tree-based methods, SVC finds a hyper-plane that creates a boundary between the types of data, which is a supervised machine learning model that has higher speed and better performance with a limited number of samples (19).
After developing the models, we aimed to evaluate the predictive validity of various machine learning models. To achieve this, we calculated and compared the areas under the receiver operating characteristic curve (AUC) and the accuracy of the algorithms. Moreover, we analyzed precision, which is the fraction of true positive examples among those classified as positive by the model, and recall, which is the fraction of positive examples classified correctly by the model. We also used the F1-score, defined as the harmonic mean of the model's precision and recall, to determine the model's overall accuracy on the dataset. Finally, to ensure correct interpretation of the prediction model, we employed the SHapley Additive exPlanation (SHAP) values. These values provided consistent and locally accurate attribution for each feature, allowing us to determine not only the importance of each feature but also its positive or negative impact on predictions (20). As most machine learning models are often considered as “black boxes”, SHAP values can be particularly useful in interpreting their results.
2.5 Statistical analysis
Continuous variables were expressed as either mean and standard deviation or median and interquartile range depending on whether they followed a Gaussian distribution. Categorical variables were reported as frequency and percentage. For group comparison of continuous variables, the Student's T-test was used if they followed a Gaussian distribution, while the Mann–Whitney U test was used for non-Gaussian data. The difference in the distribution of categorical variables between groups was tested using either the chi-squared test or Fisher's exact test, as appropriate.
For statistical analyses, we used SPSS software (ver. 22.0, SPSS Inc., Chicago, IL, USA). Model development was carried out using Python (ver. 3.9.6) with the packages of catboost (1.0.6), sklearn (3.0.0), shap (0.41.0), and xgboost (1.6.1). Graphics were generated using Python (ver. 3.9.6) with the package of matplotlib (3.5.2). A P-value <0.05 was considered statistically significant for a two-sided test.
3 Results
3.1 Characteristics of study population
We finally enrolled the medical records of 423 consecutive patients undergoing isolated valvular replacements from January 2015 to December 2020. The structured workflow of the patient recruitment and grouping was presented in Figure 1. The patients had a mean age of 60.1 ± 11.4 years, and 102 participants (24.1%) received intraoperative RBC transfusion. They were randomly assigned to a training set (n = 338) or a testing set (n = 85). An overview over the preoperative patient data is given in Table 1. Each individual patient had 30 preoperative clinical characteristics. The incidences of intraoperative RBC transfusion were 22.8% and 29.4% in the training set and testing set, respectively.
Figure 1. Workflow chart for recruitment in this study. LVEF, left ventricular ejection fraction; IRT, intraoperative red blood cell transfusion.
3.2 Machine learning predictive model development
A RF algorithm was applied to find the most critical variable associated with intraoperative RBC transfusion. The importance of each feature was ranked in descending order (Figure 2). The top 10 most significant preoperative feature listed in Supplementary Table S1 were hematocrit, hemoglobin, ALT, fibrinogen, ferritin, platelet count, APTT, folic acid, transferrin saturation (TAST), and vitamin B12. Afterward, all features were included in SBS one by one in order of their feature ranks. Based on the highest model accuracy (Figure 3), SBS excluded 14 features, illustrating that there were 16 important features incorporated in the subsequent machine learning models.
Figure 2. Importance matrix plot generated by the RF model to aid in feature selection. This plot provided a visual representation of the importance of each covariate and assisted in developing the final predictive model. LVEF, left ventricular ejection fraction; BMI, body mass index; NYHA, New York Heart Association; PT, prothrombin time; APTT, activated partial thromboplastin time, TT, thrombin time; ALT, alanine transaminase; eGFR, estimated glomerular filtration rate; HbA1c, hemoglobin A1c; TAST, transferrin saturation.
Figure 3. Feature selection by the Sequential Backward Selection (SBS) feature selection method. Firstly, the random forest algorithm was used to determine the importance of each feature. The features were then ranked in descending order based on their importance. Secondly, SBS was employed to remove the least important feature of the full feature subset until a new feature subspace with the desired number of features was obtained. Lastly, the optimal number of variables excluded from the model was determined by achieving maximum accuracy (indicated by the red circle). Specifically, 14 variables were deemed to be optimal for exclusion from the model.
Using solely these 16 features, we trained six different models. As Figure 4 showed, the best AUC in the testing set were yielded by CatBoost method (AUC: 0.752), followed by RF model (AUC: 0.722) and XGBoost classifier (AUC: 0.705). In addition, RF classifier got the highest accuracy (accuracy: 0.734) and recall (recall: 0.728), while CatBoost method was slightly inferior (accuracy: 0.732 and recall: 0.725). CatBoost classifier also achieved decent F1 score (F1: 0.695), making the optimal model overall. Notably, the linear model, LR, was not effective enough and achieved the lowest recall, precision and F1 score among all the learning models. The detailed parameters of other algorithms are presented in Table 2.
Figure 4. Comparison of receiver operating characteristic curves (AUCs) among machine learning models in the testing set. Among them, CatBoost displayed the most remarkable performance with the highest AUC value for predicting outcomes, demonstrating its potential in the field of predictive modeling. SVC: support vector classifier.
3.3 Machine learning-based model interpretation
To identify the features that influenced the prediction model the most, the importance of selected feature in the CatBoost algorithm was calculated (Figure 5A). The importance of each feature was listed in Supplementary Table S2. Importantly, SHAP values were established to provide accurate attribution values for each feature within the CatBoost model. Finally, we depicted the SHAP summary plot of CatBoost algorithm (Figure 5B) and the SHAP dependence plot of all the features within CatBoost model (Supplementary Figure S1) to visualize the factors that facilitated intraoperative RBC transfusion.
Figure 5. Feature importance and SHapley Additive exPlanation (SHAP) summary plot of the CatBoost model (A). The variables are ranked in decreasing order of their impact on the final model output (B). It is displayed that a dot estimation of the CatBoost model's output for each patient in the dataset. The color of each dot indicates the SHAP value of specific features; red denotes a higher SHAP value, while blue indicates a lower value. Higher SHAP values are associated with an increased risk of intraoperative red blood cell transfusion. LVEF, left ventricular ejection fraction; APTT, activated partial thromboplastin time; TT, thrombin time; ALT, alanine transaminase; TAST, transferrin saturation.
4 Discussion
Given the strong association between intraoperative RBC transfusion and postoperative complications, predicting the need for such transfusions is critical to enabling early interventions that can address potential complications and improve patient prognosis following cardiac surgery. However, few studies have been conducted to predict intraoperative transfusion in cardiac surgery. To our knowledge, we were the earliest to establish the artificial intelligence algorithm for predicting intraoperative transfusion in non-anemic cohorts. As is known, nonlinear machine learning methods have remarkable superiority only if there is no way to fit the dataset linearly (21). Thus, we established five nonlinear machine learning models and one classical statistical regression model with preoperative characteristics. As we investigated, the optimal model, CatBoost algorithm, was superior to LR with an AUC of 0.752 (CatBoost) vs. 0.666 (LR), implying intraoperative RBC transfusion is not simply linearly related to preoperative risk factors based on our dataset. Undoubtedly, with the proficiency of artificial intelligence in handling nonlinear prediction tasks, more and more machine learning models are being used to predict transfusions in a series of different diseases, offering new approaches to improve clinical outcomes (22, 23).
Currently, machine learning technology is so advanced that it has given birth to many applications of predictive tasks in the field of medicine, which has the potential to revolutionize medical strategies and paradigms (24–26). However, there are still concerns about the overuse and misuse of machine learning in clinical research (27). The prejudice of machine learning algorithm mainly originates from the biased datasets, which is more common in the medical field. When the overall event rate is very low, machine learning models can easily build predictions with high accuracy and high AUC due to the high prediction rate for negative events. Mitterecker et al. (28) developed machine learning models of transfusion prediction based on large cohort using preoperative characteristics. When the overall event rate (transfusion) reached 12.4%, the models acquired not only very high accuracy, AUC, and negative predictive values, but also satisfactory recall and precision. On the contrary, accuracy and AUC of models still preserved a very high level but the recall and precision decreased sharply, when the overall event rate (massive transfusion) was only 0.4%. Interestingly, there are many measurements of model performance, but most of them do not show positive correlation, such as recall and precision. In fact, the performance of the model is closely related to the purpose of prediction. For the prediction of clinical outcomes, a higher prediction rate for positive events represents a lower rate of missed diagnoses, even if there are false predictions of negative events that can be further ruled out by other diagnostic methods. Historically, there have also been articles on predicting transfusion in cardiac surgery that sacrifice some performance metrics, such as recall and precision, to achieve high accuracy and AUC, which may not contribute much to the successful prediction of positive events from our point of view (29, 30). Therefore, in this study, we were not pursuing high accuracy and AUC of the models, but we try to build a more balanced model with improved precision and recall. For the biased datasets, we believe that the F1 score is more convincing for the evaluation of the model. Thus, based on the combination of AUC, recall, precision and F1 score, the CatBoost algorithm was selected as the optimal model even though the other methods were just slightly worse.
Of note, machine learning models are only a collection of predictive tools, as clinicians need to understand how they work. With SHAP values, the predictive pattern of the optimal machine learning model was fundamentally understood in our study and many risk factors, including female, low hemoglobin, low ferritin, low platelets, low folic acid, hypoalbuminemia, and high HbA1c, were detected. Actually, several factors have already been found to be associated with an increased risk of perioperative transfusion, such as the female gender and preoperative anemia (31). The anemic cardiac surgery patients have been reported to be more likely to get perioperative transfusion (31–33). It is of concern that low hemoglobin also played an important role in the prediction of positive events in non-anemic cohorts according to our results. However, there is no clinical evidence for direct intervention for relatively low hemoglobin levels in non-anemic patients. Interestingly, in previous work, we identified some factors that affect intraoperative RBC transfusion using LR model, such as female, hemoglobin, and the state of iron deficiency (5). Both linear and nonlinear models confirmed iron deficiency as a risk factor for intraoperative RBC transfusion, indicating that correcting preoperative iron deficiency might contribute to reducing the risk of intraoperative transfusion. Recently, a range of study found that preoperative intravenous iron treatment would significantly increase hemoglobin level (34) and reduce allogeneic transfusion (35) in non-anemic patients with iron deficiency undergoing cardiac surgery, supporting our findings about the impact of iron deficiency on non-anemic patients. In addition, another indicator that can be corrected preoperatively was hypoalbuminemia, which was reported as a risk factor of transfusion in other kinds of surgeries (36, 37). Of note, diabetes was not included in the machine learning model, while high HbA1c was adopted as a risk factor. Indeed, long-term high blood glucose levels can have detrimental effects on vascular health, potentially leading to increased vascular fragility in patients. This heightened vulnerability to intraoperative bleeding underscores the importance of closely monitoring and actively regulating perioperative blood glucose levels. By maintaining optimal blood glucose control during the perioperative period, healthcare providers can mitigate the risk of excessive bleeding and promote better patient outcomes.
Although our study provides valuable insights into predicting intraoperative RBC transfusion, it is important to acknowledge and consider certain limitations inherent in the research design. Firstly, the design was a single-center retrospective study with a small scale. The absence of an external validation set limited the accuracy of predictions. Consequently, before the models can be applied to external datasets, their predictive performance would require training and evaluation again. Secondly, our dataset is still a biased set. Continued inclusion of more patients and more characteristics is an effective way to increase the diversity of the datasets, addressing the bias. Thirdly, the certain data included in our study consists of continuous variables, which may not provide precise guidance for subsequent clinical applications. It is suggested that the establishment of additional criteria to convert these continuous variables into categorical variables, enabling their inclusion in the machine learning model for further analysis and refinement.
5 Conclusions
In summary, we have successfully identified and selected 16 crucial preoperative predictors for predicting intraoperative RBC transfusion following admission. Our study employed a range of cutting-edge machine learning algorithms, with the CatBoost classifier demonstrating the most robust performance for risk prediction. Furthermore, by leveraging the SHAP values from the CatBoost model, our team was able to identify key risk factors for intraoperative RBC transfusion in non-anemic patients, ultimately providing new insights and recommendations for improving cardiac surgical care.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by the ethics committee of Shanghai General Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because the data involved in this study were recruited from electronic medical records and reported without personal identifiers, the need for informed consent from the patients was waived.
Author contributions
RZ: Conceptualization, Formal Analysis, Investigation, Methodology, Visualization, Writing – original draft. ZL: Data curation, Formal Analysis, Software, Visualization, Writing – original draft. JL: Data curation, Formal Analysis, Writing – original draft. DQ: Data curation, Writing – original draft. XM: Data curation, Writing – original draft. LG: Data curation, Writing – original draft. XS: Data curation, Writing – original draft. HL: Conceptualization, Methodology, Validation, Writing – review & editing. MY: Conceptualization, Funding acquisition, Supervision, Validation, Writing – review & editing.
Funding
The author(s) declare financial support was received for the research, authorship, and/or publication of this article.
This work was supported by Clinical Research Plan of Shanghai Hospital Development Center (No. SHDC 2020CR3100B).
Acknowledgments
We thank machine learning engineer, Shuxun Zhou, for the statistical guidance of this paper.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcvm.2024.1344170/full#supplementary-material
References
1. Anand IS, Gupta P. Anemia and iron deficiency in heart failure: current concepts and emerging therapies. Circulation. (2018) 138(1):80–98. doi: 10.1161/CIRCULATIONAHA.118.030099
2. Crescenzi G, Torracca L, Capestro F, Matteucci ML, Rossi M. Allogenic blood transfusion in cardiac surgery. J Card Surg. (2012) 27(5):594–9. doi: 10.1111/j.1540-8191.2012.01522.x
3. Vlot EA, Verwijmeren L, Van De Garde EMW, Kloppenburg GTL, Van Dongen EPA, Noordzij PG. Intra-operative red blood cell transfusion and mortality after cardiac surgery. BMC Anesthesiol. (2019) 19(1):65. doi: 10.1186/s12871-019-0738-2
4. Sankar A, Rotstein AJ, Teja B, Carrier FM, Belley-Cote EP, Bolliger D, et al. Prolonged mechanical ventilation after cardiac surgery: substudy of the transfusion requirements in cardiac surgery iii trial. Can J Anaesth. (2022) 69(12):1493–506. doi: 10.1007/s12630-022-02319-9
5. Zhou R, Qian D, Li H, Wang Z, Shi S, Shen F, et al. Clinical presentation and in-hospital outcomes of intraoperative red blood cell transfusion in non-anemic patients undergoing elective valve replacement. Front Cardiovasc Med. (2022) 9:1053209. doi: 10.3389/fcvm.2022.1053209
6. Tomasev N, Glorot X, Rae JW, Zielinski M, Askham H, Saraiva A, et al. A clinically applicable approach to continuous prediction of future acute kidney injury. Nature. (2019) 572(7767):116–9. doi: 10.1038/s41586-019-1390-1
7. Heo J, Yoon JG, Park H, Kim YD, Nam HS, Heo JH. Machine learning-based model for prediction of outcomes in acute stroke. Stroke. (2019) 50(5):1263–5. doi: 10.1161/STROKEAHA.118.024293
8. Dong J, Feng T, Thapa-Chhetry B, Cho BG, Shum T, Inwald DP, et al. Machine learning model for early prediction of acute kidney injury (aki) in pediatric critical care. Crit Care. (2021) 25(1):288. doi: 10.1186/s13054-021-03724-0
9. Krittanawong C, Zhang H, Wang Z, Aydar M, Kitai T. Artificial intelligence in precision cardiovascular medicine. J Am Coll Cardiol. (2017) 69(21):2657–64. doi: 10.1016/j.jacc.2017.03.571
10. Ranka S, Reddy M, Noheria A. Artificial intelligence in cardiovascular medicine. Curr Opin Cardiol. (2021) 36(1):26–35. doi: 10.1097/HCO.0000000000000812
11. Meier JM, Tschoellitsch T. Artificial intelligence and machine learning in patient blood management: a scoping review. Anesth Analg. (2022) 135(3):524–31. doi: 10.1213/ANE.0000000000006047
12. Meybohm P, Westphal S, Ravn HB, Ranucci M, Agarwal S, Choorapoikayil S, et al. Perioperative anemia management as part of pbm in cardiac surgery - a narrative updated review. J Cardiothorac Vasc Anesth. (2020) 34(4):1060–73. doi: 10.1053/j.jvca.2019.06.047
13. Carson JL, Guyatt G, Heddle NM, Grossman BJ, Cohn CS, Fung MK, et al. Clinical practice guidelines from the aabb: red blood cell transfusion thresholds and storage. JAMA. (2016) 316(19):2025–35. doi: 10.1001/jama.2016.9185
14. Stanworth SJ, Morris TP, Gaarder C, Goslings JC, Maegele M, Cohen MJ, et al. Reappraising the concept of massive transfusion in trauma. Crit Care. (2010) 14(6):R239. doi: 10.1186/cc9394
15. Zatta AJ, Mcquilten ZK, Mitra B, Roxby DJ, Sinha R, Whitehead S, et al. Elucidating the clinical characteristics of patients captured using different definitions of massive transfusion. Vox Sang. (2014) 107(1):60–70. doi: 10.1111/vox.12121
16. Petrou A, Tzimas P, Siminelakis S. Massive bleeding in cardiac surgery. Definitions, predictors and challenges. Hippokratia. (2016) 20(3):179–86.29097882
17. Roy MH, Larocque D. Prediction intervals with random forests. Stat Methods Med Res. (2020) 29(1):205–29. doi: 10.1177/0962280219829885
18. Luna JM, Gennatas ED, Ungar LH, Eaton E, Diffenderfer ES, Jensen ST, et al. Building more accurate decision trees with the additive tree. Proc Natl Acad Sci U S A. (2019) 116(40):19887–93. doi: 10.1073/pnas.1816748116
19. Rodriguez-Perez R, Bajorath J. Evolution of support vector machine and regression modeling in chemoinformatics and drug discovery. J Comput Aided Mol Des. (2022) 36(5):355–62. doi: 10.1007/s10822-022-00442-9
20. Lundberg SM, Nair B, Vavilala MS, Horibe M, Eisses MJ, Adams T, et al. Explainable machine-learning predictions for the prevention of hypoxaemia during surgery. Nat Biomed Eng. (2018) 2(10):749–60. doi: 10.1038/s41551-018-0304-0
21. Couronne R, Probst P, Boulesteix AL. Random forest versus logistic regression: a large-scale benchmark experiment. BMC Bioinformatics. (2018) 19(1):270. doi: 10.1186/s12859-018-2264-5
22. Liu LP, Zhao QY, Wu J, Luo YW, Dong H, Chen ZW, et al. Machine learning for the prediction of red blood cell transfusion in patients during or after liver transplantation surgery. Front Med (Lausanne). (2021) 8:632210. doi: 10.3389/fmed.2021.632210
23. Zhang J, Jiang L, Zhu X. A machine learning-modified novel nomogram to predict perioperative blood transfusion of total gastrectomy for gastric cancer. Front Oncol. (2022) 12:826760. doi: 10.3389/fonc.2022.826760
24. Rush B, Stone DJ, Celi LA. From big data to artificial intelligence: harnessing data routinely collected in the process of care. Crit Care Med. (2018) 46(2):345–6. doi: 10.1097/CCM.0000000000002892
25. Greener JG, Kandathil SM, Moffat L, Jones DT. A guide to machine learning for biologists. Nat Rev Mol Cell Biol. (2022) 23(1):40–55. doi: 10.1038/s41580-021-00407-0
26. Deo RC. Machine learning in medicine. Circulation. (2015) 132(20):1920–30. doi: 10.1161/CIRCULATIONAHA.115.001593
27. Volovici V, Syn NL, Ercole A, Zhao JJ, Liu N. Steps to avoid overuse and misuse of machine learning in clinical research. Nat Med. (2022) 28(10):1996–9. doi: 10.1038/s41591-022-01961-6
28. Mitterecker A, Hofmann A, Trentino KM, Lloyd A, Leahy MF, Schwarzbauer K, et al. Machine learning-based prediction of transfusion. Transfusion. (2020) 60(9):1977–86. doi: 10.1111/trf.15935
29. Liu S, Zhou R, Xia XQ, Ren H, Wang LY, Sang RR, et al. Machine learning models to predict red blood cell transfusion in patients undergoing mitral valve surgery. Ann Transl Med. (2021) 9(7):530. doi: 10.21037/atm-20-7375
30. Tschoellitsch T, Bock C, Mahecic TT, Hofmann A, Meier J. Machine learning-based prediction of massive perioperative allogeneic blood transfusion in cardiac surgery. Eur J Anaesthesiol. (2022) 39(9):766–73. doi: 10.1097/EJA.0000000000001721
31. Task Force on Patient Blood Management for Adult Cardiac Surgery of the European Association for Cardio-Thoracic S, The European Association of Cardiothoracic A, Boer C, Meesters MI, Milojevic M, Benedetto U, et al. 2017 EACTS/EACTA guidelines on patient blood management for adult cardiac surgery. J Cardiothorac Vasc Anesth. (2018) 32(1):88–120. doi: 10.1053/j.jvca.2017.06.026
32. Ranucci M, Aronson S, Dietrich W, Dyke CM, Hofmann A, Karkouti K, et al. Patient blood management during cardiac surgery: do we have enough evidence for clinical practice? J Thorac Cardiovasc Surg. (2011) 142(2):249.e1–32. doi: 10.1016/j.jtcvs.2011.04.007
33. Hung M, Besser M, Sharples LD, Nair SK, Klein AA. The prevalence and association with transfusion, intensive care unit stay and mortality of pre-operative anaemia in a cohort of cardiac surgery patients. Anaesthesia. (2011) 66(9):812–8. doi: 10.1111/j.1365-2044.2011.06819.x
34. Houston BL, Hurrie D, Graham J, Perija B, Rimmer E, Rabbani R, et al. Efficacy of iron supplementation on fatigue and physical capacity in non-anaemic iron-deficient adults: a systematic review of randomised controlled trials. BMJ Open. (2018) 8(4):e019240. doi: 10.1136/bmjopen-2017-019240
35. Spahn DR, Schoenrath F, Spahn GH, Seifert B, Stein P, Theusinger OM, et al. Effect of ultra-short-term treatment of patients with iron deficiency or anaemia undergoing cardiac surgery: a prospective randomised trial. Lancet. (2019) 393(10187):2201–12. doi: 10.1016/S0140-6736(18)32555-8
36. Kim K, Seo H, Chin JH, Son HJ, Hwang JH, Kim YK. Preoperative hypoalbuminemia and anemia as predictors of transfusion in radical nephrectomy for renal cell carcinoma: a retrospective study. BMC Anesthesiol. (2015) 15:103. doi: 10.1186/s12871-015-0089-6
Keywords: intraoperative transfusion, machine learning algorithm, prediction, non-anemic, valve replacement
Citation: Zhou R, Li Z, Liu J, Qian D, Meng X, Guan L, Sun X, Li H and Yu M (2024) Prediction of intraoperative red blood cell transfusion in valve replacement surgery: machine learning algorithm development based on non-anemic cohort. Front. Cardiovasc. Med. 11:1344170. doi: 10.3389/fcvm.2024.1344170
Received: 25 November 2023; Accepted: 20 February 2024;
Published: 29 February 2024.
Edited by:
Yongfeng Shao, Nanjing Medical University, ChinaReviewed by:
Antonino S. Rubino, University of Campania Luigi Vanvitelli, ItalyWei He, Southeast University, China
© 2024 Zhou, Li, Liu, Qian, Meng, Guan, Sun, Li and Yu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Min Yu minyudr@163.com Haiqing Li drlihaiqing@163.com
†These authors have contributed equally to this work