- Department of Pediatrics, Tsugaruhoken Medical COOP Kensei Hospital, Hirosaki, Japan
We constructed an optimal machine learning (ML) method for predicting intravenous immunoglobulin (IVIG) resistance in children with Kawasaki disease (KD) using commonly available clinical and laboratory variables. We retrospectively collected 98 clinical records of hospitalized children with KD (2–109 months of age). We found that 20 (20%) children were resistant to initial IVIG therapy. We trained three ML techniques, including logistic regression, linear support vector machine, and eXtreme gradient boosting with 10 variables against IVIG resistance. Moreover, we estimated the predictive performance based on nested 5-fold cross-validation (CV). We also selected variables using the recursive feature elimination method and performed the nested 5-fold CV with selected variables in a similar manner. We compared ML models with the existing system regardless of their predictive performance. Results of the area under the receiver operator characteristic curve were in the range of 0.58–0.60 in the all-variable model and 0.60–0.75 in the select model. The specificities were more than 0.90 and higher than those in existing scoring systems, but the sensitivities were lower. Three ML models based on demographics and routine laboratory variables did not provide reliable performance. This is possibly the first study that has attempted to establish a better predictive model. Additional biomarkers are probably needed to generate an effective prediction model.
Introduction
In developed countries, Kawasaki disease (KD) is the major cause of acquired heart disease in children (1). The main complication of KD is coronary artery abnormality (CAA) due to systemic vasculitis (1). The effectiveness of high-dose intravenous immunoglobulin (IVIG) therapy has been established as an initial KD treatment (2). However, approximately 10–20% children with KD are refractory to this treatment and develop persistent or recurrent fever after initial IVIG therapy (3, 4). IVIG resistance is a risk factor for the occurrence of CAA (5). Moreover, the development of a more effective treatment options has been challenging. The American Heart Association has reported that patients who were predicted to be at a high risk for development of CAA may benefit from primary adjunctive therapy such as IVIG and corticosteroids (2). Therefore, developing a reliable tool for predicting IVIG resistance is important to reduce the occurrence of CAA.
Several scoring systems (6–12) have been proposed. However, the predictive capacity of the existing scoring systems may not be sufficient, and some scoring systems have poor predictive performance for external datasets (13–15). Machine learning (ML) techniques have been applied to clinical diagnosis and prognosis prediction in many fields of medicine (16). To the best of our knowledge, few studies have applied ML methods for predicting resistance to initial IVIG therapy in patients with KD (17). We aimed to construct an optimal ML method for predicting IVIG resistance in children with KD using commonly available clinical and laboratory variables.
Materials and Methods
Patients and Data Collection
We retrospectively collected clinical records of patients with KD who were diagnosed based on the Japanese diagnostic guidelines for KD (18) and hospitalized at Tsugaruhoken Medical COOP Kensei Hospital between January 2010 and October 2019. Patients diagnosed with KD presented with minimum five of the six major symptoms, including fever. Patients with only four or less major symptoms and those with CAA were not included. We excluded children who received initial IVIG treatment ≥10 days after the onset and children administered initial doses of <2 g/kg/day. We defined the first illness day as the first day on which a patient had fever. We defined a responder as a patient whose temperature had decreased to <37.5°C within 36 h after initial IVIG treatment (9, 15).
We collected the following data before the initial IVIG treatment: months of age, gender, illness days with IVIG administration, white blood cell count (WBC), neutrophil percentage, hematocrit (Ht), platelet count (PLT), aspartate aminotransferase (AST), alanine aminotransferase (ALT), total bilirubin (TBil), and sodium (Na), albumin (Alb), and C-reactive protein (CRP) levels. All these variables were available before treatment.
We defined coronary arteries as abnormal when the luminal diameters were more than 3.0 mm in children younger than 5 years or more than 4.0 mm in those 5 years and older, when the internal diameter of a segment was 1.5 times or greater than that of an adjacent segment, or when the luminal contour was evidently irregular (19). We recorded the maximum coronary artery diameter within 1 month after the onset of the disease.
Statistical Analysis
We performed statistical analyses using Python version 3.6 (Python Software Foundation). We applied Mann–Whitney U-tests for continuous variables and Chi-square tests for categorical variables.
We evaluated the predictive performance of the three supervised ML classifiers and existing scoring systems. We trained logistic regression (LR) with L2 regularization, linear support vector machine (SVM), and eXtreme gradient boosting (XGB) models to predict IVIG resistance, using scikit-learn and XGBoost packages. We evaluated the predictive performance based on sensitivity, specificity, and area under the receiver operator characteristic curve (AUC). We produced three ML models with 10 variables that did not contain missing values (months of age, gender, illness days with IVIG administration, WBC, Ht, PLT, AST, Na, Alb, and CRP).
To evaluate the predictive performance of the three ML models and algorithms, we used the nested 5-fold cross-validation (CV) approach (20) with GridSearchCV for hyper-parameter optimization. We applied a nested CV procedure to estimate an unbiased generalization performance of ML algorithms (21). The two CV cycles included an inner loop for tuning hyper-parameters and outer loop for estimating performance in nested CV (Figure 1). First, the original dataset was divided into five data folds with approximately equal numbers of respondent and non-respondent cases. One data fold was reserved for test fold. The remaining four data folds (training folds) were passed to the inner loop. The inner loop performed 5-fold CVs to identify the best hyper-parameter combination. We selected the hyper-parameter combination that maximized each performance metrics over all steps of the inner loop. In LR and linear SVM models, the penalty parameter C was explored in [0.01, 0.1, 1, 10, and 100]. In XGB model, the maximum depth of a tree (max_depth), the minimum sum of instance weight needed in a child (min_child_weight), and gamma were explored in [3, 5], [1, 2, 3], and [0, 3, 10], respectively. For the other hyper-parameters, we used default values of the scikit-learn method. These were tuned by testing all possible hyper-parameter combinations in the inner CV. We then trained our model on training folds using the best hyper-parameter combination; thereafter, we evaluated model performance on the test fold. This process was repeated five times, once for each iteration of the outer loop. Finally, we calculated the average performance over 5-folds. We also repeated nested CVs 10 times in separate splits and derived the average of the results to avoid sampling bias and data overfitting.
Additionally, we selected variables using the recursive feature elimination method. Then, we performed a nested 5-fold CV with selected variables in a similar manner. In all, we have constructed and then evaluated two types of models: all-variable model and select-variable model.
Results
Characteristics of Patients
We collected data from 109 children with KD treated at our hospital. We excluded data from 11 children because 9 children had received initial IVIG at <2 g/kg/day and 4 had received initial IVIG treatment ≥10 days after the onset of the disease. Consequently, we statistically analyzed data from 98 children aged 2–109 months. Table 1 summarizes the demographic and laboratory data of patients. Among them, 20 (20%) children were resistant to the initial IVIG therapy. Only the AST and ALT levels were significantly higher in the IVIG-responsive group than in the IVIG resistant group. The proportion of CAA in the IVIG resistant group was higher than that in the IVIG-responsive group.
Table 1. Comparison of clinical and laboratory characteristics in IVIG-responsive and -resistant patients.
Predictive Performance of the ML Model
As shown in Table 2, the AUCs of the all-variable models were 0.58–0.60 in all models, and those of the select-variable models were 0.60–0.75. The results on specificity and accuracy were 0.94–0.99 and 0.78–0.79 in the all-variable models, and 0.96–1.00 and 0.78–0.80 in the select-variable models. The results of specificity and accuracy were high, but those on sensitivity were all lower.
Discussion
We retrospectively evaluated the performances of three ML models to predict the resistance to initial IVIG therapy in a single-center pediatric population of KD. Our results revealed that the three ML models based on demographics and routine laboratory variables did not perform reliably.
Different clinical scoring systems have been established to predict IVIG resistance, including those by Kobayashi et al. (6), Egami et al. (7), and Sano et al. (8) in Japan. The sensitivities and specificities of those systems were reported to be 0.76–0.78 and 0.76–0.86 in the original studies. However, almost all clinical scores published are limited in their predictive capacity. Similar predictive accuracies were not achieved in other populations (13–15). As shown in Table 2, the existing scoring systems also did not achieve a good prediction against our dataset.
Additional clinical information may be needed to improve the prediction model. Owing to the similarity of each clinical and laboratory characteristics between IVIG-responsive and -resistant patients in the current dataset, neither our model nor the existing model may have performed reliably. There may be a need to construct and evaluate new models that also incorporate clinical major symptoms (10) and/or other laboratory data such as erythrocyte sedimentation rate (10) or N-terminal pro-brain natriuretic peptide (22).
Our prediction models using three ML techniques have equally less reliable performance as the existing scoring systems; particularly, the sensitivity were low in all ML algorithms. Our results serve as a first step to establish a good prediction tool. Feature engineering or ensemble learning, which combines several ML techniques into one predictive model, may help improve performance. Alternatively, ML models have advantages over the existing prediction scoring systems. The predictive performances of scoring systems could differ depending on countries or ethnicities (11, 13, 23). ML is flexible and can be suitable for many tasks. Therefore, the ML approach makes it easy for the model to retrain and update the using the newest data.
To our best knowledge, this is the first study to compare the performances of ML methods for predicting IVIG resistance. There is a study which was designed to develop the prediction model using random forest (17). However, validation procedures were not conducted, though the performance was excellent. Conversely, there are certain limitations. First, the dataset was relatively small. However, we used nested CVs to obtain unbiased estimates of the true error. We also repeated the nested CVs 10 times and averaged the validation error to reduce sampling bias. Nested CV can choose the classification model by obtaining reliable classification performance and avoiding overfitting (24). Second, the present study was conducted based on dataset derived from a single center. Accordingly, our results may not apply to other populations. However, we consider it meaningful to rebuild the model, similarly using the center's original data. Third, this is a retrospective study. We need to perform a combined analysis of three ML models on a prospective basis.
In conclusion, we evaluated the performance of ML models for predicting resistance to IVIG therapy in children with KD. However, our three ML models based only on demographics and routine laboratory variables did not provide reliable performances. Further studies are needed to improve predictive models. Additional biomarkers are likely to be needed to generate an effective prediction model.
Data Availability Statement
The raw data supporting the conclusions of this article will be made available by the authors, Yasutaka Kuniyoshi, upon reasonable request.
Ethics Statement
The studies involving human participants were reviewed and approved by Ethics Committee of Tsugaruhoken Medial COOP Kensei Hospital. Written informed consent from the participants' legal guardian/next of kin was not required to participate in this study in accordance with the national legislation and the institutional requirements.
Author Contributions
YK designed the study, drafted the manuscript, performed the statistical analysis, and interpreted the results. All authors have read and approved the final manuscript and contributed to data collection.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
The authors would like to thank Enago (www.enago.jp) for the English language review.
Abbreviations
Alb, albumin; ALT, alanine aminotransferase; AST, aspartate aminotransferase; AUC, area under the receiver operator characteristic curve; CAA, coronary artery abnormality; CRP, C-reactive protein; CV, cross-validation; Ht, hematocrit; IVIG, intravenous immunoglobulin; KD, Kawasaki disease; LR, logistic regression; ML, machine learning; Na, sodium; PLT, platelet count; SVM, support vector machine; TBil, total bilirubin; WBC, white blood cell count; XGB, eXtreme gradient boosting.
References
1. Gordon JB, Kahn AM, Burns JC. When children with Kawasaki disease grow up: myocardial and vascular complications in adulthood. J Am Coll Cardiol. (2009) 54:1911–20. doi: 10.1016/j.jacc.2009.04.102
2. McCrindle BW, Rowley AH, Newburger JW, Burns JC, Bolger AF, Gewitz M, et al. Diagnosis, treatment, and long-term management of Kawasaki disease: a scientific statement for health professionals from the American Heart Association. Circulation. (2017) 135:e927–99. doi: 10.1161/CIR.0000000000000484
3. Bar-Meir M, Kalisky I, Schwartz A, Somekh E, Tasher D, Israeli Kawasaki Group. Prediction of resistance to intravenous immunoglobulin in children with Kawasaki disease. J Pediatr Infect Dis Soc. (2018) 7:25–9. doi: 10.1093/jpids/piw075
4. Kido S, Ae R, Kosami K, Matsubara Y, Makino N, Sasahara T, et al. Seasonality of i.v. immunoglobulin responsiveness in Kawasaki disease. Pediatr Int. (2019) 61:539–43. doi: 10.1111/ped.13863
5. Kibata T, Suzuki Y, Hasegawa S, Matsushige T, Kusuda T, Hoshide M, et al. Coronary artery lesions and the increasing incidence of Kawasaki disease resistant to initial immunoglobulin. Int J Cardiol. (2016) 214:209–15. doi: 10.1016/j.ijcard.2016.03.017
6. Kobayashi T, Inoue Y, Takeuchi K, Okada Y, Tamura K, Tomomasa T, et al. Prediction of intravenous immunoglobulin unresponsiveness in patients with Kawasaki disease. Circulation. (2006) 113:2606–12. doi: 10.1161/CIRCULATIONAHA.105.592865
7. Egami K, Muta H, Ishii M, Suda K, Sugahara Y, Iemura M, et al. Prediction of resistance to intravenous immunoglobulin treatment in patients with Kawasaki disease. J Pediatr. (2006) 149:237–40. doi: 10.1016/j.jpeds.2006.03.050
8. Sano T, Kurotobi S, Matsuzaki K, Yamamoto T, Maki I, Miki K, et al. Prediction of non-responsiveness to standard high-dose gamma-globulin therapy in patients with acute Kawasaki disease before starting initial treatment. Eur J Pediatr. (2007) 166:131–7. doi: 10.1007/s00431-006-0223-z
9. Tang Y, Yan W, Sun L, Huang J, Qian W, Ding Y, et al. Prediction of intravenous immunoglobulin resistance in Kawasaki disease in an East China population. Clin Rheumatol. (2016) 35:2771–6. doi: 10.1007/s10067-016-3370-2
10. Li X, Chen Y, Tang Y, Ding Y, Xu Q, Sun L, et al. Predictors of intravenous immunoglobulin-resistant Kawasaki disease in children: a meta-analysis of 4442 cases. Eur J Pediatr. (2018) 177:1279–92. doi: 10.1007/s00431-018-3182-2
11. Yang S, Song R, Zhang J, Li X, Li C. Predictive tool for intravenous immunoglobulin resistance of Kawasaki disease in Beijing. Arch Dis Child. (2019) 104:262–7. doi: 10.1136/archdischild-2017-314512
12. Tan XH, Zhang XW, Wang XY, He XQ, Fan C, Lyu TW, et al. A new model for predicting intravenous immunoglobin-resistant Kawasaki disease in Chongqing: a retrospective study on 5277 patients. Sci Rep. (2019) 9:1722. doi: 10.1038/s41598-019-39330-y
13. Sleeper LA, Minich LL, McCrindle BM, Li JS, Mason W, Colan SD, et al. Evaluation of Kawasaki disease risk-scoring systems for intravenous immunoglobulin resistance. J Pediatr. (2011) 158:831–5.e3. doi: 10.1016/j.jpeds.2010.10.031
14. Song R, Yao W, Li X. Efficacy of four scoring systems in predicting intravenous immunoglobulin resistance in children with Kawasaki disease in a children's hospital in Beijing, North China. J Pediatr. (2017) 184:120–4. doi: 10.1016/j.jpeds.2016.12.018
15. Qian W, Tang Y, Yan W, Sun L, Lv H. A comparison of efficacy of six prediction models for intravenous immunoglobulin resistance in Kawasaki disease. Ital J Pediatr. (2018) 44:33. doi: 10.1186/s13052-018-0475-z
16. Rajkomar A, Dean J, Kohane I. Machine learning in medicine. N Engl J Med. (2019) 380:1347–58. doi: 10.1056/NEJMra1814259
17. Takeuchi M, Inuzuka R, Hayashi T, Shindo T, Hirata Y, Shimizu N, et al. Novel risk assessment tool for immunoglobulin resistance in Kawasaki disease: application using a random forest classifier. Pediatr Infect Dis J. (2017) 36:821–6. doi: 10.1097/INF.0000000000001621
18. Ayusawa M, Sonobe T, Uemura S, Ogawa S, Nakamura Y, Kiyosawa N, et al. Revision of diagnostic guidelines for Kawasaki disease (the 5th revised edition). Pediatr Int. (2005) 47:232–4. doi: 10.1111/j.1442-200x.2005.02033.x
19. Kobayashi T, Saji T, Otani T, Takeuchi K, Nakamura T, Arakawa H, et al. Efficacy of immunoglobulin plus prednisolone for prevention of coronary artery abnormalities in severe Kawasaki disease (RAISE study): a randomised, open-label, blinded-endpoints trial. Lancet. (2012) 379:1613–20. doi: 10.1016/S0140-6736(11)61930-2
20. Masino AJ, Harris MC, Forsyth D, Ostapenko S, Srinivasan L, Bonafide CP, et al. Machine learning models for early sepsis recognition in the neonatal intensive care unit using readily available electronic health record data. PLoS ONE. (2019) 14:e0212665. doi: 10.1371/journal.pone.0212665
21. Varma S, Simon R. Bias in error estimation when using cross-validation for model selection. BMC Bioinformatics. (2006) 7:91. doi: 10.1186/1471-2105-7-91
22. Shao S, Luo C, Zhou K, Hua Y, Wu M, Liu L, et al. The role of age-specific N-terminal pro-brain natriuretic peptide cutoff values in predicting intravenous immunoglobulin resistance in Kawasaki disease: a prospective cohort study. Pediatr Rheumatol. (2019) 17:65. doi: 10.1186/s12969-019-0368-8
23. Sánchez-Manubens J, Antón J, Bou R, Iglesias E, Calzada-Hernandez J, Borlan S, et al. Role of the Egami score to predict immunoglobulin resistance in Kawasaki disease among a western Mediterranean population. Rheumatol Int. (2016) 36:905–10. doi: 10.1007/s00296-016-3499-y
Keywords: area under the curve, extreme gradient boosting, support vector machine, logistic regression, nested cross-validation, predictive model
Citation: Kuniyoshi Y, Tokutake H, Takahashi N, Kamura A, Yasuda S and Tashiro M (2020) Comparison of Machine Learning Models for Prediction of Initial Intravenous Immunoglobulin Resistance in Children With Kawasaki Disease. Front. Pediatr. 8:570834. doi: 10.3389/fped.2020.570834
Received: 09 June 2020; Accepted: 09 November 2020;
Published: 03 December 2020.
Edited by:
Caroline Galeotti, Bicêtre Hospital, FranceReviewed by:
Ozgur Kasapcopur, Istanbul University-Cerrahpasa, TurkeyMarie-Louise Frémond, Institut Imagine, France
Copyright © 2020 Kuniyoshi, Tokutake, Takahashi, Kamura, Yasuda and Tashiro. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Yasutaka Kuniyoshi, eWFzdXRha2FrdW5peW9zaGkmI3gwMDA0MDt5YWhvby5jby5qcA==