- 1Department of Ophthalmology, Pusan National University Hospital, Busan, Republic of Korea
- 2Biomedical Research Institute, Pusan National University Hospital, Busan, Republic of Korea
- 3Medical Research Institute, Pusan National University, Busan, Republic of Korea
- 4Department of Ophthalmology, School of Medicine, Pusan National University, Busan, Republic of Korea
Introduction: The primary treatment for active thyroid eye disease (TED) is immunosuppressive therapy with intravenous steroids. In this study, we attempted to predict responsiveness to steroid treatment in TED patients using eXtreme Gradient Boosting (XGBoost). Factors associated with steroid responsiveness were also statistically evaluated.
Methods: Clinical characteristics and laboratory results of 89 patients with TED who received steroid treatment were retrospectively reviewed. XGBoost was used to explore responsiveness to steroid treatment, and the diagnostic performance was evaluated. Factors contributing to the model output were investigated using the SHapley Additive exPlanation (SHAP), and the treatment response was investigated statistically using SPSS software.
Results: The eXtra Gradient Boost model showed high performance, with an excellent accuracy of 0.861. Thyroid-stimulating hormone, thyroid-stimulating immunoglobulin (TSI), and low-density lipoprotein (LDL) cholesterol had the highest impact on the model. Multivariate logistic regression analysis showed that less extraocular muscle limitation and high TSI levels were associated with a high risk of poor intravenous methylprednisolone treatment response. As a result of analysis through SHAP, TSH, TSI, and LDL had the highest impact on the XGBoost model
Conclusion: TSI, extraocular muscle limitation, and LDL cholesterol levels may be useful in predicting steroid treatment response in patients with TED. In terms of machine learning, XGBoost showed relatively robust and reliable results for small datasets. The machine-learning model can assist in decision-making for further treatment of patients with TED.
1 Introduction
Thyroid eye disease (TED) is a condition in which an immunological response to soft tissue in orbit creates severe patient discomfort, necessitating anti-inflammatory treatment if necessary. The current understanding of the underlying mechanism suggests that the interactions between the autoimmune response and thyroid-stimulating hormone (TSH) receptor-expressing orbital fibroblasts in orbit are the primary pathogenesis of TED. Recent research indicates that insulin-like growth factor-1 receptors play a synergistic role in TSH receptor (TSHR)-initiated signaling, in addition to TSHR activation (1, 2).
Immunosuppressive treatment with intravenous steroids has been the major treatment for active TED, and pulsed intravenous methylprednisolone has been shown to be effective. Radiation and immune modulators, such as rituximab and teprotumumab, can be employed to manage TED (3). However, as recommended by a recent consensus statement issued by the European Group on Graves Orbitopathy (EUGOGO), intravenous methylprednisolone (IVMP) remains the first-line treatment for moderate-to-severe and active TED (4).
According to previous research, up to 20%–25% of clinically active, moderate-to-severe TED patients may not respond to steroids and/or relapse after treatment discontinuation (5). The clinical activity score (CAS), TSHR antibody (TRAb) level, triglyceride level, and disease duration are reportedly associated with responsiveness to IVMP treatment (6, 7). However, the effect of these factors on IVMP remains controversial. Furthermore, systemic steroid medication may be advantageous in patients with TED; however, it should be used with caution because of possible systemic side effects (4). Moreover, the mechanism of action of high-dose steroids is unknown and must be weighed against its harmful effects (8). Therefore, predicting the steroid response in patients should be strongly considered.
In this study, we used eXtreme Gradient Boosting (XGBoost), a scalable machine-learning system for tree boosting, to predict steroid treatment responses in patients with TED. The XGBoost system is an open-source package. The usefulness and superiority of the system have been widely recognized in several machine-learning challenges (9). It uses a gradient-boosting framework to create machine learning with excellent efficiency, flexibility, and probability (9). To improve our model, we applied the synthetic minority oversampling technique (SMOTE) and feature selection (FS) to detect essential features for classification. Finally, the FS-based XGBoost model proposed the feature importance associated with responsiveness in detail with SHapley Additive exPlanations (SHAP).
To the best of our knowledge, this is the first time that the gradient boosting machine, the XGBoost system, has been used in this field. According to this study, despite the small amount of data utilized for training, the FS-based XGBoost model showed stable and high performance. Factors related to steroid responsiveness were investigated using univariate and multivariate statistical methods.
2 Methods
2.1 Patient recruitment
This retrospective study was conducted in accordance with the principles of the Declaration of Helsinki. This study was approved by the Institutional Review Board (IRB) of Pusan National University Hospital (IRB No. 2112-006-109), South Korea. Owing to the retrospective nature of the study, the IRB waived the need for patient consent.
In this study, patients with TED were enrolled at the Oculoplasty Clinic of Pusan National University Hospital between March 1, 2016 and December 31, 2020. When both eyes of a participant were eligible, the more severe eye was chosen for inclusion and data analysis.
Among 287 patients diagnosed with TED, 89 were enrolled and divided into two groups. One group showed responsiveness to steroid treatment (responsive) and the other showed no response to steroid treatment (unresponsive). Patients who were enrolled in this study satisfied all the following conditions: (1) a TED diagnosis based on the EUGOGO consensus and (2) patients who were in the active phase and moderate to severe category based on the CAS. The exclusion criteria were as follows: (1) treatment with other immunosuppressive therapies, decompression surgery, or radiotherapy within the previous 3 months or during IVMP therapy; (2) not having completed the full course of the IVMP treatment regimen; (3) incomplete ophthalmic assessment data and/or essential laboratory tests including free T4, T3, TSH, TRAb, and thyroid-stimulating immunoglobulin (TSI); and (4) patients with a previous medical history of glaucoma, diabetic retinopathy, maculopathy, or strabismus.
The treatment protocol was as follows: IVMP was administered by an endocrinologist for 12 weeks. MP was injected weekly on the same day at a 0.5-g dose and a 0.25-g dose for the remaining 6 weeks. We monitored liver function and blood glucose levels regularly. Anti-thyroid drugs or thyroxine were used to restore and maintain euthyroidism. Individuals who smoked were advised to abstain from smoking.
2.2 Data collection and outcome evaluation
One ophthalmologist in the oculoplastic division described all ophthalmic examinations in the medical records. Two investigators in the oculoplastic division analyzed the electronic medical records of these patients to determine their eligibility for the study. Demographic and biochemical data and additional pertinent clinical data were evaluated. TRAb levels were measured using the third-generation thyrotropin-binding inhibitor immunoglobulin assay, which inhibits the binding of labeled thyroid-stimulating autoantibody (TSAb) (monoclonal Ab clone #M22) to the TSH receptor. The TSI was measured using a thyroid-stimulating immunoglobulin (TSI) bioassay, which measures cyclic adenosine monophosphate production after TSAb binds to the TSH receptor. Smoking status was categorized as never smoked, ex-smoker, or current smoker. In the statistical analysis, we classified ex-smokers and never-smokers as nonsmokers.
The presence of at least two of the following five ophthalmic parameters was defined as the response after full doses of IVMP treatment: 1) a reduction of at least 2 mm in proptosis; 2) a reduction of CAS by at least 2 points or <3/7; 3) an increase in visual acuity of at least one Snellen line; 4) improvement in diplopia (decrease in Gorman degree); and 5) no recurrence or additional radiation therapy at least 6 months after IVMP treatment. Based on these assessments, we divided the patients into “responsive” and “unresponsive groups.” Patients who received additional radiation therapy after IVMP treatment or decompression surgery during IVMP treatment were classified as “unresponsive.” Detailed patient recruitment flow charts are illustrated in Figure 1.
Figure 1 Flow chart of the data extraction. P.O., per oral; TED, thyroid eye disease; IV, intravenous; 3G, 3 grams.
2.3 Data preparation: SMOTE and FS
We applied the synthetic minority oversampling technique and FS method to overcome the shortage of datasets and minimize computational complexity. We constructed a supervised machine-learning classification model using XGBoost in the following manner: 1) columns in the dataset were selected using the statistical hypothesis test (categorical variable: chi-squared or the Fisher exact test; continuous data: t-test or the Mann–Whitney U test). This eliminates redundant and irrelevant features to simplify the training model. 2) Next, FS was performed based on feature importance using a combination of random forest and XGBoost. Random forest and XGBoost FS methods have been applied in various studies and have shown good performance (10–12). We collected 17 top-ranked features in descending order of importance to describe responsiveness. Data preparation is performed by reducing data dimensionality and creating a model with only important features. 3) Finally, we applied SMOTE to increase the number of under-presented cases, as reported in another study (13). SMOTE takes each data point from a minority class and generates new members along a line that connects them to their k-nearest neighbors (13, 14). We did not employ data normalization and one-hot encoding in our model to prevent performance degradation (9).
In our experiments, the scikit-learn machine-learning framework was used to implement the XGBoost. During the 10-fold cross-validation, we optimized the parameters of the model. The hyperparameter values selected in our model are as follows: booster: gbtree; n_estimators: 300; max depth: 5; loss function binary: logistic; subsample = 1; lambda = 0.7; alpha = 0.15; and learning rate = 0.03. A flowchart of the experimental design is shown in Figure 2.
Figure 2 Flow chart of the hybrid feature selection to construct the final XGBoost predictive model.
2.4 XGBoost classifier
We underwent training for the state-of-the-art scalable boosting machine-learning method called XGBoost (9). Python language version 3.8 was used, and the hardware environment was an AMD Ryzen 7 8-core processor with 32 GB RAM and GeForce 2070Ti video cards (NVIDIA, Santa Clara, CA, USA). XGBoost is a tree-boosting technique that is highly scalable and accurate. Traditional gradient boosting incorporates the following components: (1) a regularization term into the objective function, (2) an approximate split using a weighted quantile sketch, (3) a sparsity-aware split function for parallel tree learning, and (4) out-of-core computation and cache-aware learning.
All participants were randomly assigned to either the training set (80%, 71 of 89 patients) or the test set (20%, 18 of 89 patients). The training dataset was not large, but XGBoost had significant adaptations to reduce overfitting and was applied to a wide range of issues. Other regularization approaches, such as shrinkage and column subsampling, were also included. Shrinkage reduces the impact of individual trees while allowing subsequent trees to improve the model. Column subsampling was initially widely utilized in random forests, although it has never been employed for tree boosting (15). According to feedback, column sub-sampling was applied, preventing overfitting compared to row sub-sampling. Furthermore, it also accelerates the parallel technique detailed in the subsequent sections of this paper.
2.5 Model evaluation and interpretation
The CV mean accuracy was used to evaluate the prediction performance of our model. To interpret the results and importance of each feature, we applied SHAP. This allowed us to provide a powerful and insightful measure of the relevance of a feature in a model. XGBoost model was compared with other machine learning algorithms and linear regression model which is traditional statistical model. We compared the XGboost model before and after the SMOTE statistically with CV mean accuracies.
2.6 Statistical analysis
The Kolmogorov–Smirnov test was used to determine the normality of the data distribution. Continuous data were compared using Student’s t-test or Mann–Whitney U test. Categorical variables were compared using the chi-square test or Fisher’s exact test. Univariate logistic regression analysis was performed to determine the significance of each variable. Significant variables identified in the univariate analysis were then subjected to multivariate logistic regression analysis to identify the independent factors associated with responsiveness to IVMP treatment. Statistical Package for the Social Sciences (SPSS) software (version 22.0; SPSS, Chicago, IL, USA) was used for all statistical analyses. Statistical significance was set at P<0.05.
3 Results
3.1 Patients characteristics
All variables, including the demographic, clinical, and biochemical characteristics, are presented in Table 1. Among the 89 active, moderate-to-severe TED patients who were administered 4.5 g of methylprednisolone intravenously, 71 (79.8%) were included in the responsive group, and 18 (20.1%) were included in the responsive and unresponsive groups. When comparing never smokers to ex-smokers in the responsive group, the response rate was significantly higher in former smokers than in never smokers (P = 0.029, 100% vs. 71.2%). When we classified ex-smokers as never smokers, there was no difference in responsiveness between never-smokers and current smokers (P = 0.377) (data not shown). CAS was higher in the unresponsive group, but the difference was not statistically significant (P = 0.065), and extraocular movement limitation was more severe in the responsive group (P = 0.054). Overall, there were no significant differences between the two groups, except for the smoking status.
3.2 Results of the univariate and multivariate analysis
The summary statistics of the univariate and multivariate analyses between the responsive and unresponsive groups are shown in Table 2. As shown in Table 2, univariate logistic regression analysis showed that only extraocular movement limitation was significantly associated with responsiveness after IVMP treatment (odds ratio [OR] 0.632, 95% confidence interval [CI] 0.40 to 0.97; P = 0.044). After adjusting for confounding factors, extraocular muscle limitation and TSI were associated with the IVMP response. Less extraocular muscle limitation (OR 0.463, 95% CI 0.25 to 0.85, P = 0.014) and high TSI level were associated with a high risk of poor IVMP treatment response (OR 1.005, 95% CI 1.000 to 1.009, P = 0.038).
Table 2 Factors associated with the response to the steroid treatment investigated by univariate and multivariate binary logistic regression analysis (for an unresponsive group).
3.3 FS
Based on the results shown in Table 1, the least associated variables of age, height, type of TED, symmetry, vision, total cholesterol, and T3 were excluded from the original 32 variables in the first FS method (P = 0.910, P = 0.933, P = 0.791, P = 0.729, P = 0.959, P = 0.938, and P = 0.806, respectively). Second, we selected the final 17 top-ranked features (TSI, low-density lipoprotein [LDL] cholesterol, body mass index [BMI], TSH, Gorman score, Tg, TSHR ab, Visual field (VF) index, weight, high-density lipoprotein cholesterol, difference in exophthalmos, free T4, exophthalmos, duration of Graves’ disease, and CAS) based on the random forest and XGBoost feature importance methods.
3.4 Results of XGBoost classifier
We analyzed the features to predict responsiveness to IVMP treatment using both the random forest classifier and XGBoost. Before we applied SMOTE, the cross-validation accuracies of both models were 0.84 and 080. Using SMOTE, 142 datasets were created, and the final performance of XGBoost is presented in Table 3. When evaluated using the confusion matrix, the relatively high false-positive of 14 cases in the first model and 82.9% positive predictive value (PPV) before applying SMOTE were noticeably improved after using SMOTE. The false-positive of the model after applying SMOTE were 8 and PPV was 88.8%.
3.5 Important features predicting responsiveness after IVMP treatment investigated by SHAP
The SHAP summary results are shown in Figure 3, which rank features according to their importance in predicting responsiveness to IVMP treatment. TSH, TSI, and LDL cholesterol levels had the greatest effects on the model. Among them, TSI and LDL cholesterol showed a specific pattern in which a higher TSI level was associated with a significant impact on the model and a lower LDL cholesterol level was associated with a significant impact on the model. Figure 4 shows the effect of TSI and LDL cholesterol on responsiveness. TSI levels lower than 400 decreased the predicted probability, while a high level of TSI increased the predicted treatment responsiveness probability. An LDL cholesterol level of 80–100 increases the predicted probability, and an LDL cholesterol level over 110 decreases the predicted probability. TSH showed no definite pattern on SHAP dependence analysis.
Figure 3 SHAP summary plot of feature importance to predict steroid responsiveness. SHAP, SHapley Additive exPlanations; TSH thyroid-stimulating hormone; TSI thyroid-stimulating immunoglobulin; LDL-C low-density lipoprotein cholesterol; BMI body mass index; TG triglyceride; TSHRab thyroid-stimulating hormone receptor antibody; Exo diff differences of exophthalmos; VFI_OD visual field index_oculus dexter; ExoOS exophthalmos in oculus sinister; VFI_OS visual field index_oculus sinister; HDL-C high-density lipoprotein cholesterol; Gddur disease duration of graves disease; CAS clinical activity score; ExoOD exophthalmos in oculus dexter.
Figure 4 SHAP dependence plot for essential features. SHAP dependency analysis of TSI on the left and SHAP dependency analysis of LDL-cholesterol is described on the right. SHAP, SHapley Additive explanation; TSI thyroid-stimulating immunoglobulin; LDL-C low-density lipoprotein cholesterol.
4 Discussion
In this study, we discovered that severe extraocular movement limitation indicated a better response to IVMP treatment. Concurrently, higher TSI levels have been linked to a poor response to IVMP treatment in patients with active moderate-to-severe TED. Ahn and Lee (16) reported that Extra ocular muscle(EOM) enlargement on CT is a predictor of a favorable steroid treatment response, and Xu et al. found that the thickness of the EOM on magnetic resonance imaging was considerably higher in the responsive group than in the unresponsive group. Moreover, Naik et al. (17) classified soft tissue involvement in TED as either fat growth or predominant muscle enlargement, with the latter being more susceptible to steroid treatment. Inflammation and edematous changes in the EOM cause EOM enlargement and limit motility, which is consistent with the results of several other studies. In the multivariate analysis excluding interference from other factors, we found an OR of 0.463 (p = 0.014) for patients with severe extraocular movement limitation in response to treatment. However, this finding does not indicate better extraocular movement after treatment, and the overall inflammatory symptoms resolved. Additional research is required to determine extraocular movements based on treatment.
Although there is limited research regarding TSI, in 1995, Mori et al. reported that TSAb could be a valuable marker for predicting the efficacy of methylprednisolone pulse therapy in TED patients. In nine TED patients treated with intravenous methylprednisolone in this study, they found that high levels of TSAb are expected to be a factor for good response to therapy. Conversely, our study showed that a higher TSI level was significantly associated with poor responsiveness to treatment (OR, 1.005; p = 0.038) in the multivariate analysis. Furthermore, the TSI level analyzed with XGBoost was one of the most important features for predicting treatment responsiveness.
In terms of model interpretation, we applied SHAP to obtain a compelling and insightful measure of the importance of a feature in a model (18). SHAP is a method for estimating the contribution of each factor or characteristic, and it breaks down the prediction to show the impact of each feature (19). According to our SHAP results, we found specific patterns of TSI and LDL cholesterol levels. High TSI values over 400 and lower LDL cholesterol values under approximately 120 mg/dL enhanced the model prediction and significantly impacted the model. Both of these were the top two important features analyzed with permutation feature importance in the FS process.
The TSI has been reported as an excellent marker in patients with TED because of its sensitivity and specificity (20). TED was observed in TSI-positive but TRab-negative participants in one controlled trial, and TSI had a striking positive connection with the clinical activity and severity of TED. They also found that TSI causes TED pathogenesis even after anti-thyroid medication, and that TSI positivity was low in Graves’ disease(GD) patients without TED. Moreover, several studies have emphasized the need for TSI monitoring throughout the disease follow-up and treatment (21, 22). From this point of view and from our research, TSI is useful for diagnosis and follow-up, and for predicting steroid therapy response prior to treatment. According to our study’s SHAP analysis of TSI, when the TSI level exceeds approximately 400%, the impact on model output increases, and 400% serves as a critical pivot point for predicting responsiveness to IVMP treatment. When combined with statistical analyses, it is reasonable to assume that a TSI of 400% or above does not respond well to treatment. However, it is noteworthy that the AI model does not predict the outcome on the basis of this single attribute.
Several studies have been conducted to determine the relationship between lipid-lowering agents, LDL-cholesterol, and TED. According to Stein et al. (23), patients treated with statins had a 40% decreased risk of developing TED because of the anti-inflammatory action of statins. Sabini et al. (24) found a strong association between the presence of TED and both total and LDL cholesterol in patients with newly diagnosed GD. Additionally, there is a considerable direct link between TED and LDL cholesterol levels. LDL cholesterol levels were shown to be considerably higher in individuals with TED, and cutoff values for LDL cholesterol were established at 118.4 mg/dL, with levels above these values significantly related to an elevated risk of TED. Surprisingly, a value of 115 mg/dL for LDL cholesterol in the artificial intelligence analysis in our study revealed a critical insight. Although LDL did not demonstrate any statistical significance in statistical analyses, including multiple regression, it had the greatest influence on the construction of the AI model; when the value was less than the reference point near 115–120 mg/dL, it had a higher impact on the model’s prediction. Additional inflammation caused by a high LDL cholesterol may be taken to suggest that it adversely affects the prediction of steroid response, but once again, it is unreasonable to anticipate a patient’s treatment response in the clinic based on this factor (25, 26).
Finally, a final XGBoost model with high performace was constructed using 17 factors, and a more trustworthy and useful model was constructed using the SMOTE technique to overcome the high false-positive rate.
5 Conclusion
Consequently, statistically, EOM limitation and TSI level had a significant effect on the prediction of treatment response, and among the factors proven using machine learning, TSH, TSI, and LDL-C were the top three features; however, in the case of TSH, there were difficulties in the interpretation.
Although the number of patients was limited and the study was retrospective, this is the first study to validate the steroid response factor via XGBoost. We believe that this study verifies the applicability of the gradient boosting model to a variety of ophthalmic diseases by demonstrating that noteworthy results were obtained.
Data availability statement
The data analyzed in this study is subject to the following licenses/restrictions: Due to its ethical concerns, supporting data cannot be made openly available. further inquiries can be directed to the corresponding author. Requests to access these datasets should be directed to Jungyul Park, b3BoanlwQG5hdmVyLmNvbQ==.
Ethics statement
The studies involving human participants were reviewed and approved by The Institutional Review Board (IRB) of Pusan National University Hospital (IRB No. 2112-006-109), South Korea. Owing to the retrospective nature of the study, the IRB waived the need for patient consent. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.
Author contributions
Conception and design of the study (JP, JK, HC); conduction of the study (JP, JK, HC); collection and management of data (JP, JK); data analysis (JP, JK); data interpretation (JP, D-MR, JK, HC); and preparation, review, and approval of the manuscript (JP, JK, D-MR, HC).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Hu S, Wang Y, He M, Zhang M, Ding X, Shi B. Factors associated with the efficacy of intravenous methylprednisolone in moderate-to-severe and active thyroid-associated ophthalmopathy: A single-centre retrospective study. Clin endocrinology. (2019) 90(1):175–83. doi: 10.1111/cen.13855
2. Smith TJ. TSH-receptor-expressing fibrocytes and thyroid-associated ophthalmopathy. Nat Rev Endocrinology. (2015) 11(3):171–81. doi: 10.1038/nrendo.2014.226
3. Smith TJ, Kahaly GJ, Ezra DG, Fleming JC, Dailey RA, Tang RA, et al. Teprotumumab for thyroid-associated ophthalmopathy. New Engl J Med (2017) 376(18):1748–61. doi: 10.1056/NEJMoa1614949
4. Bartalena L, Baldeschi L, Bodoridis K, Eckstein A, Kahaly GJ, Marcocci C, et al. The 2016 European thyroid Association/European group on graves' orbitopathy guidelines for the management of graves' orbitopathy. Eur Thyroid J (2016) 5(1):9–26. doi: 10.1159/000443828
5. Zang S, Ponto K, Kahaly G. Intravenous glucocorticoids for graves' orbitopathy: Efficacy and morbidity. J Clin Endocrinol Metab (2011) 96(2):320–32. doi: 10.1210/jc.2010-1962
6. Eckstein AK, Plicht M, Lax H, Neuhäuser M, Mann K, Lederbogen S, et al. Thyrotropin receptor autoantibodies are independent risk factors for graves’ ophthalmopathy and help to predict severity and outcome of the disease. J Clin Endocrinol Metab (2006) 91(9):3464–70. doi: 10.1210/jc.2005-2813
7. Mourits MP, Prummel MF, Wiersinga WM, Koornneef L. Clinical activity score as a guide in the management of patients with graves' ophthalmopathy. Clin endocrinology. (1997) 47(1):9–14. doi: 10.1046/j.1365-2265.1997.2331047.x
8. Taylor PN, Zhang L, Lee RWJ, Muller I, Ezra DG, Dayan CM, et al. New insights into the pathogenesis and nonsurgical management of graves orbitopathy. Nat Rev Endocrinology. (2020) 16(2):104–16. doi: 10.1038/s41574-019-0305-4
9. Chen T, Guestrin C. Xgboost: A scalable tree boosting system. Proc 22nd ACM sigkdd Int Conf knowledge Discovery Data mining. (2016) 2016:785–94. doi: 10.1145/2939672.2939785
10. Al-Barakati HJ, Saigo H, Newman RH. RF-GlutarySite: A random forest based predictor for glutarylation sites. Mol omics. (2019) 15(3):189–204. doi: 10.1039/C9MO00028C
11. White C, Ismail HD, Saigo H. CNN-BLPred: A convolutional neural network based predictor for β-lactamases (BL) and their classes. BMC Bioinf (2017) 18(16):221–32. doi: 10.1186/s12859-017-1972-6
12. Yu J, Shi S, Zhang F, Chen G, Cao M. PredGly: Predicting lysine glycation sites for homo sapiens based on XGboost feature optimization. Bioinformatics (2019) 35(16):2749–56. doi: 10.1093/bioinformatics/bty1043
13. Choi S, Park J, Park S, Byon I, Choi HY. Establishment of a prediction tool for ocular trauma patients with machine learning algorithm. Int J Ophthalmol (2021) 14(12):9. doi: 10.18240/ijo.2021.12.20
14. Parsa AB, Movahedi A, Taghipour H, Derrible S, Mohammadian AK. Toward safer highways, application of XGBoost and SHAP for real-time accident detection and feature analysis. Accident Anal Prev (2020) 136:105405. doi: 10.1016/j.aap.2019.105405
15. Chapelle O, Chang Y. Yahoo! learning to rank challenge overview. Proc Learn to rank challenge (2011) 14:1–24.
16. Ahn HY, Lee JK. Intravenous glucocorticoid treatment for Korean graves' ophthalmopathy patients. J Korean Med Science. (2020) 35(23):e177. doi: 10.3346/jkms.2020.35.e177
17. Naik VM, Naik MN, Goldberg RA, Smith TJ, Douglas RS. Immunopathogenesis of thyroid eye disease: Emerging paradigms. Survey ophthalmology. (2010) 55(3):215–26. doi: 10.1016/j.survophthal.2009.06.009
18. Ribeiro MT, Singh S, Guestrin C. " why should i trust you?" explaining the predictions of any classifier. Proc 22nd ACM SIGKDD Int Conf knowledge Discovery Data Min (2016) 1135–44. doi: 10.1145/2939672.2939778
19. Shapley LS. A Value for n-Person Games. In: Kuhn H., Tucker A., Eds., Contributions to the Theory of Games II, Princeton University Press, Princeton. (1953) p307–17.
20. Lytton SD, Ponto KA, Kanitz M, Matheis N, Kohn LD, Kahaly GJ. A novel thyroid stimulating immunoglobulin bioassay is a functional indicator of activity and severity of graves’ orbitopathy. J Clin Endocrinol Metab (2010) 95(5):2123–31. doi: 10.1210/jc.2009-2470
21. Fassi DE, Banga JP, Gilbert JA, Padoa C, Hegedüs L, Nielen CH. Treatment of graves' disease with rituximab specifically reduces the production of thyroid stimulating autoantibodies. Clin Immunol (2009) 130(3):252–8. doi: 10.1016/j.clim.2008.09.007
22. Salvi M, Vannucchi G, Campi I, Currò N, Dazzi D, Simonetta S, et al. Treatment of graves’ disease and associated ophthalmopathy with the anti-CD20 monoclonal antibody rituximab: An open study. Eur J endocrinology. (2007) 156(1):33–40. doi: 10.1530/eje.1.02325
23. Stein JD, Childers D, Gupta S, Talwar N, Nan B, Lee BJ, et al. Risk factors for developing thyroid-associated ophthalmopathy among individuals with graves disease. JAMA ophthalmology. (2015) 133(3):290–6. doi: 10.1001/jamaophthalmol.2014.5103
24. Sabini E, Mazzi B, Profilo MA, Mautone T, Casini G, Rocchi R, et al. High serum cholesterol is a novel risk factor for graves' orbitopathy: results of a cross-sectional study. Thyroid (2018) 28(3):386–94. doi: 10.1089/thy.2017.0430
25. Busnelli M, Manzini S, Froio A, Vargiolu A, Cerrito MG, Smolenski RT, et al. Diet induced mild hypercholesterolemia in pigs: Local and systemic inflammation, effects on vascular injury–rescue by high-dose statin treatment. PloS One (2013) 8(11):e80588. doi: 10.1371/journal.pone.0080588
Keywords: extreme gradient boost, XGBoost, machine learning, graves orbitopathy, thyroid eye disease, novel risk factors
Citation: Park J, Kim J, Ryu D and Choi H-y (2023) Factors related to steroid treatment responsiveness in thyroid eye disease patients and application of SHAP for feature analysis with XGBoost. Front. Endocrinol. 14:1079628. doi: 10.3389/fendo.2023.1079628
Received: 31 October 2022; Accepted: 12 January 2023;
Published: 31 January 2023.
Edited by:
Giulia Lanzolla, University of Pennsylvania, United StatesReviewed by:
Li Ding, Tianjin Medical University General Hospital, ChinaKeunheung Park, Busan Medical Center, Republic of Korea
Copyright © 2023 Park, Kim, Ryu and Choi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Hee-young Choi, aHljaG9pQHB1c2FuLmFjLmty