Ultrasound-based radiomics XGBoost model to assess the risk of central cervical lymph node metastasis in patients with papillary thyroid carcinoma: Individual application of SHAP

Shi, Yan; Zou, Ying; Liu, Jihua; Wang, Yuanyuan; Chen, Yingbin; Sun, Fang; Yang, Zhi; Cui, Guanghe; Zhu, Xijun; Cui, Xu; Liu, Feifei

doi:10.3389/fonc.2022.897596

ORIGINAL RESEARCH article

Front. Oncol., 26 August 2022

Sec. Head and Neck Cancer

Volume 12 - 2022 | https://doi.org/10.3389/fonc.2022.897596

This article is part of the Research TopicInsights in Head and Neck Cancer: 2021View all 6 articles

Ultrasound-based radiomics XGBoost model to assess the risk of central cervical lymph node metastasis in patients with papillary thyroid carcinoma: Individual application of SHAP

Yan Shi^1†

Ying Zou^2,3†

Jihua Liu^2,3†

Zhi Yang¹

Xu Cui¹

Feifei Liu^1,5*

¹Binzhou Medical University Hospital, Binzhou, China
²First Teaching Hospital of Tianjin University of Traditional Chinese Medicine, Tianjin, China
³National Clinical Research Center for Chinese Medicine Acupuncture and Moxibustion, Tianjin, China
⁴Nanjing No. 1 Hospital, Nanjing, China
⁵Peking University People’s Hospital, Beijing, China

Objectives: A radiomics-based explainable eXtreme Gradient Boosting (XGBoost) model was developed to predict central cervical lymph node metastasis (CCLNM) in patients with papillary thyroid carcinoma (PTC), including positive and negative effects.

Methods: A total of 587 PTC patients admitted at Binzhou Medical University Hospital from 2017 to 2021 were analyzed retrospectively. The patients were randomized into the training and test cohorts with an 8:2 ratio. Radiomics features were extracted from ultrasound images of the primary PTC lesions. The minimum redundancy maximum relevance algorithm and the least absolute shrinkage and selection operator regression were used to select CCLNM positively-related features and radiomics scores were constructed. Clinical features, ultrasound features, and radiomics score were screened out by the Boruta algorithm, and the XGBoost model was constructed from these characteristics. SHapley Additive exPlanations (SHAP) was used for individualized and visualized interpretation. SHAP addressed the cognitive opacity of machine learning models.

Results: Eleven radiomics features were used to calculate the radiomics score. Five critical elements were used to build the XGBoost model: capsular invasion, radiomics score, diameter, age, and calcification. The area under the curve was 91.53% and 90.88% in the training and test cohorts, respectively. SHAP plots showed the influence of each parameter on the XGBoost model, including positive (i.e., capsular invasion, radiomics score, diameter, and calcification) and negative (i.e., age) impacts. The XGBoost model outperformed the radiologist, increasing the AUC by 44%.

Conclusions: The radiomics-based XGBoost model predicted CCLNM in PTC patients. Visual interpretation using SHAP made the model an effective tool for preoperative guidance of clinical procedures, including positive and negative impacts.

Introduction

Central cervical lymph node metastasis (CCLNM) is a critical factor affecting prognosis and recurrence in papillary thyroid carcinoma (PTC) patients (1). Therefore, preoperative prediction of CCLNM in an accurate and non-invasive manner is important. Ultrasound is the preferred method for evaluating PTC according to American Thyroid Association (ATA) guidelines (2). However, ultrasound shows limitations for assessing central cervical lymph nodes because of the interference from gas in the esophagus and trachea. Ultrasound only detects 20-31% of CCLNM preoperatively and only alters the surgical procedure of 20% of patients (3).

Radiomics mine quantitative image features from medical imaging in a high-throughput manner to improve predictive, diagnostic, and prognostic accuracy (4). Radiomics has shown clinical importance in breast (5), thyroid (6), bladder (7), and colorectal cancers (8). Ultrasound-based radiomics can assess lymph node metastasis in PTC patients to some extent (9, 10). Previous studies were primarily based on radiomics features, and logistic regression analysis was used to construct a nomogram for clinical prediction. The logistic model has good interpretability, and its model coefficient represents the importance of features to the prediction results. However, some variables with a causal relationship with the output variables may not be statistically significant (11). If variables are excluded only from statistical assumptions, available information will be reduced and features of improving prediction ability may be missed.

Machine learning is widely used in the medical field and has a high predictive accuracy (12). However, this model is a complex nonlinear relationship and the limitation of its application in clinical practice is caused by the inexplicability of the model. Several classification algorithms were used to compare diagnostic performance, such as eXtreme Gradient Boosting (XGBoost), deep learning, and transfer learning (6, 13, 14). However, these algorithms all have the “epistemic opacity” problem. The SHapley Additive exPlanation (SHAP) concept was introduced to solve the inexplicability bug (15). SHAP was successfully used to assess mortality in patients with gastrointestinal bleeding (16), prognosis of COVID-19 (17), and mortality in critically ill influenza patients (18).

Here, we constructed a radiomics-based machine learning model based on the ultrasound features of primary PTC lesions. We examined whether SHAP could perform interpretation of CCLNM. The purposes were as follows: to extract the critical features for predicting CCLNM; to establish radiomics-based machine learning model based on key features for CCLNM prediction; and use SHAP to complete the individualized visual interpretation.

Materials and methods

Patients

The pathological records of 704 PTC patients at Binzhou Medical University Hospital, Shandong, China (2017–2021) were retrospectively analyzed. Exclusion criteria were neck surgery or radiation therapy; history of other malignancy; measuring lines on the ultrasound images; nodules too large to obtain images covering the complete outline of the nodules, even after adjusting the scanning section position; and lacking complete clinical information. All patients were older than 18 years old with complete ultrasound image data. On the other hand, skip metastases are defined as lateral lymph node metastasis without the involvement of CCLNM in PTC. For such patients, although there was no CCLNM, the presence of metastatic lymph nodes in the lateral cervical region may also affect the model we constructed, so we excluded the patients with skip metastasis. Finally, the study included 587 PTC patients. Scikit-learning frame (Python programming language, version 3.7.9) was used to divided the patients into the training and test cohorts at a ratio of 8:2 randomly (Figure 1). CCLNM was diagnosed by pathological evaluation. The principles of operation for PTC patients are shown in Appendix 1. This study was approved by the institutional ethics committee of Binzhou Medical University Hospital (No. LW-024). Given the retrospective nature of the study, the requirement for written informed consent was waived.

FIGURE 1

Figure 1 Flowchart of patient selection and group allocation for the study.

Ultrasound image acquisition and analysis

Data were scanned with three different Doppler ultrasonic diagnostic apparatuses (Appendix 2). The diagnostic criteria of PTC on ultrasound were based on ATA guidelines. Ultrasound image analysis methods are shown in Appendix 3 and Appendix Figure 1. Ultrasound parameters included diameter, location (right lobe, left lobe, or isthmus), composition (mixed or solid), echogenicity (hyper/isoechoic, hypoechoic, or very hypoechoic), shape (wider-than-tall or taller-than-wide), margin (smooth, ill-defined, or lobulated/irregular), calcification (none, macrocalcification, rim calcification, or microcalcification), vascularization, and capsular invasion.

Radiomics workflow and score

Original images were exported from the ultrasound imaging workstation in digital imaging and communications in medicine (DICOM) format and imported into ITK-SNAP (version 3.8.0). The polygon tool was used to sketch along the nodule edge to generate regions of interest. The original and segmented images were saved in NRRD format. Histogram equalization was used to preprocess the segmented images (Appendix 4 and Appendix Figure 2). Shape, first-order, texture, wavelet, square root, logarithm, gradient, square, and exponential features were automatically extracted using open-source software (Pyradiomics; http://pyradiomics.readthedocs.io/en/latest/index.html). To ensure the reliability and accuracy of the results, zero-mean normalization (z-score) was performed for data normalization to eliminate index dimension differences of data (19). Features with missing values were deleted; the moderated t-test method was used for difference analysis of the remaining elements. Interclass correlation coefficient (ICC) was used to assess the interobserver agreement of the feature extraction; ICC greater than 0.75 was excellent. A minimum-redundancy maximum relevance (mRMR) was used to remove redundant features. The least absolute shrinkage and selection operator (LASSO) logistic regression method using 10-fold cross-validation was applied to select the most useful predictive CCLNM status-related features from the training cohort. A radiomics score was generated per patient using a linear combination of the chosen features weighted by the LASSO algorithm (Figure 2).

FIGURE 2

Figure 2 Flowchart of the study.

Boruta algorithm: Key features selection

Previous screening methods based on the feature importance of decision trees tended to overestimate the importance of high frequency or high cardinality variables. Therefore, we used the Boruta algorithm, which is suitable for random forest and XGBoost classifiers, to filter out all feature sets used to predict CCLNM status (20). A Boruta algorithm incorporating the independent ultrasound variables, clinical factors, and radiomics score selected the final predictors for CCLNM.

Establishment and validation of the XGBoost model

Imbalanced data problems significantly degrade the classification performance in machine learning (21). Therefore, we applied the synthetic minority oversampling technique (SMOTE), an oversampling method that randomly generates new instances of minority classes to balance the number of categories (22), during the training process. We used the SMOTE method to generate a perfectly balanced dataset with the same number of instances based on the primary group.

The XGBoost model was constructed based on the final key features screened by the Boruta algorithm. The XGBoost algorithm is described in Appendix 5. After the XGBoost model training, a 10-fold cross-validation grid-search method was used to fine-tune the XGBoost algorithm. The balanced accuracy (BA), F-score, Matthew’s correlation coefficient (MCC), precision, recall, and area under the receiver operating characteristic (ROC) curve (AUC) were applied to assess the XGBoost model. The model accuracy was evaluated and is presented as root mean square error and coefficient of determination (R²). Unsupervised cluster analysis (K-means algorithm) was performed for risk stratification. Decision curve analysis was conducted to estimate net benefits of the XGBoost model at different threshold probabilities. We compared the CCLNM status predicted by the XGBoost model with the status assessed by the radiologist.

SHAP

SHAP provides a powerful method to measure the importance of features (23, 24) and is introduced to solve the inexplicability bug of machine-learning models. SHAP calculates each variable’s contribution value to the XGBoost model. The SHAP value corresponds to the measure of additive feature attributions. Therefore, the XGBoost model can be visually interpreted globally and locally using SHAP, thus solving the artificial intelligence “black-box” problem.

Principal components analysis (PCA)

To determine the reliability and reproducibility of the XGBoost model, we assessed potential sources of error during data collection. We assessed the batch effect of ultrasound scanner types on XGBoost using PCA.

Performance comparison with traditional machine learning models

Six machine learning classifiers, including random forest (RF), artificial neural network (ANN), support vector machine (SVM), decision tree (DT), naive Bayesian (NB), and logistic regression analysis (LRA), were established by the scikit-learn Python library (version 0.24.1). A brief description of these machine learning classifiers is shown in Appendix 6. The performance of the XGBoost model was compared with the above six classifiers.

Statistical analysis

Statistical analyses were performed using R software 4.0.5 and Python Programming Language (version 3.7.9; Python Software Foundation, Wilmington, DE, USA). R software, GraphPad Prism 9.0.0, MedCalc 18.2.1, OriginPro 9.1, and Python were used to create the graphs. A p-value less than 0.05 was considered statistically significant. The consistency of PTC ultrasonic image features was evaluated by two radiologists using Kappa. LASSO was performed by R software (glmnet package). Boruta algorithm was operated by R software (Boruta package), and SMOTE was performed by Python imbalanced-learn 0.8.0. The scikit-learn Python library (version 0.24.1) and XGBoost frame (version 1.0.0) were used to establish the XGBoost model in Python. The SHAP Python frame (version 0.39.0) was used to perform SHAP algorithms.

GitHub

The design code of this study by R software and Python is available on GitHub (https://github.com/shi4180/RadProject).

Results

Patient demographics

The baseline epidemiologic and ultrasound image characteristics of the training and test cohorts are listed in Table 1. Among the 469 patients in the training cohort, 121 had CCLNM and 348 did not have CCLNM. Of the 118 patients in the test cohort, 32 patients had CCLNM. The Kappa coefficients of the categorical variables were all greater than 0.8 in the consistency analyses; the ICC of diameter was over 0.9 (Appendix Table 1).

TABLE 1

Table 1 Demographics and ultrasound image characteristics in the training and test cohorts.

Radiomics feature screening and radiomics score

In the training cohort, 939 features were extracted from the original ultrasound images, including 9 shape features, 18 first-order features, 75 texture features, 372 wavelet features, 93 squareroot features, 93 logarithm features, 93 gradient features, 93 square features, and 93 exponential features. We removed 143 invalid features, and 424 features were screened out from the remaining 796 features after difference analysis (Appendix Figure 3). The mRMR algorithm was used to select the 100 most critical features (Appendix Figure 4). Next, 11 potential features were chosen among 100 elements in the training cohort with nonzero coefficients in the 10-fold cross-validation LASSO logistic regression model (Appendix Figure 5 and Appendix Table 2). These 11 features were used to calculate the radiomics score. The radiomics scores of CCLNM (+) were 0.36 ± 0.15 and 0.35 ± 0.14 in the training and test cohorts and 0.22 ± 0.09 and 0.23 ± 0.09 for CCLNM (-) patients in the training and test cohorts, respectively (Table 1 and Appendix Figure 6). The performance of CCLNM evaluated by the radiomics score is shown in Appendix Table 3.

Development and performance of the XGBoost model with risk stratification

Five key features were selected by the Boruta algorithm: capsular invasion, radiomics score, diameter, age, and calcification (Figure 3 and Appendix Table 4). The XGBoost model was established based on these five key features.

FIGURE 3

Figure 3 The Boruta algorithm incorporating the independent clinical, ultrasound variables and radiomics score was performed to select the final predictors for CCLNM. The key features included capsular invasion, radiomics score, diameter, age, and calcification.

The performance of the XGBoost model is shown in Table 2. The AUC values of the preoperative assessment of CCLNM in the training and test cohorts were 91.53% and 90.88%, respectively (Figure 4A); the BA was over 80% and MCC was more than 60% in the two cohorts. The XGBoost algorithm reflected a good learning curve in the training dataset (Figure 4B). The two curves representing the training and test cohorts converged to 0.85, indicating that the XGBoost model effectively prevented overfitting. The detection error tradeoff curves showed that the curves of the two cohorts concentrated in the third quadrant, indicating that the false rejection rate and false acceptance rate were both low (Figures 4C, D). The decision curve revealed that if the threshold probability of a physician was over 8%, more advantages would be added by using the XGBoost model to estimate CCLNM in PTC patients (Appendix Figure 7).

TABLE 2

Table 2 Performance of the XGBoost model.

FIGURE 4

Figure 4 ROC, learning, and DET curves of the XBoost model. (A) The ROC curves of the XGBoost model in the training and test cohorts for predicting CCLNM in PTC patients, with an AUC of 0.9153 and 0.9088, respectively. (B) The learning curve in the training cohort of the XGBoost model. The two curves finally merge near 0.85, indicating that the model is well fitted for training. (C, D) The DET curves of the XGBoost model in the training and test cohorts. They were both concentrated in the third quadrant, indicating that the false rejection rate and false acceptance rate were both low.

Compared with the radiologist, the XGBoost model showed an increased AUC value by 44%. The sensitivity and specificity were also improved to varying degrees (Appendix Figure 8 and Appendix Table 5). The diagnostic sensitivity of the radiologists was only 33.33%, that is to say, many occult metastatic lymph nodes were ignored, and the sensitivity and specificity of the XGBoost model were greatly improved, especially the sensitivity. Therefore, to a certain extent, XGBoost model could spot occult metastatic lymph nodes that cannot be assessed by radiologists.

Along with the XGBoost model, we developed a risk stratification system in the training cohort (Appendix Figure 9). All patients were grouped into three categories: low-risk (0-36%), intermediate-risk (37%–58%), and high-risk groups (59%–100%).

Visual interpretation of the XGBoost model using SHAP

The Sankey plot (Appendix Figure 10) showed the orientation of all patients from the predicted CCLNM to the true CCLNM after shunting by age, diameter, calcification, radiomics score, and capsular invasion. The results showed that 19 patients with a predicted negative CCLNM actually had CCLNM. In contrast, 40 patients predicted to have CCLNM were confirmed to be CCLNM (-) based on postoperative pathology (Appendix Figure 11).

The classified bar chart of the SHAP summary plots was obtained by extracting the average absolute value of SHAP for each parameter, including capsular invasion, radiomics score, diameter, age, and calcification, to show the global significance (Figure 5A). The scatter plot of the SHAP summary plot reflects the relationship between the five key parameters and predicted probability through the color, including the positive and negative effects (Figure 5B). Radiomics score > 0.3, presence of capsular invasion, diameter > 1.56 cm, male sex, and presence of microcalcification played a positive role in assessing CCLNM. In contrast, older age had a negative effect; that is, patients ≤ 43 years old were more likely to develop CCLNM than patients 43 years old or older.

FIGURE 5

Figure 5 SHAP plots of the XGBoost model. (A) The classified bar charts of the SHAP summary plots show the influence of each parameter on the XGBoost model. (B) The SHAP summary plot’s scatter plot shows the relationship between the characteristic value and the predicted probability through colors, including positive and negative predictive effects. (C) SHAP decision plot for all patients with PTC; (D) SHAP decision plot for 10 random patients with PTC, with one misjudgment case (dotted line).

The SHAP decision plots (Figures 5C, D) demonstrated how each critical parameter affected the final decision. Each colored line on the figure represents the predicted outcome for each patient. Moving from the bottom to the top, the SHAP of each element was added to the base value of the XGBoost model, showing how each feature contributed to the overall prediction, including the positive and negative effects. Finally, each line touched the X-axis at the top with its corresponding prediction value, which was the model’s final prediction probability. Figure 6 shows two examples of the correct prediction of CCLNM(+) and CCLNM(-).

FIGURE 6

Figure 6 Two examples of correct prediction of CCLNM+ and CCLNM-. (A–D) A 28-year-old male patient was admitted to our hospital for further treatment after physical examination found a nodule in the right lobe of the thyroid. Ultrasound examination showed a nodule in the middle of the right lobe of the thyroid, 0.69 cm in diameter, with coarse calcification and capsular invasion. Postoperative pathological findings: papillary carcinoma of the right lobe of thyroid, with lymph node metastasis in the right central region. Retrospective analysis of the case showed that the radiomics score was 0.657 and the XGBoost model predicted CCLNM+ correctly. (E–H) A 44-year-old male patient was admitted to our hospital because or volume increase of the right thyroid lobe. Ultrasound examination showed a nodule in the middle of the right lobe of thyroid, 2.74 cm in diameter, with coarse calcification and without capsular invasion. Postoperative pathological findings: papillary carcinoma of the right lobe of thyroid, with no lymph node metastasis in the right central region. Retrospective analysis of the case showed that the 4adiomics score was 0.128 and the XGBoost model predicted CCLNM- correctly.

Evaluation of the XGBoost model stability and repeatability using PCA

To determine the stability and reproducibility of the XGBoost model, we evaluated potential errors during data preparation (25). We assessed the batch effect of ultrasound scanner types on the XGBoost and radiomics score. PCA of the XGBoost for all PTC patients in the training cohort showed no association with the three ultrasound scanner types. This result indicates that the XGBoost model was unaffected by the different types of ultrasound scanners (Appendix Figure 12).

Performance comparison of XGBoost model with six classifiers

In the training cohort, AUCs for RF, ANN, SVM, DT, NB, and LRA were 89.13%, 89.48%, 90.14%, 82.12%, 90.01%, and 88.06%, respectively. Thus, the XGBoost model outperformed other machine learning algorithms in the training cohort (Appendix Table 6 and Appendix Figure13).

Discussion

In this study, we investigated the feasibility and accuracy of the radiomics-based XGBoost model for prediction of CCLNM in PTC patients based on ultrasound images of the primary tumor. Our study has three significant findings. First, capsular invasion, radiomics score, diameter, age, and calcification were the key features for predicting CCLNM status. Second, the radiomics-based XGBoost model based on the key features showed a favorable ability to discriminate between CCLNM (+) and CCLNM (-), with AUC values of 91.53% and 90.88% in the training and test cohorts, respectively. Third, SHAP provided a reasonable visual interpretation of the prediction, including positive and negative effects.

Ultrasound features, capsular invasion, diameter, and calcification were the critical features for predicting CCLNM. We speculate this may be because of the following reasons. Tumor cells enter the lymph fluid after breaking through the thyroid capsule. When metastatic tumor cells invade the capsule of the lymph nodes, the nodes eventually develop into metastatic lymph nodes (26). A larger tumor diameter indicates a more aggressive tumor. Microcalcification is an indicator of cancer tissue hyperplasia and rapid proliferation of cancer cells; microcalcification may thus be a potential promoter of CCLNM to some extent (27). Previous studies set the age threshold of the PTC patients to 45 or 55 years old according to American Joint Committee on Cancer guidelines (28, 29). In this study, younger age (≤ 43 years) was a key factor for CCLNM, which indicated that PTC occurs in younger patients in recent years.

The radiomics score was independently associated with CCLNM by the Boruta algorithm. Establishing a radiomics score with LASSO has demonstrated excellent results in predicting lymph node metastasis in breast cancer (30), cervical cancer (31), pancreatic carcinoma (32), rectal cancer (33), and lung cancer (34). Radiomics characteristics are closely related to the microstructure and biological behavior of the tumors (31). The radiomics score is based on the high-dimensional and statistical features, which were extracted from primary thyroid tumors. In this study, 11 radiomics features were used to calculate the radiomics score. These features represent the texture information of tumors, which is highly associated with tumor heterogeneity (35). For machine learning, features selection and the reproducibility of the model on different devices are crucial. The Boruta algorithm was used to filter the optimal features, and this algorithm has been recognized by many cutting-edge studies (20, 36). The PCA method was used to ensure the repeatability between different devices, and the result was satisfactory. Therefore, the XGBoost model constructed in this research was also applicable to images obtained by other ultrasound equipment.

Although some studies have used machine learning to predict lymph node metastasis in PTC (6, 37), the “black-box” problem remained to be clarified, that is, an apparent conflict between the performance of the complex model and the clinical interpretability. In this study, the XGBoost model provided a visual interpretation for individual patients using SHAP plots, including positive and negative effects. SHAP considers the impact of a single feature as well as the synergistic effects between features. The XGBoost model not only predicted the possibility of CCLNM but it also provided a rational explanation for the prediction, which may improve clinicians’ confidence in the model.

At present, there are still some CT-based (38–40) and MRI-based (41, 42) radiomics models to predict the condition of cervical lymph nodes in patients with PTC, and they have also achieved good diagnostic performance. However, CT and MRI have certain limitations compared to ultrasound-based radiomics. On the one hand, for CT-based radiomics, tumor diameters < 0.5 cm were not included because they could not be reliably identified and segmented on CT images. The usage of iodinated contrast agents might have the potential to affect the uptake of iodine during the subsequent radioiodine therapy. The increased radiation exposure during contrast-enhanced CT scan should not be ignored. On the other hand, for MRI-based radiomics, the MRI examination involved various elements, including the use of a sequence, magnetic field intensity and some parameters, such as the time of repetition (TR) and the time of echo (TE). The use of MRI radiomics is still a huge challenge due to the complexity of MRI signals, which need to be normalized and standardized.

This study has several limitations. First, this study lacked external validation in other hospitals. The XGBoost model cannot be used for performance verification on other ultrasound equipment. However, we conducted PCA and demonstrated that different equipment did not affect the predictive ability of the model. Second, this study was retrospective in nature. Therefore, we were unable to collect information on the size and number of metastatic lymph nodes. A prospective study is required to confirm the accuracy of the XGBoost model. Third, the peritumoral area was not analyzed, and this should be examined as it might provide information on tumor invasiveness and lymph node metastasis (43). Fourth, the specific location of the lesion, which is close to the upper/lower pole or the anterior/posterior capsule of the thyroid, may have a potential impact on cervical lymph node metastasis. Therefore, the above factors will be included in the next study for in-depth discussion.

In conclusion, we have proposed a radiomics-based XGBoost model for predicting CCLNM in patients with PTC and showed that the model surpassed the evaluation ability of the radiologists. The model integrated ultrasound imaging information with clinical parameters of PTC patients. SHAP provides a reasonable visual interpretation of the XGBoost model to predict CCLNM in patients with PTC. We speculate that the XGBoost model will serve as a promising adjunct in the preoperative evaluation of CCLNM and help assist clinical decision-making for patients with PTC, thereby improving patient prognosis.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Author contributions

YS, YZ, JL, and FL contributed to the conception and design of the study and wrote the draft of the manuscript. YW, YC, FS, GC, and XZ organized the database. YZ and XC performed the statistical analysis. All authors contributed to manuscript revision, read, and approved the submitted version.

Acknowledgments

We thank Medjaden Inc. for scientific editing of this manuscript.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2022.897596/full#supplementary-material

References

1. Siegel RL, Miller KD, Fuchs HE, Jemal A. Cancer statistics, 2021. CA: Cancer J Clin (2021) 71(1):7–33. doi: 10.3322/caac.21654

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Haugen BR, Alexander EK, Bible KC, Doherty GM, Mandel SJ, Nikiforov YE, et al. 2015 American thyroid association management guidelines for adult patients with thyroid nodules and differentiated thyroid cancer. Thyroid Off J Am Thyroid Assoc (2016) 26(1):1–133. doi: 10.1089/thy.2015.0020

CrossRef Full Text | Google Scholar

3. Roh JL, Kim JM, Park CI. Central lymph node metastasis of unilateral papillary thyroid carcinoma: patterns and factors predictive of nodal metastasis, morbidity, and recurrence. Ann Surg Oncol (2011) 18(8):2245–50. doi: 10.1245/s10434-011-1600-z

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Lambin P, Leijenaar RTH, Deist TM, Peerlings J, de Jong EEC, van Timmeren J, et al. Radiomics: the bridge between medical imaging and personalized medicine. Nat Rev Clin Oncol (2017) 14(12):749–62. doi: 10.1038/nrclinonc.2017.141

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Conti A, Duggento A, Indovina I, Guerrisi M, Toschi N. Radiomics in breast cancer classification and prediction. Semin Cancer Biol (2021) 72:238–50. doi: 10.1016/j.semcancer.2020.04.002

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Yu J, Deng Y, Liu T, Zhou J, Jia X, Xiao T, et al. Lymph node metastasis prediction of papillary thyroid carcinoma based on transfer learning radiomics. Nat Commun (2020) 11(1):4807. doi: 10.1038/s41467-020-18497-3

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Wu S, Zheng J, Li Y, Yu H, Shi S, Xie W, et al. A radiomics nomogram for the preoperative prediction of lymph node metastasis in bladder cancer. Clin Cancer Res (2017) 23(22):6904–11. doi: 10.1158/1078-0432.CCR-17-1510

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Huang YQ, Liang CH, He L, Tian J, Liang CS, Chen X, et al. Development and validation of a radiomics nomogram for preoperative prediction of lymph node metastasis in colorectal cancer. J Clin Oncol (2016) 34(18):2157–64. doi: 10.1200/JCO.2015.65.9128

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Jiang M, Li C, Tang S, Lv W, Yi A, Wang B, et al. Nomogram based on shear-wave elastography radiomics can improve preoperative cervical lymph node staging for papillary thyroid carcinoma. Thyroid (2020) 30(6):885–97. doi: 10.1089/thy.2019.0780

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Tong Y, Li J, Huang Y, Zhou J, Liu T, Guo Y, et al. Ultrasound-based radiomic nomogram for predicting lateral cervical lymph node metastasis in papillary thyroid carcinoma. Acad Radiol (2020) 28(12):1675–84. doi: 10.1016/j.acra.2020.07.017

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Jonczyk MM, Fisher CS, Babbitt R, Paulus JK, Freund KM, Czerniecki B, et al. Surgical predictive model for breast cancer patients assessing acute postoperative complications: The breast cancer surgery risk calculator. Ann Surg Oncol (2021) 28(9):5121–31. doi: 10.1245/s10434-021-09710-8

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Rajkomar A, Dean J, Kohane I. Machine learning in medicine. New Engl J Med (2019) 380(14):1347–58. doi: 10.1056/NEJMra1814259

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Yu Z, Ji H, Xiao J, Wei P, Song L, Tang T, et al. Predicting adverse drug events in Chinese pediatric inpatients with the associated risk factors: A machine learning study. Front Pharmacol (2021) 12:659099. doi: 10.3389/fphar.2021.659099

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Buda M, Wildman-Tobriner B, Hoang JK, Thayer D, Tessler FN, Middleton WD, et al. Management of thyroid nodules seen on US images: Deep learning may match performance of radiologists. Radiology (2019) 292(3):695–701. doi: 10.1148/radiol.2019181343

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Bach S, Binder A, Montavon G, Klauschen F, Müller KR, Samek W. On pixel-wise explanations for non-linear classifier decisions by layer wise relevance propagation. PloS One (2015) 10(7):e0130140. doi: 10.1371/journal.pone.0130140

CrossRef Full Text | Google Scholar

16. Deshmukh F, Merchant SS. Explainable machine learning model for predicting GI bleed mortality in the intensive care unit. Am J Gastroenterol (2020) 115(10):1657–68. doi: 10.14309/ajg.0000000000000632

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Pan P, Li Y, Xiao Y, Han B, Su L, Su M, et al. Prognostic assessment of COVID-19 in the intensive care unit by machine learning methods: Model development and validation. J Med Internet Res (2020) 22(11):e23128. doi: 10.2196/23128

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Hu C-A, Chen C-M, Fang Y-C, Liang S-J, Wang H-C, Fang W-F, et al. Using a machine learning approach to predict mortality in critically ill influenza patients: a cross-sectional retrospective multicentre study in Taiwan. BMJ Open (2020) 25(10(2):e033898. doi: 10.1136/bmjopen-2019-033898

CrossRef Full Text | Google Scholar

19. Yang L, Gu D, Wei J, Yang C, Rao S, Wang W, et al. A radiomics nomogram for preoperative prediction of microvascular invasion in hepatocellular carcinoma. Liver Cancer (2019) 8(5):373–86. doi: 10.1159/000494099

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Degenhardt F, Seifert S, Szymczak S. Evaluation of variable selection methods for random forests and omics data sets. Brief Bioinform (2019) 20(2):492–503. doi: 10.1093/bib/bbx124

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Yoo TK, Ryu IH, Choi H, Kim JK, Lee IS, Kim JS, et al. Explainable machine learning approach as a tool to understand factors used to select the refractive surgery technique on the expert level. Transl Vis Sci Technol (2020) 9(2):8. doi: 10.1167/tvst.9.2.8

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Parsa AB, Movahedi A, Taghipour H, Derrible S, Mohammadian AK. Toward safer highways, application of XGBoost and SHAP for real-time accident detection and feature analysis. Accid Anal Prev (2020) 136:105405. doi: 10.1016/j.aap.2019.105405

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Stojic A, Stanic N, Vukovic G, Stanisic S, Perisic M, Sostaric A, et al. Explainable extreme gradient boosting tree-based prediction of toluene, ethylbenzene and xylene wet deposition. Sci Total Environ (2019) 653:140–7. doi: 10.1016/j.scitotenv.2018.10.368

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Lundberg SM, Nair B, Vavilala MS, Horibe M, Eisses MJ, Adams T, et al. Explainable machine-learning predictions for the prevention of hypoxaemia during surgery. Nat BioMed Eng (2018) 2(10):749–60. doi: 10.1038/s41551-018-0304-0

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Lu H, Arshad M, Thornton A, Avesani G, Cunnea P, Curry E, et al. A mathematical-descriptor of tumor-mesoscopic-structure from computed-tomography images annotates prognostic- and molecular-phenotypes of epithelial ovarian cancer. Nat Commun (2019) 10(1):764. doi: 10.1038/s41467-019-08718-9

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Karaman S, Detmar M. Mechanisms of lymphatic metastasis. J Clin Invest (2014) 124(3):922–8. doi: 10.1172/JCI71606

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Wu Q, Zhang YM, Sun S, Li JJ, Wu J, Li X, et al. Clinical and sonographic assessment of cervical lymph node metastasis in papillary thyroid carcinoma. J Huazhong Univ Sci Technolog Med Sci (2016) 36(6):823–7. doi: 10.1007/s11596-016-1669-5

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Wang Y, Guan Q, Xiang J. Nomogram for predicting central lymph node metastasis in papillary thyroid microcarcinoma: A retrospective cohort study of 8668 patients. Int J Surg (2018) 55:98–102. doi: 10.1016/j.ijsu.2018.05.023

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Yu X, Song X, Sun W, Zhao S, Zhao J, Wang YG. Independent risk factors predicting central lymph node metastasis in papillary thyroid microcarcinoma. Horm Metab Res (2017) 49(3):201–7. doi: 10.1055/s-0043-101917

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Gao Y, Luo Y, Zhao C, Xiao M, Ma L, Li W, et al. Nomogram based on radiomics analysis of primary breast cancer ultrasound images: prediction of axillary lymph node tumor burden in patients. Eur Radiol (2021) 31(2):928–37. doi: 10.1007/s00330-020-07181-1

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Jin X, Ai Y, Zhang J, Zhu H, Jin J, Teng Y, et al. Noninvasive prediction of lymph node status for patients with early-stage cervical cancer based on radiomics features from ultrasound images. Eur Radiol (2020) 30(7):4117–24. doi: 10.1007/s00330-020-06692-1

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Bian Y, Guo S, Jiang H, Gao S, Shao C, Cao K, et al. Relationship between radiomics and risk of lymph node metastasis in pancreatic ductal adenocarcinoma. Pancreas (2019) 48(9):1195–203. doi: 10.1097/MPA.0000000000001404

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Nakanishi R, Akiyoshi T, Toda S, Murakami Y, Taguchi S, Oba K, et al. Radiomics approach outperforms diameter criteria for predicting pathological lateral lymph node metastasis after neoadjuvant (Chemo)Radiotherapy in advanced low rectal cancer. Ann Surg Oncol (2020) 27(11):4273–83. doi: 10.1245/s10434-020-08974-w

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Xie Y, Zhao H, Guo Y, Meng F, Liu X, Zhang Y, et al. A PET/CT nomogram incorporating SUVmax and CT radiomics for preoperative nodal staging in non-small cell lung cancer. Eur Radiol (2021) 31(8):6030–8. doi: 10.1007/s00330-020-07624-9

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Hu HT, Wang Z, Huang XW, Chen SL, Zheng X, Ruan SM, et al. Ultrasound-based radiomics score: a potential biomarker for the prediction of microvascular invasion in hepatocellular carcinoma. Eur Radiol (2019) 29(6):2890–901. doi: 10.1007/s00330-018-5797-0

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Costa OYA, de Hollander M, Pijl A, Liu B, Kuramae EE. Cultivation-independent and cultivation-dependent metagenomes reveal genetic and enzymatic potential of microbial community involved in the degradation of a complex microbial polymer. Microbiome (2020) 8(1):76. doi: 10.1186/s40168-020-00836-7

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Thomas J, Haertling T. AIBx, artificial intelligence model to risk stratify thyroid nodules. Thyroid Off J Am Thyroid Assoc (2020) 30(6):878–84. doi: 10.1089/thy.2019.0752

CrossRef Full Text | Google Scholar

38. Li J, Wu X, Mao N, Zheng G, Zhang H, Mou Y, et al. Computed tomography-based radiomics model to predict central cervical lymph node metastases in papillary thyroid carcinoma: A multicenter study. Front Endocrinol (Lausanne) (2021) 12:741698. doi: 10.3389/fendo.2021.741698

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Lai L, Guan Q, Liang Y, Chen J, Liao Y, Xu H, et al. A computed tomography-based radiomic nomogram for predicting lymph node metastasis in patients with early-stage papillary thyroid carcinoma. Acta Radiol (2021) 3:2841851211054194. doi: 10.1177/02841851211054194

CrossRef Full Text | Google Scholar

40. Zhou Y, Su GY, Hu H, Ge YQ, Si Y, Shen MP, et al. Radiomics analysis of dual-energy CT-derived iodine maps for diagnosing metastatic cervical lymph nodes in patients with papillary thyroid cancer. Eur Radiol (2020) 30(11):6251–62. doi: 10.1007/s00330-020-06866-x

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Zhang H, Hu S, Wang X, He J, Liu W, Yu C, et al. Prediction of cervical lymph node metastasis using MRI radiomics approach in papillary thyroid carcinoma: A feasibility study. Technol Cancer Res Treat (2020) 19:1533033820969451. doi: 10.1177/1533033820969451

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Qin H, Que Q, Lin P, Li X, Wang XR, He Y, et al. Magnetic resonance imaging (MRI) radiomics of papillary thyroid cancer (PTC): a comparison of predictive performance of multiple classifiers modeling to identify cervical lymph node metastases before surgery. Radiol Med (2021) 126(10):1312–27. doi: 10.1007/s11547-021-01393-1

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Beig N, Khorrami M, Alilou M, Prasanna P, Braman N, Orooji M, et al. Perinodular and intranodular radiomic features on lung CT images distinguish adenocarcinomas from granulomas. Radiology (2019) 290(3):783–92. doi: 10.1148/radiol.2018180910

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: radiomics, lymphatic metastasis, papillary thyroid carcinoma, ultrasound, machine learning

Citation: Shi Y, Zou Y, Liu J, Wang Y, Chen Y, Sun F, Yang Z, Cui G, Zhu X, Cui X and Liu F (2022) Ultrasound-based radiomics XGBoost model to assess the risk of central cervical lymph node metastasis in patients with papillary thyroid carcinoma: Individual application of SHAP. Front. Oncol. 12:897596. doi: 10.3389/fonc.2022.897596

Received: 16 March 2022; Accepted: 08 August 2022;
Published: 26 August 2022.

Edited by:

Andreas Dietz, Leipzig University, Germany

Reviewed by:

Xiaofeng Tao, Shanghai Jiao Tong University, China
Yukun Luo, People’s Liberation Army General Hospital, China
Shufang Pei, Guangdong Academy of Medical Sciences, China

Copyright © 2022 Shi, Zou, Liu, Wang, Chen, Sun, Yang, Cui, Zhu, Cui and Liu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Feifei Liu, bml1LmZlaTUyMUAxNjMuY29t

^†These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Ultrasound-based radiomics XGBoost model to assess the risk of central cervical lymph node metastasis in patients with papillary thyroid carcinoma: Individual application of SHAP

Introduction

Materials and methods

Patients

Ultrasound image acquisition and analysis

Radiomics workflow and score

Boruta algorithm: Key features selection

Establishment and validation of the XGBoost model

SHAP

Principal components analysis (PCA)

Performance comparison with traditional machine learning models

Statistical analysis

GitHub

Results

Patient demographics

Radiomics feature screening and radiomics score

Development and performance of the XGBoost model with risk stratification

Visual interpretation of the XGBoost model using SHAP

Evaluation of the XGBoost model stability and repeatability using PCA

Performance comparison of XGBoost model with six classifiers

Discussion

Data availability statement

Author contributions

Acknowledgments

Conflict of interest

Publisher’s note

Supplementary material

References

94% of researchers rate our articles as excellent or good

94% of researchers rate our articles as excellent or good