Skip to main content

ORIGINAL RESEARCH article

Front. Oncol., 08 December 2022
Sec. Cancer Imaging and Image-directed Interventions
This article is part of the Research Topic Deep Learning Approaches in Image-guided Diagnosis for Tumors View all 14 articles

Incorporation of a machine learning pathological diagnosis algorithm into the thyroid ultrasound imaging data improves the diagnosis risk of malignant thyroid nodules

Wanying LiWanying Li1Tao Hong&#x;Tao Hong2†Jianqiang Fang,&#x;Jianqiang Fang3,4†Wencai Liu&#x;Wencai Liu5†Yuwen Liu&#x;Yuwen Liu6†Cunyu HeCunyu He4Xinxin LiXinxin Li4Chan XuChan Xu4Bing WangBing Wang4Yuanyuan ChenYuanyuan Chen7Chenyu SunChenyu Sun8Wenle Li*Wenle Li9*Wei Kang*Wei Kang10*Chengliang Yin*Chengliang Yin11*
  • 1Center for Management and Follow-up of Chronic Diseases, Xianyang Central Hospital, Xianyang, China
  • 2Pediatric Surgery Ward, Fuwai Hospital Chinese Academy of Medical Sciences, Shenzhen, China
  • 3Ultrasound Interventional Department, Xianyang Central Hospital, Xianyang, China
  • 4Clinical Medical Research Center, Xianyang Central Hospital, Xianyang, China
  • 5Department of Orthopaedic Surgery, The First Affiliated Hospital of Nanchang University, Nanchang, China
  • 6Department of Chronic Disease and Endemic Disease Control Branch, Xiamen Municipal Center for Disease Control and Prevention, Xiamen, China
  • 7School of Statistics, RENMIN University of China, Beijing, China
  • 8AMITA Health Saint Joseph Hospital Chicago, Chicago, IL, United States
  • 9State Key Laboratory of Molecular Vaccinology and Molecular Diagnostics and Center for Molecular Imaging and Translational Medicine, School of Public Health, Xiamen University, Xiamen, China
  • 10Department of Mathematics, Physics and Interdisciplinary Studies, Guangzhou Laboratory, Guangzhou, Guangdong, China
  • 11Faculty of Medicine, Macau University of Science and Technology, Macao, Macao SAR, China

Objective: This study aimed at establishing a new model to predict malignant thyroid nodules using machine learning algorithms.

Methods: A retrospective study was performed on 274 patients with thyroid nodules who underwent fine-needle aspiration (FNA) cytology or surgery from October 2018 to 2020 in Xianyang Central Hospital. The least absolute shrinkage and selection operator (lasso) regression analysis and logistic analysis were applied to screen and identified variables. Six machine learning algorithms, including Decision Tree (DT), Extreme Gradient Boosting (XGBoost), Gradient Boosting Machine (GBM), Naive Bayes Classifier (NBC), Random Forest (RF), and Logistic Regression (LR), were employed and compared in constructing the predictive model, coupled with preoperative clinical characteristics and ultrasound features. Internal validation was performed by using 10-fold cross-validation. The performance of the model was measured by the area under the receiver operating characteristic curve (AUC), accuracy, precision, recall, F1 score, Shapley additive explanations (SHAP) plot, feature importance, and correlation of features. The best cutoff value for risk stratification was identified by probability density function (PDF) and clinical utility curve (CUC).

Results: The malignant rate of thyroid nodules in the study cohort was 53.2%. The predictive models are constructed by age, margin, shape, echogenic foci, echogenicity, and lymph nodes. The XGBoost model was significantly superior to any one of the machine learning models, with an AUC value of 0.829. According to the PDF and CUC, we recommended that 51% probability be used as a threshold for determining the risk stratification of malignant nodules, where about 85.6% of patients with malignant nodules could be detected. Meanwhile, approximately 89.8% of unnecessary biopsy procedures would be saved. Finally, an online web risk calculator has been built to estimate the personal likelihood of malignant thyroid nodules based on the best-performing ML-ed model of XGBoost.

Conclusions: Combining clinical characteristics and features of ultrasound images, ML algorithms can achieve reliable prediction of malignant thyroid nodules. The online web risk calculator based on the XGBoost model can easily identify in real-time the probability of malignant thyroid nodules, which can assist clinicians to formulate individualized management strategies for patients.

Introduction

The incidence of sonographically detected thyroid nodules is increasing in individuals; approximately 50% to 68% can be detected in healthy individuals. Most of these nodules are benign and asymptomatic (13), and only about 8% to 16% are malignant nodules (46). Due to the complexity and diversity of thyroid nodules, it is challenging for doctors to distinguish which nodules harbor clinically relevant malignancies (7). For more than 30 years, ultrasound and fine-needle aspiration (FNA) cytology were the traditional diagnostic methods as the cornerstones in the clinical management of patients with thyroid nodules (8).

FNA provides the most effective and practical diagnostic information for evaluating whether a nodule is malignant to reach a definitive diagnosis, which has traditionally been used to meet this purpose (9, 10). However, approximately 50% of all biopsied nodules proved to be benign and grew indolent with non-aggressive behavior (6, 11, 12). Moreover, biopsies in one out of seven thyroid nodules may not yield final cytological results and usually require repeated biopsies or additional evaluation (13). Obviously, it is not cost-effective to submit all these lesions to FNA.

As a non-invasive, low-cost, and convenient technique for thyroid nodule detection, ultrasound is widely accepted as the preferred imaging method for the diagnosis and monitoring of thyroid nodules. Therefore, ultrasonography has been considered as having a greater role in determining the need for FNA and follow-up planning (2, 7). In order to improve the accuracy of ultrasound-based diagnosis, various available ultrasound-based risk stratification systems have already been proposed by many national and international thyroid associations, such as the ACR TIRADS, the French TIRADS, the Korea-TIRADS, and the EU-TIRADS (14). The most commonly used thyroid nodule classification system is the Thyroid Imaging Reporting and Data System (TIRADS) developed by the American College of Radiology (ACR). However, the limitations of these systems include that subjective assessment of nodules (15) is inferior to the personal judgment by experts (8) and that different classification systems for the same thyroid nodules may yield varying results (16), which cannot be ignored. There is an urgent need to develop an improved and reliable diagnostic method to distinguish benign and malignant thyroid nodules, which could help reduce the number of unnecessary biopsies or diagnostic surgery without jeopardizing the detection of clinically relevant malignant thyroid nodules.

A predictive model based on machine learning (ML) algorithms, designed to “learn” from clinical and sonographic datasets and predict the nature of thyroid nodules, is in some cases more robust than human experts (17), and as a result, ML algorithms have been widely used to classify thyroid nodules objectively (1820). However, previous studies have classified thyroid nodules by analyzing thyroid ultrasound images. The purpose of the present study was to develop ML-ed models for predicting malignant lumps based on the database of clinical characteristics and ultrasound features of thyroid nodules confirmed by pathological examination in Chinese populations. Compared with using only image analysis, our ML-ed predictive models not only integrated ultrasound features but also included clinical features of patients with thyroid nodules, which may be more comprehensive and convenient, especially for clinicians and patients. It can carry out individualized treatment and management based on the received ultrasound reports. The new model obtained could be used to predict the malignant risk of thyroid nodules in individuals online via a web calculator.

Materials and methods

The retrospective study followed the tenets of the Declaration of Helsinki and was approved by the Ethics Committee of Xianyang Central Hospital (No. 2022-IRB-68). All the study participants provided written informed consent, which waived the requirement for informed patient consent because data for all subjects were anonymized.

Collection of patients

A total of 9,999 consecutive patients with thyroid nodules who underwent FNA cytology or surgical procedure at Xianyang Central Hospital from the year 2000 to 2020 were included in our study. All included participants met the following inclusion criteria: 1) a single thyroid nodule with a diameter of 5–50 mm, 2) complete clinical and ultrasonic data, and 3) all nodules with definite pathological confirmation. The exclusion criteria were 1) undistinguishable coalescent thyroid lesions and 2) pathology provided ambiguous diagnostic findings for their nodules. Please see Figure S1.

Collection of ultrasound data

Ultrasound images of thyroid glands and the surrounding areas were acquired by ultrasound machine with a linear array probe at Xianyang Central Hospital. The ultrasound images were performed independently by two ultrasonologists, with a senior ultrasonologist making the final decision on controversial patients. The following features of each nodule, including the size, shape, composition, echogenicity, margin, echogenicity, and cervical lymph node status, were carefully measured and recorded. Images of the thyroid are obtained according to ACR accreditation standards. Ultrasound features were divided according to the ACR TIRADS (3), and each feature had a corresponding score. The higher the score, the greater the malignant tendency. In the processing of statistical analysis, the ultrasound characteristics of each nodule were replaced with the corresponding scores in the ACR TIRADS. For example, taller-than-wide will be assigned 3 points, so we wrote the Arabic numeral 3 instead of taller-than-wide in Table 1.

TABLE 1
www.frontiersin.org

Table 1 The results of univariate and multivariable logistic regression.

The benign and malignant pathology of all thyroid nodules in all participants was confirmed by FNA or surgery. All pathological results were examined blindly and separately by two pathologists, with a final decision made by a senior pathologist.

Analysis strategy

In order to maximize the predictive performance and ultimately reduce overfitting, we used the least absolute shrinkage and selection operator (lasso) regression analysis to screen variables, followed by logistic analysis to identify independent risk factors for malignant nodules.

A total of six ML algorithms were developed in this study, including Decision Tree (DT), Extreme Gradient Boosting (XGBoost), Gradient Boosting Machine (GBM), Naive Bayes Classifier (NBC), Random Forest (RF), and Logistic Regression (LR), to predict malignant thyroid nodules based on the variables with multivariable logistic regression p-value less than 0.05. Models have been validated internally by using 10-fold cross-validation. Subsequently, the area under the receiver operating characteristic curve (AUC) values, accuracy, precision, recall, and F1 score have been calculated to measure and compare the performance of each model.

Because many machine learning algorithms are considered functional black boxes, their internal processes are not well understood. Given this issue, various interpretability methods have been proposed to assess the influence of variables on the predicted results (21, 22). For instance, the relative importance of variables, the Shapley additive explanations (SHAP) method, and the heat map of the correlation of features were employed to further visualize the interpretation of ML-ed models at the feature level. An optimal cutoff value for clinical application was determined by probability density functions (PDFs). Clinical utility curves (CUCs) were plotted to compare the net benefits of different thresholds.

The demographic and clinical characteristics of all included patients were analyzed by t-test and chi-square test via SPSS Statistics software (version 26.0, SPSS Inc., Chicago, IL, USA). Continuous and categorical variables are expressed as mean ± SD and frequency in this study. p-Values <0.05 were considered statistically significant with 95% confidence intervals (CIs) applied for all analyses. R software was applied for developing predictive models via the “rms” package and establishing a web risk calculator via the “shiny” package.

Results

Clinical and ultrasound characteristics

Of all 9,999 participants with thyroid nodules, 500 (50%) harbored malignant nodules, while 500 (50%) had benign disease based on the pathological diagnosis. We collected clinical features (age and gender) and recorded image features (thyroid nodule location, size, shape, composition, echogenicity, margin, echogenicity, and cervical lymph node status).

Selection of features

Eight of 15 variables were screened by lasso analysis into logistic regression analysis, and all statistically significant factors in the univariate logistic regression analysis were included in the multivariate logistic regression analysis. Finally, age, margin, shape, echogenic foci, echogenicity, and lymph nodes were identified as independent predictors of thyroid cancer. There was no significant statistical difference in the nodule location (laterality) and composition in the differentiation of benign and malignant thyroid nodules. The results of the univariate and multivariate analyses are demonstrated in Table 1.

Demographic baseline

A cohort of patients from Xianyang Central Hospital in China was enrolled in this study. Results of the t-test and the chi-square test indicated there was no statistically significant difference between the training and the validation cohorts at a 0.05 significance level.

Development and validation of ML-ed models

The six predictors identified in the differentiation of malignant and benign thyroid nodules were used to construct ML-ed models, including LR, NBC, DT, RF, GBM, and XGBoost, respectively, for predicting malignant thyroid nodules. The predictive performance of the six ML-ed models is shown in Figure 1. The whole cohort used 10-fold cross-validation in this study. All models had shown good performance in predicting malignant nodules. Their AUC values of XGBoost, LR, NBC, DT, RF, and GBM were 0.829, 0.821, 0.825, 0.759, 0.821, and 0.822, respectively, in the 10-fold cross-validation. The XGBoost model indicated the best performance than any of the others. Meanwhile, the XGBoost model also achieved the highest accuracy of 0.65 and precision of 0.63, as shown in Figure 2. Thus, the XGBoost was identified as our final predictive model in this study.

FIGURE 1
www.frontiersin.org

Figure 1 The results of the least absolute shrinkage and selection operator (lasso) regression analysis. The coefficients of all variables are reduced to 0 from instability to stability in (A) and obtain the model coefficient of λ value that minimizes the model deviation by cross-validation curve in (B).

FIGURE 2
www.frontiersin.org

Figure 2 The results of 10-fold cross-validation in the six models of LR, NBC, DT, RF, GBM, and XGBoost. The average AUC of XGBoost model is the highest one. LR, Logistic Regression; NBC, Naive Bayes Classifier; DT, Decision Tree; RF, Random Forest; GBM, Gradient Boosting Machine; XGBoost, Extreme Gradient Boosting; AUC, area under the receiver operating characteristic curve.

Explanation of model

To further illustrate the models at the feature level, a SHAP summary diagram was plotted to demonstrate how these features affect the presence of malignant thyroid nodules. The SHAP values for each feature plotted for each sample are shown in Figure 3. We concluded that margin, shape, echogenic foci, echogenicity, and lymph nodes exerted negative effects on predicting the risk of malignant thyroid nodules, whereas the risk of malignancy increases with age.

FIGURE 3
www.frontiersin.org

Figure 3 Circular bar plot. The performance of six models has been evaluated by five criteria of AUC, accuracy, precision, recall, and F1. AUC, area under the receiver operating characteristic curve.

Additionally, we ranked the importance of features in Figure 4 in order to explore the extent to which each independent risk factor contributed to the model. Although there were slight differences in the importance of each variable across models, the margin contributed most to the prediction of malignant nodules in most models. In the XGBoost model, the relative importance of variables decreased in the following order: margin, echogenic foci, lymph nodes, age, shape, and echogenicity. The correlation heat map indicated there was no linear correlation between the variables, and they harbor independent predictive power in clinical practice (Figure 5).

FIGURE 4
www.frontiersin.org

Figure 4 SHAP values of the selected features. The higher the SHAP value of each variable, the more impact and contribution to the model. SHAP, Shapley additive explanations.

FIGURE 5
www.frontiersin.org

Figure 5 Importance of the selected features. Importance of each feature had been demonstrated and compared in the six models of LR, NBC, DT, RF, GBM, and XGBoost. LR, Logistic Regression; (A) NBC, Naive Bayes Classifier; (B) DT, Decision Tree; (C) RF, Random Forest; (D) GBM, Gradient Boosting Machine; (E) XGBoost, Extreme Gradient Boosting (F).

Application of model

As illustrated in Figure 6, we recommend a threshold probability of 51% as the optimum cutoff value for the probability of malignant nodules. In this situation, we could detect 85.6% (red area under the blue line) of malignant nodules, while the number of biopsy procedures for benign nodules would be reduced by 89.8% (yellow area under the red line) in Figure 7. Finally, in order to facilitate the practical application of the model in clinical work, we embedded the best predictive model into a web risk calculator (Figure 8) that can easily derive the probability and risk stratification of patients with malignant nodules in real time. Figure S1 shows the flow chart of our current study.

FIGURE 6
www.frontiersin.org

Figure 6 The heat map of correlation of the selected features, including age, echogenicity, shape, margin, echogenic foci, and lymph nodes.

FIGURE 7
www.frontiersin.org

Figure 7 Probability density functions (A) and clinical utility curves (B) of the predictive model (0, benign modules group; 1, malignant nodules group). It is 85.6% (red area under the blue line) of malignant nodules, and the number of biopsy procedures for benign nodules was reduced by 89.8% (yellow area under the red line).

FIGURE 8
www.frontiersin.org

Figure 8 The application of the web risk calculator for patients with malignant nodules.

Discussion

As a highly prevalent disease, the incidence of thyroid nodules in China is 20%–35% (23), of which 7%–15% are malignant (1). It is a challenge for clinicians to distinguish malignant from benign nodules. Hence, we propose to develop a predictive model based on machine learning for assessing the malignancy risk of thyroid nodules in the Chinese population. To our knowledge, this is the first ML-ed predictive model to predict malignant thyroid nodules by integrating clinical and ultrasound features. Our model was constructed using six variables, including clinical characteristics (age) and ultrasound features (margin, shape, echogenic foci, echogenicity, and lymph nodes). Our findings indicated that the proposed model could detect malignant thyroid nodules accurately and reduce unnecessary biopsies by estimating risk stratification. Finally, through a convenient and practical web application, our model can assist doctors and patients to carry out precise and individualized management of thyroid nodules.

We found that age played an adverse role in the risk of malignancy as a predictive parameter in our model. This finding was consistent with that of Chen et al., where there was no increased risk of malignancy in those aged 28–63 in the same population of a Chinese cohort (24). In addition, a similar finding was revealed in a US cohort of 196 patients with malignant FNA cytology, and the incidence of malignant thyroid nodules in patients under 45 years old was twice that in those over 45 years old (8.1% vs. 4.0%, p < 0.001) (25). Similarly, Italian scholars have reported that cytology suspicious or indicative of papillary thyroid cancer is associated with younger age (26). However, our result was contrary to what Belfifiore et al. reported that thyroid cancer is more common in elderly patients (27). Different views on the association between age and the incidence of malignant thyroid nodules deserve further exploration.

Previous findings unveiled that the abnormal cervical lymph nodes may indicate malignant nodule metastasis (28), which is consistent with our study. It is reported that 30%–80% of patients with thyroid cancer have cervical lymph node metastasis (29). Cervical lymph nodes are usually not palpable because of their deep location and small size. Ultrasound has demonstrated its high sensitivity and specificity for the assessment of non-palpable lymph nodes (30). Cervical lymph nodes may enlarge as a result of a benign process, such as reactive hyperplasia due to inflammation in submandibular and upper cervical nodes (29). However, most investigators agree on the sonographic features of metastatic lymph nodes in thyroid cancer, including cystic degeneration, a rounded shape, loss of echogenic hilum, hypoechoic or hyperechoic mass, and calcification (3133). The cervical lymph node is the first metastatic site of malignant nodules. Thyroid nodules should be highly suspected as malignant when abnormal lymph nodes are observed.

Meanwhile, margin ranked first in the importance of features and contributed the most to our model. We found that lobulated or irregular margins and extensive extrathyroidal extension detected by ultrasound increased the risk of malignancy risk in nodules. A lobulated or irregular margin is defined as a spiculated or jagged edge. Some studies have revealed that an irregular or microlobulated margin suggests malignancy (34, 35). Extensive extrathyroidal extension refers to a frank invasion of adjacent soft tissue or vascular, which is a highly reliable characteristic of malignancy and also has a negative effect on prognosis (36).

Furthermore, another feature we found that increased the risk of malignant nodules was the shape of the nodules. As first observed by Kim et al. and subsequently confirmed in a series of studies (35, 3740), a lump with a shape taller-than-wide is another useful predictor of malignancy. These results may be associated with the growth pattern. It is found that the growth of benign nodules remains within normal tissue planes, so the shape of benign nodules can be ovoid to round, whereas malignant nodules grow centrifugally through the normal tissue plane (38, 39). In the ACR TIRADS, taller-than-wide was assigned 3 points in the TIRADS, and our results have confirmed the high-risk role of shape in the malignancy of thyroid nodules.

Notwithstanding, echogenic foci, and echogenicity were regarded as predictive variables in both our predictive model and the ACR TIRADS; there were different opinions on assessing thyroid nodules on some features. Our results showed the presence of macrocalcifications and peripheral (rim) calcifications had no statistical difference between malignant and benign nodules. However, macrocalcifications and peripheral (rim) calcifications could be assigned 1 and 2 points in the TIRADS, respectively. Macrocalcifications refer to coarse echogenic foci accompanied by acoustic shadowing. Evidence in published data describing their correlation with an increased malignancy risk is weak (41); additionally, the relationship between macrocalcifications and nodules lacking other malignant characteristics is also mixed (42, 43). Peripheral (rim) calcifications lie along all or part of the nodule’s margin. Compared with macrocalcifications, they are more strongly correlated with malignancy (41), but several studies suggested that their correlation with malignancy is variable (43).

The statistical results of logistic regression showed that patients with punctate echogenic foci had a higher tendency to develop malignant nodules. Punctate echogenic foci are smaller in size and less shadowed than macrocalcifications and may correspond to the psammomatous calcifications associated with papillary cancers in the solid components of thyroid nodules. Histologically, punctate echogenic foci are smaller and less shadowed than macrocalcifications, which are considered highly positive associations with malignancy, especially in combination with other suspicious features (3, 5).

Echogenicity refers to a nodule’s reflectivity relative to adjacent tissue. Except for the thyroid parenchyma, which is usually used as reference tissue, the neck strap muscles with very low echogenicity are also used as the basis for comparison. Previously, several studies investigated that a higher degree of hypoechogenicity was highly suggestive of malignancy, with a specificity of 92%–94% (37, 44). Interestingly, a higher degree of hypoechogenicity harbors no statistical significance for predicting malignant nodules based on our results of multivariable logistic regression. These results need more evidence to verify.

Another important finding of our paper was composition, which was not an independent predictor of malignant nodules. Thyroid nodules that are cystic or almost completely cystic have no score in the ACR TIRADS because they are highly correlated with benign cytology, and only 13%–26% of thyroid cancers harbor a cystic component (29, 4446). Spongiform, composed predominantly (>50%) of small cystic spaces, is considered a sign of benignity with high specificity (44). In our study, we found that no patient had cystic or almost completely cystic or spongiform ultrasound features. Additionally, according to ACR TIRADS, mixed cystic and solid, and solid or almost completely solid are the risk factors for malignant nodules, with scores of 1 and 2, respectively (3). Solid nodules with an eccentric configuration and acute angle are suspected to be malignant (47), whereas these conclusions were not observed in our study.

The differences between our study and the risk stratification system also illustrate the inadequacy of the classification system to evaluate thyroid nodules, such as interobserver variation/a subjective assessment of the nodules (8, 48). Therefore, it is necessary to add clinical data to further improve the accuracy and objectivity of the predictive model. As Chen et al. described in their literature, the predictive power of the new model was superior to that of ACR TIRADS when age is included. Furthermore, our study cohort enrolled retrospectively Chinese patients from a single medical center; these differences therefore may be due to demographic differences and healthcare disparities between patients in the USA and China.

Therefore, these differences may be due to the mismatch between the classification system and the current medical situation in China. It is more rational to apply a risk stratification system according to the population. Accordingly, Zhou et al. formulated Chinese guidelines for ultrasound malignancy risk stratification of thyroid nodules (C-TIRADS) that are specific to China’s national and medical conditions (23).

We offered clinicians and patients an online web application for estimating the risk of a malignant thyroid nodule using the XGBoost model, which combined six variables including age, lymph nodes, margin, shape, echogenic foci, and echogenicity. By inputting the corresponding personalized parameters of patients, visitors can quickly obtain the corresponding malignancy risk. The link is as follows: https://share.streamlit.io/liuwencai6/thyroid_final/main/thyroid_final.py.

Depending on the cutoff value in the PDF and CUC, we recommended 51% as the threshold probability of the next management strategy and risk stratification. In that case, about 85.6% of patients with malignant thyroid nodules can be detected, FNA was recommended, and careful follow-up and possibly early surgery should be considered. Moreover, we could also save approximately 89.8% of unnecessary biopsy procedures in low-risk populations (malignant risk ≤51%). This result is consistent with the main goal of all currently available sonographic risk stratification systems, that is, to eliminate unnecessary thyroid biopsies without endangering the diagnosis of clinically malignant nodules (15). We believe that the incorporation of our predictive model into clinical practice will improve the diagnostic accuracy of malignant thyroid nodules and minimize the number of unnecessary FNA in low-risk patients with thyroid nodules. Compared with existing models (49, 50), the performances are good but various due to differences in population and dataset.

There are several limitations of this existing study. First, the retrospective nature of this study may have resulted in potential bias. Second, the ultrasound features were read and provided by sonographers rather than captured directly from the ultrasonic images, which may cause bias in data quality due to extraction and interpretation. We strongly recommend that the machine learning model be used to extract the features from ultrasonic images directly and from several types of machines in future studies. In addition, all patients and ultrasound assessments are derived from a single medical center, which may restrict the accuracy; large-scale multicenter cohorts and external validation would be more forceful. Finally, we classified the features of nodules by ACR TIRADS, rather than the Chinese-TIRADS proposed by the Chinese professional society, to evaluate ultrasound parameters (23). We will expand our cohort and dataset in a further study to optimize our model and algorithm in the future.

Conclusion

In conclusion, our study yielded a machine learning-based model combining age with ultrasound parameters, including shape, margin, echogenic foci, echogenicity, and lymph nodes, to predict the presence of malignant thyroid nodules. Our model showed good performance and was embodied in a web risk calculator to estimate the risk of malignant thyroid nodules.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

This study has been approved by the Ethics Committee of Xianyang Central Hospital, Ethics No. 2022-IRB-68.

Author contributions

CLY, WK, and WLL designed the article. YWL, CX, and BW collected and evaluated the data. WYL and WLL wrote the first draft of the manuscript. All authors reviewed the manuscript. All authors contributed to the interpretation of the results. WYL,JQF and WCL wrote the final draft of the manuscript. YWL CYH and YYC read and approved the final version of the manuscript. All authors contributed to the article and approved the submitted version.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2022.968784/full#supplementary-material

References

1. Haugen BR, Alexander EK, Bible KC, Doherty GM, Mandel SJ, Nikiforov YE, et al. 2015 American thyroid association management guidelines for adult patients with thyroid nodules and differentiated thyroid cancer: The American thyroid association guidelines task force on thyroid nodules and differentiated thyroid cancer. Thyroid (2016) 26(1):1–133. doi: 10.1089/thy.2015.0020

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Gharib H, Papini E, Garber JR, Face D, Face R, Hegedus L, et al. American Association of clinical endocrinologists, American college of endocrinology, and associazione Medici endocrinologi medical guidelines for clinical practice for the diagnosis and management of thyroid nodules–2016 update. Endocr Pract (2016) 22(5):622–39. doi: 10.4158/EP161208.GL

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Tessler FN, Middleton WD, Grant EG, William D, Grant Edward G. ACR thyroid imaging, reporting and data system (TI-RADS): White paper of the ACR TI-RADS committee. J Am Coll Radiol (2017) 14(5):587–95. doi: 10.1016/j.jacr.2017.01.046

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Burman KD, Wartofsky L. CLINICAL PRACTICE. thyroid nodules. N Engl J Med (2015) 373(24):2347–56. doi: 10.1056/NEJMcp1415786

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Rago T, Vitti P. Risk stratification of thyroid nodules: From ultrasound features to TIRADS. Cancers (Basel) (2022) 14(3):717. doi: 10.3390/cancers14030717

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Smith-Bindman R, Lebda P, Feldstein VA, Sellami D, Goldstein RB, Brasic N, et al. Risk of thyroid cancer based on thyroid ultrasound imaging characteristics: results of a population-based study. JAMA Intern Med (2013) 173(19):1788–96. doi: 10.1001/jamainternmed.2013.9245

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Lamartina L, Deandreis D, Durante C, Filetti S. ENDOCRINE TUMOURS- imaging in the follow-up of differentiated thyroid cancer- current evidence and future perspectives for a risk-adapted approach. Eur J Endocrinol (2016) 175(5):R185–202. doi: 10.1530/EJE-16-0088

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Solymosi T, Hegedus L, Bonnema SJ, Frasoldati A, Jambor L, Kovacs GL, et al. Ultrasound-based indications for thyroid fine-needle aspiration: Outcome of a TIRADS-based approach versus operators' expertise. Eur Thyroid J (2021) 10(5):416–24. doi: 10.1159/000511183

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Gharib H, Papini E, Paschke R, Duick DS, Valcavi R, Hegedüs L, et al. American Association of clinical endocrinologists, associazione Medici endocrinologi, and EuropeanThyroid association medical guidelines for clinical practice for the diagnosis and management of thyroid nodules. Endocr Pract (2010) 16 Suppl 1:1–43. doi: 10.4158/EP.16.3.468

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Singh Ospina N, Brito JP, Maraka S, Espinosa de Ycaza AE, Rodriguez-Gutierrez R, Gionfriddo MR, et al. Diagnostic accuracy of ultrasound-guided fine needle aspiration biopsy for thyroid malignancy: systematic review and meta-analysis. Endocrine (2016) 53(3):651–61. doi: 10.1007/s12020-016-0921-x

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Davies L, Welch HG. Current thyroid cancer trends in the united states. JAMA Otolaryngol Head Neck Surg (2014) 140(4):317–22. doi: 10.1001/jamaoto.2014.1

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Bongiovanni M, Spitale A, Faquin WC, Mazzucchelli L, Baloch ZW. The Bethesda system for reporting thyroid cytopathology: a meta-analysis. Acta Cytol (2012) 56(4):333–9. doi: 10.1159/000339959

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Ali SZ, Siperstein A, Sadow PM, Golding AC, Kennedy GC, Kloos RT, et al. Extending expressed RNA genomics from surgical decision making for cytologically indeterminate thyroid nodules to targeting therapies for metastatic thyroid cancer. Cancer Cytopathol (2019) 127(6):362–9. doi: 10.1002/cncy.22132

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Swan KZ, Thomas J, Nielsen VE, Jespersen ML, Bonnema SJ. External validation of AIBx, an artificial intelligence model for risk stratification, in thyroid nodules. Eur Thyroid J (2022) 11(2):e210129. doi: 10.1530/ETJ-21-0129

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Grani G, Lamartina L, Ascoli V, Bosco D, Biffoni M, Giacomelli L, et al. Reducing the number of unnecessary thyroid biopsies while improving diagnostic accuracy: Toward the "Right" TIRADS. J Clin Endocrinol Metab (2019) 104(1):95–102. doi: 10.1210/jc.2018-01674

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Huang BL, Ebner SA, Makkar JS, Bentley-Hibbert S, McConnell RJ, Lee JA, et al. A multidisciplinary head-to-Head comparison of American college of radiology thyroid imaging and reporting data system and American thyroid association ultrasound risk stratification systems. Oncologist (2020) 25(5):398–403. doi: 10.1634/theoncologist.2019-0362

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Oberije C, Nalbantov G, Dekker A, Boersma L, Borger J, Reymen B, et al. A prospective study comparing the predictions of doctors versus models for treatment outcome of lung cancer patients: a step toward individualized care and shared decision making. Radiother Oncol (2014) 112(1):37–43. doi: 10.1016/j.radonc.2014.04.012

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Buda M, Wildman-Tobriner B, Hoang JK, Thayer D, Tessler FN, Middleton WD, et al. Management of thyroid nodules seen on US images: Deep learning may match performance of radiologists. Radiology (2019) 292(3):695–701. doi: 10.1148/radiol.2019181343

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Thomas J, Haertling T. AIBx, artificial intelligence model to risk stratify thyroid nodules. Thyroid (2020) 30(6):878–84. doi: 10.1089/thy.2019.0752

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Guan Q, Wang Y, Du J, Qin Y, Lu H, Xiang J, et al. Deep learning based classification of ultrasound images for thyroid nodules: a large scale of pilot study. Ann Transl Med (2019) 7(7):137. doi: 10.21037/atm.2019.04.34

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Souza LR, Colonna JG, Comodaro JM, Naveca FG. Using amino acids co-occurrence matrices and explainability model to investigate patterns in dengue virus proteins. BMC Bioinf (2022) 23(1):80. doi: 10.1186/s12859-022-04597-y

CrossRef Full Text | Google Scholar

22. Lundberg S, Lee SI. A unifed approach to interpreting model predictions. Adv Neural Inf Process Syst (2017) 30:4765–74. doi: 10.5555/3295222.3295230

CrossRef Full Text | Google Scholar

23. Zhou J, Yin L, Wei X, Zhang S, Song Y, Luo B, et al. Chinese Guidelines for ultrasound malignancy risk stratification of thyroid nodules: the c-TIRADS. Endocrine (2020) 70(2):256–79. doi: 10.1007/s12020-020-02441-y

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Chen L, Zhang J, Meng L, Lai Y, Huang W. A new ultrasound nomogram for differentiating benign and malignant thyroid nodules. Clin Endocrinol (Oxf) (2019) 90(2):351–9. doi: 10.1111/cen.13898

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Bessey LJ, Lai NB, Coorough NE, Chen H, Sippel RS. The incidence of thyroid cancer by fine needle aspiration varies by age and gender. J Surg Res (2013) 184(2):761–5. doi: 10.1016/j.jss.2013.03.086

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Rago T, Fiore E, Scutari M, Santini F, Di Coscio G, Romani R, et al. Male Sex, single nodularity, and young age are associated with the risk of finding a papillary thyroid cancer on fine-needle aspiration cytology in a large series of patients with nodular thyroid disease. Eur J Endocrinol (2010) 162(4):763–70. doi: 10.1530/EJE-09-0895

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Belfiore A, La Rosa GL, La Porta GA, Giuffrida D, Milazzo G, Lupo L, et al. Cancer risk in patients with cold thyroid nodules: relevance of iodine uptake, sex, age, and multinodularity. Am J Med (1992) 93:363–9. doi: 10.1016/0002-9343(92)90164-7

PubMed Abstract | CrossRef Full Text | Google Scholar

28. AIUM practice parameter for the performance of a thyroid and parathyroid ultrasound examination. J Ultrasound Med (2016) 35:1–11. doi: 10.7863/ultra.35.9.1-c

CrossRef Full Text | Google Scholar

29. Patel NU, McKinney K, Kreidler SM, Bieker TM, Russ P, Roberts K, et al. Ultrasound-based clinical prediction rule model for detecting papillary thyroid cancer in cervical lymph nodes: A pilot study. J Clin Ultrasound (2016) 44(3):143–51. doi: 10.1002/jcu.22309

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Snozek CL, Chambers EP, Reading CC, Sebo TJ, Sistrunk JW, Singh RJ, et al. Serum thyroglobulin, high-resolution ultrasound, and lymph node thyroglobulin in diagnosis of differentiated thyroid carcinoma nodal metastases. J Clin Endocrinol Metab (2007) 92(11):4278–81. doi: 10.1210/jc.2007-1075

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Sohn YM, Kwak JY, Kim EK, Moon HJ, Kim SJ, Kim MJ. Diagnostic approach for evaluation of lymph node metastasis from thyroid cancer using ultrasound and fine-needle aspiration biopsy. AJR Am J Roentgenol (2010) 194(1):38–43. doi: 10.2214/AJR.09.3128

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Langer JE, Mandel SJ. Sonographic imaging of cervical lymph nodes in patients with thyroid cancer. Neuroimaging Clin N Am (2008) 18(3):479–489, vii-viii. doi: 10.1016/j.nic.2008.03.007

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Kamaya A, Gross M, Akatsu H, Jeffrey RB. Recurrence in the thyroidectomy bed: sonographic findings. AJR Am J Roentgenol (2011) 196(1):66–70. doi: 10.2214/AJR.10.4474

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Papini E, Guglielmi R, Bianchini A, Crescenzi A, Taccogna S, Nardi F, et al. Risk of malignancy in nonpalpable thyroid nodules: Predictive value of ultrasound and color-Doppler features. J Clin EndocrinolMetab (2002) 87:941–1946. doi: 10.1210/jcem.87.5.8504

CrossRef Full Text | Google Scholar

35. Kim EK, Park CS, Chung WY, Oh KK, Kim DI, Lee JT, et al. New sonographic criteria for recommending fine-needle aspiration biopsy of nonpalpable solid nodules of the thyroid. AJR Am J Roentgenol (2002) 178(3):687–91. doi: 10.2214/ajr.178.3.1780687

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Hoang JK, Lee WK, Lee M, Johnson D, Farrell S. US Features of thyroid malignancy: pearls and pitfalls. Radiographics (2007) 27:847–60. doi: 10.1148/rg.273065038

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Moon WJ, Jung SL, Lee JH, Na DG, Baek JH, Lee YH, et al. Thyroid study group, Korean society of neuro- and head and neck radiology. benign and malignant thyroid nodules: US differentiation–multicenter retrospective study. Radiology (2008) 247(3):762–70. doi: 10.1148/radiol.2473070944

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Cappelli C, Castellano M, Pirola I, Gandossi E, De Martino E, Cumetti D, et al. Thyroid nodule shape suggests malignancy. Eur J Endocrinol (2006) 155(1):27–31. doi: 10.1530/eje.1.02177

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Alexander EK, Marqusee E, Orcutt J, Benson CB, Frates MC, Doubilet PM, et al. Thyroid nodule shape and prediction of malignancy. Thyroid (2004) 14(11):953–8. doi: 10.1089/thy.2004.14.953

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Na DG, Baek JH, Sung JY, Kim JH, Kim JK, Choi YJ, et al. Thyroid imaging reporting and data system risk stratification of thyroid nodules: Categorization based on solidity and echogenicity. Thyroid (2016) 26(4):562–72. doi: 10.1089/thy.2015.0460

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Arpaci D, Ozdemir D, Cuhaci N, Dirikoc A, Kilicyazgan A, Guler G, et al. Evaluation of cytopathological findings in thyroid nodules with macrocalcification: macrocalcification is not innocent as it seems. Arq Bras Endocrinol Metabol (2014) 58(9):939–45. doi: 10.1590/0004-2730000003602

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Na DG, Kim DS, Kim SJ, Ryoo JW, Jung SL. Thyroid nodules with isolated macrocalcification: malignancy risk and diagnostic efficacy of fine-needle aspiration and core needle biopsy. Ultrasonography (2016) 35(3):212–9. doi: 10.14366/usg.15074

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Kim MJ, Kim EK, Kwak JY, Park CS, Chung WY, Nam KH, et al. Differentiation of thyroid nodules with macrocalcifications: role of suspicious sonographic findings. J Ultrasound Med (2008) 27(8):1179–84. doi: 10.7863/jum.2008.27.8.1179

PubMed Abstract | CrossRef Full Text | Google Scholar

44. Bonavita JA, Mayo J, Babb J, Bennett G, Oweity T, Macari M, et al. Pattern recognition of benign nodules at ultrasound of the thyroid: which nodules can be left alone? AJR Am J Roentgenol (2009) 193(1):207–13. doi: 10.2214/AJR.08.1820

PubMed Abstract | CrossRef Full Text | Google Scholar

45. Chan BK, Desser TS, McDougall IR, Weigel RJ, Jeffrey RB Jr. Common and uncommon sonographic features of papillary thyroid carcinoma. J Ultrasound Med (2003) 22:1083–90. doi: 10.7863/jum.2003.22.10.1083

PubMed Abstract | CrossRef Full Text | Google Scholar

46. Russ G. Risk stratification of thyroid nodules on ultrasonography with the French TI-RADS: description and reflections. Ultrasonography (2016) 35(1):25–38. doi: 10.14366/usg.15027

PubMed Abstract | CrossRef Full Text | Google Scholar

47. Kim DW, Park JS, In HS, Choo HJ, Ryu JH, Jung SJ. Ultrasound-based diagnostic classification for solid and partially cystic thyroid nodules. AJNR Am J Neuroradiol (2012) 33(6):1144–9. doi: 10.3174/ajnr.A2923

PubMed Abstract | CrossRef Full Text | Google Scholar

48. Persichetti A, Di Stasio E, Coccaro C, Graziano F, Bianchini A, Di Donna V, et al. Inter- and intraobserver agreement in the assessment of thyroid nodule ultrasound features and classification systems: A blinded multicenter study. Thyroid (2020) 30(2):237–42. doi: 10.1089/thy.2019.0360

PubMed Abstract | CrossRef Full Text | Google Scholar

49. Zhao CK, Ren TT, Yin YF, Shi H, Wang HX, Zhou BY, et al. A comparative analysis of two machine learning-based diagnostic patterns with thyroid imaging reporting and data system for thyroid nodules: diagnostic performance and unnecessary biopsy rate. Thyroid (2021) 31(3):470–81. doi: 10.1089/thy.2020.0305

PubMed Abstract | CrossRef Full Text | Google Scholar

50. Zhang B, Tian J, Pei S, Chen Y, He X, Dong Y, et al. Machine learning–assisted system for thyroid nodule diagnosis. Thyroid (2019) 29(6):858–67. doi: 10.1089/thy.2018.0380

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: thyroid nodules, malignant, machine learning, predictive model, web calculator

Citation: Li W, Hong T, Fang J, Liu W, Liu Y, He C, Li X, Xu C, Wang B, Chen Y, Sun C, Li W, Kang W and Yin C (2022) Incorporation of a machine learning pathological diagnosis algorithm into the thyroid ultrasound imaging data improves the diagnosis risk of malignant thyroid nodules. Front. Oncol. 12:968784. doi: 10.3389/fonc.2022.968784

Received: 14 June 2022; Accepted: 21 October 2022;
Published: 08 December 2022.

Edited by:

Shahid Mumtaz, Instituto de Telecomunicações, Portugal

Reviewed by:

Chong-Ke Zhao, Fudan University, China
Abdul Rehman Javed, Air University, Pakistan
Thippa Reddy Gadekallu, VIT University, India

Copyright © 2022 Li, Hong, Fang, Liu, Liu, He, Li, Xu, Wang, Chen, Sun, Li, Kang and Yin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Chengliang Yin, Y2hlbmdsaWFuZ3lpbkAxNjMuY29t; Wei Kang, a2FudmU4MjJAaG90bWFpbC5jb20=; Wenle Li, ZHJsZWUwOTEwQDE2My5jb20=

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.