- 1Sir Run Run Shaw Hospital, School of Medicine, Zhejiang University, Hangzhou, China
- 2Second Affiliated Hospital, School of Medicine, Zhejiang University, Hangzhou, China
Purpose: The aim of this study was to investigate the value of S-Detect for predicting the malignant risk of cytologically indeterminate thyroid nodules (CITNs).
Methods: The preoperative prediction of 159 CITNs (Bethesda III, IV and V) were performed using S-Detect, Thyroid Imaging Reporting and Data System of American College of Radiology (ACR TI-RADS) and Chinese TI-RADS (C-TIRADS). First, Linear-by-Linear Association test and chi-square test were used to analyze the malignant risk of CITNs. McNemar’s test and receiver operating characteristic curve were used to compare the diagnostic efficacy of S-Detect and the two TI-RADS classifications for CITNs. In addition, the McNemar’s test was used to compare the diagnostic accuracy of the above three methods for different pathological types of nodules.
Results: The maximum diameter of the benign nodules was significantly larger than that of malignant nodules [0.88(0.57-1.42) vs 0.57(0.46-0.81), P=0.002]. The risk of malignant CITNs in Bethesda system and the two TI-RADS classifications increased with grade (all P for trend<0.001). In all the enrolled CITNs, the diagnostic results of S-Detect were significantly different from those of ACR TI-RADS and C-TIRADS, respectively (P=0.021 and P=0.007). The sensitivity and accuracy of S-Detect [95.9%(90.1%-98.5%) and 88.1%(81.7%-92.5%)] were higher than those of ACR TI-RADS [87.6%(80.1%-92.7%) and 81.8%(74.7%-87.3%)] (P=0.006 and P=0.021) and C-TIRADS [84.3%(76.3%-90.0%) and 78.6%(71.3%-84.5%)] (P=0.001 and P=0.001). Moreover, the negative predictive value and the area under curve value of S-Detect [82.8% (63.5%-93.5%) and 0.795%(0.724%-0.855%)] was higher than that of C-TIRADS [54.8%(38.8%-69.8%) and 0.724%(0.648%-0.792%] (P=0.024 and P=0.035). However, the specificity and positive predictive value of S-Detect were similar to those of ACR TI-RADS (P=1.000 and P=0.154) and C-TIRADS (P=1.000 and P=0.072). There was no significant difference in all the evaluated indicators between ACR TI-RADS and C-TIRADS (all P>0.05). The diagnostic accuracy of S-Detect (97.4%) for papillary thyroid carcinoma (PTC) was higher than that of ACR TI-RADS (90.4%) and C-TIRADS (87.8%) (P=0.021 and P=0.003).
Conclusion: The diagnostic performance of S-Detect in differentiating CITNs was similar to ACR TI-RADS and superior to C-TIRADS, especially for PTC.
Introduction
Thyroid cancer is the most frequent endocrine cancer worldwide and ranks the ninth for incidence among all malignancies (1). The incidence of thyroid cancer has increased over the past few decades in many countries (1, 2). Nowadays, the thyroid nodule (TN) has become one of the most common causes of seeking medical advice in daily clinical practice. Ultrasonography (US) has been used as the preferred imaging method for detecting and diagnosing TNs in the routine clinical setting (3). The increase of incidence of TN is likely caused to increased imaging usage, which increases from 4-7% by palpation to 19-67% by US in asymptomatic population (4). In addition, US could be used to reveal information about the margin, shape, composition and echogenicity, and to make a differential diagnosis of TNs (3, 4). Many systems have been established to predict the malignant risk of TNs based on US characteristics, such as Thyroid Imaging Reporting and Data System of American College of Radiology (ACR TI-RADS) (5) and Chinese TI-RADS (C-TIRADS) (6), and are useful for describing thyroid disease (7).
At present, US-guided fine-needle aspiration (FNA) is considered as the standard workup for judging benign or malignant TNs preoperatively (8), but FNA still can not reliably give definitive results in about 10-20% of TNs (9–11). The results of FNA are presented by using the Bethesda System for Reporting Thyroid Cytopathology (TBSRTC) (12). There are six categories in this system. Bethesda I (Nondiagnostic or unsatisfactory) TNs are usually required repeated FNA. Bethesda II (Benign) and VI (Malignant) TNs will be managed without any confusion. For TNs with Bethesda IV (Follicular neoplasm or Suspicious for a follicular neoplasm) and V (Suspicious for malignancy), preoperative molecular testing (Bethesda IV) or surgical procedure (Bethesda V) is suggested (12). But there is still the possibility that postoperative specimens are benign lesions, leading to over-diagnosis and over-treatment (13). The nature of TNs with Bethesda III (Atypia of undetermined significance or Follicular lesion of undetermined significance) is also uncertain, which even accounts for about 15% at the first FNA (10, 11). Therefore, the above three categories (Bethesda III, IV and IV) of TNs can be considered as the cytologically indeterminate TNs (CITNs), which lead to confusion in clinical practice. As benign TNs have little effects on patient’s health and could be managed conservatively, efforts to better select candidates for operation are necessary.
For further evaluation of CITNs, repeated FNA, molecular testing, or diagnostic thyroidectomy may be arranged for the patient. However, these also remain a challenge. There are 20-28% patients with Bethesda III category presented as CITNs again on the second FNA (14). The risk of malignancy in nodules with Bethesda III category sent directly to surgery is only 15.7% (15). Although BRAF mutation testing could be used as a supplement to routine cytology, its value in improving the preoperative diagnostic rate for malignancy is limited (16). As mentioned above, the findings of US could be used to predict malignancy, especially when cytology is not conclusive. The predicted value of several stratified systems that are based on US features (such as ACR TI-RADS and C-TIRADS) for assessing CITNs has been investigated in previous studies (17, 18). However, these classifications might be intricate for inexperienced physicians (19).
In recent years, computer-aided diagnosis system using artificial intelligence (AI-CADS) based on US image analysis techniques has been developed and introduced to the clinical application of US medicine (20–23). Its utility has been preliminarily verified in the detection of breast and thyroid malignant nodules with satisfactory results (21, 22). Many previous studies have shown that AI-CADS for the classification of TNs can achieve accuracy comparable to senior sonologists, reduce operator dependency, and provide second advice in imaging diagnosis (22–24). As a kind of AI-CADS, S-Detect software (Samsung Ultrasound RS80A, Samsung Medison Co. Ltd., Seoul, South Korea) is an image-analytic program developed on the basis of statistical data mining algorithm, which can directly judge the nature of TNs in ultrasound images and achieve the consistency of the US features of the same nodules (25, 26). Previous studies have shown that S-Detect has reliable diagnostic performance for TNs with Bethesda II and VI categories (26–28). However, as far as we know, there is no relevant study on the performance of S-Detect in differentiating CITNs.
Therefore, the purpose of this study was to investigate the value of S-Detect based on US images for predicting the malignant risk of CITNs by comparing with the two TI-RADS classifications in CITNs.
Materials and methods
This retrospective study was performed in line with the principles of the Declaration of Helsinki. Approval was approved by the Ethics Committee of our institution (No. 20201217-38). Informed consent was obtained from each of the participants prior to the data collection for scientific research and potential publication of their anonymized images.
Patients
Between January 2020 and December 2021, the imaging data of 536 CITNs from 497 patients who underwent thyroidectomy in our hospital were collected retrospectively. Excluding cases without thyroidectomy (358 CITNs) and incomplete imaging data (19 CITNs), 145 patients with 159 CITNs were finally enrolled (30 males and 115 females) in the study. The mean age was 46.09 ± 12.09 years (range from 18 to 76 years) and the mean size of the maximum diameter of the TNs was 0.97 ± 0.97 cm (range from 0.24 to 6.63 cm).
The inclusion criteria were as follows: (1) the result of FNA was indeterminate (Bethesda III, IV and V categories); (2) underwent thyroidectomy in our hospital and with clear postoperative pathological diagnosis (confirmed by two experienced pathologists); (3) the US examination and S-Detect were performed before FNA, and the time interval between US and surgery was less than 3 weeks. The exclusion criteria were as follows: (1) the result of FNA was Bethesda I, II or VI category; (2) TNs with repeated FNA; (3) incomplete ultrasound images and clinical date.
S-Detect software for TNs
In this study, a US machine (Samsung Ultrasound RS80A, Samsung Medison Co. Ltd., Seoul, South Korea) with a 3-12 MHz linear array probe was used. The S-Detect was integrated into the US system. The sonologists opened the S-Detect interface and entered the optimal preset for thyroid. First, longitudinal and transverse scans were performed to the suspected TNs, and the maximum diameter of the TNs were taken as the representative images for S-Detect and two TI-RADS classifications. The contours of the TNs could then be drawn automatically by the S-Detect software or manually when needed. S-Detect evaluated US characteristics of TNs, including composition, echogenicity, orientation, margin, shape and spongiform status, and gave two diagnostic opinions: possibly benign and possibly malignant. Finally, all image data were stored. The US images were collected by two sonologists with 6 and 9 years of experience in the thyroid US who were blinded to the pathology results.
ACR TI-RADS and C-TIRADS
According to the 2017 ACR TI-RADS publication (5), each US feature received point(s) and the individual points summed up. The classification was arranged based on the total score (TR1: 0 point; TR2: 2 points; TR3: 3 points; TR4: 4-6 points; TR5: ≥7 points). US characteristics include composition, echogenicity of solid components, shape, margin and calcification (Table 1).
Table 1 Thyroid Imaging Reporting and Data System of American College of Radiology (5).
The TNs were assessed according to C-TIRADS proposed by Zhou (6). The category was established by counting US features, including solid composition, microcalcifications, markedly hypoechoic, irregular margin/ill-defined or extrathyroidal extension, vertical orientation and Comet tail artifacts (subtraction point). The classification ranged from TR1 to TR6 (TR1: no nudule; TR2: -1 point; TR3: 0 point; TR4A: 1 point; TR4B: 2 points; TR4C: 3-4 points; TR5: 5 points; TR6: biopsy proved malignant) (Table 2).
Table 2 Chinese thyroid imaging reporting and data system (6).
In this study, the postoperative pathology of TNs was taken as the gold standard, and the cut-off values were obtained by receiver operating characteristic (ROC) curves analysis of the results of the two TI-RADS classifications.
Image interpretation
Firstly, all indicators of ACR TI-RADS and C-TIRADS were listed separately and arranged uniformly (Table 3). Then, two sonologists with more than 10 years of experience in thyroid US assessed the enrolled images of CITNs independently and ticked the indicators in the images respectively. If no consensus was reached, arbitration from another sonologist (with more than 20 years of experience in thyroid US) was performed. Finally, a sonologist with more than 3 years of experience accumulated the scores according to the indicators, and obtained ACR TI-RADS and C-TIRADS results of each image respectively. The above three reviewers were blinded to the findings of each other, the results of S-Detect, the pathological results and other clinical information of the CITNs.
Pathology
All of the cytological and histological findings of the enrolled CITNs were reviewed and confirmed by two pathologists (with more than 10 years experience of cytopathology or histopathology respectively) independently, and they were blinded to the findings of each other. Another pathologist with more than 15 years experience made the final decision if no consensus was reached. To ensure the corresponding relationship between pathology and US image, the size and location of each CITN was compared, such as left or right lobe or isthmus; superior pole or inferior pole or middle; ventral or dorsal or middle; and interior or middle or lateral.
Statistical analysis
All statistical analyses were performed using SPSS v19.0 (SPSS Inc, Chicago, IL) and MedCalc v15.0 (MedCalc Software, Ostend, Belgium). If the variables were quantitative and normal, the mean ± standard deviation (SD) was used for statistical description, and the median with interquartile range (IQR) was used for the non-normal variables. Counting data is presented as numbers and percentage. The normal distribution data were compared between groups using an independent sample t-test. For non-normal distribution data, differences were analyzed using a Mann-Whitney U test. Chi-square test or McNemar’s test was used to compare the differences between groups for counting data. Linear-by-Linear Association test was used to analyze the linear trend of the malignant risk of CITNs. The ROC curve was drawn according to the histological results as the reference standard to find the best diagnostic cutoff value for the two TI-RADS classifications. In addition, the area under curve (AUC) of ROC curve was used to compare the diagnostic efficacy between S-Detect and the two TI-RADS classifications by Delong test (29). P values less than 0.05 were considered as statistically significant.
Results
Clinicopathological characteristics of enrolled nodules
Totally, 159 CITNs from 145 patients were enrolled in this study (Table 4). Postoperative pathology revealed 38 benign TNs and 121 malignant TNs. The maximum diameter of benign nodules was significantly larger than that of malignant nodules (P=0.002), while there was no significant difference between benign and malignant nodules in terms of age and gender (P=0.092 and P=0.167).
According to ROC curve analysis, the ACR TI-RADS took TR4 as the cut-off value, and the nodules above TR4 were considered as possible malignant. The C-TIRADS took TR4B as the cut-off value, and the nodules above TR4B were also considered as possible malignant. ACR TI-RADS detected 39 benign nodules and 120 malignant nodules. C-TIRADS detected 42 benign nodules and 117 malignant nodules. S-Detect detected 29 benign nodules and 130 malignant nodules. The diagnostic results of the above three methods were statistically different from the pathological results (all P<0.001).
Malignancy risk of the CITNs in different categories
The malignant risk of CITNs in Bethesda system and different TI-RADS classifications increased with classification grade (Table 5). The malignant risk of nodules among Bethesda III, IV and V categories was 55.0% (33/60), 66.7% (8/12) and 92.0% (80/87), respectively (P for trend < 0.001). The risk of malignancy among ACR TI-RADS 3, 4 and 5 was 0% (0/0), 42.9% (15/35) and 88.3% (106/120), respectively (P for trend < 0.001). And the risk of malignancy in C-TIRADS 3, 4A, 4B, 4C and 5 was 0% (0/0), 7.7% (1/13), 62.1% (18/29), 85.4% (82/96) and 95.2% (21/21) (P for trend < 0.001).
Comparison of diagnostic performance between S-Detect and the two TI-RADS classifications for differentiating CITNs
In all the CITNs, the diagnostic results of S-Detect were significantly different from those of ACR TI-RADS and C-TIRADS, respectively (P=0.021 and P=0.007). However, in each of Bethesda categories, such as Bethesda III, IV, or V, there was no significant difference between the S-Detect and the two TI-RADS classifications (all P>0.05) (Table 6).
In all the enrolled CITNs, the sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and accuracy for diagnosing malignant TNs were 95.9%, 63.2%, 89.2%, 82.8%, and 88.1% for S-Detect, 87.6%, 63.2%, 88.3%, 61.5%, and 81.8% for ACR TI-RADS, and 84.3%, 60.5%, 87.2%, 54.8% and 78.6% for C-TIRADS, respectively (Table 7). The sensitivity and accuracy of S-Detect were higher than those of ACR TI-RADS (P=0.006 and P=0.021) and C-TIRADS (P=0.001 and P=0.001). And the NPV of S-Detect was higher than that of C-TIRADS (P=0.024). While specificity and PPV of S-Detect were similar to those of ACR TI-RADS (P=1.000 and P=0.154) and C-TIRADS (P=1.000 and P=0.072). Moreover, there was no significant difference in all evaluated indexes between ACR TI-RADS and C-TIRADS (all P>0.05) (Figures 1 and 2).
Figure 1 The ultrasound features of a thyroid nodule in a 28-year-old woman were solid, hypoechoic, wider than tall, smooth margin and without echogenic foci. S-Detect indicated “Possibly Malignant” (A). The ACR TI-RADS was TR4 with 4 points, and the C-TIRADS was TR4A with 1 point. It was considered to be a benign nodule. The nodule was classified as Bethesda V category after fine-needle aspiration (B) (HE staining; magnification ×400). And the final pathological result showed papillary thyroid carcinoma (C) (HE staining; magnification ×400).
Figure 2 The ultrasound features of a thyroid nodule in a 32-year-old woman were solid, hypoechoic, wider than tall, ill-defined margin and punctate echogenic foci. S-Detect indicated “Possibly Benign” (A). However, the ACR TI-RADS was TR5 with 7 points, and the C-TIRADS was TR4C with 3 points. It was considered to be a malignant nodule. The nodule was classified as Bethesda III category after fine-needle aspiration (B) (HE staining; magnification ×400). And the final pathological result suggested nodular goiter (C) (HE staining; magnification ×400).
ROC curves were drawn for S-Detect and the two TI-RADS classifications (Figure 3). The AUC value of S-Detect [0.795, 95% confidence interval (CI): 0.724-0.855] was higher than C-TIRADS (0.724, 95% CI: 0.648-0.792) (Z=2.111, P=0.035). However, the AUC value between S-Detect and ACR TI-RADS (0.754, 95% CI: 0.679-0.819) (Z=1.375, P=0.169) was not statistically significant difference, nor was the AUC between ACR TI-RADS and C-TIRADS (Z=1.793, P=0.073).
Figure 3 The receiver operating characteristic (ROC) curves of S-Detect and two TI-RADS classifications.
Comparison of diagnostic efficacy of S-Detect and the two TI-RADS classifications in different pathological types
Seven pathological types were present in 38 benign nodules, including nodular goiter (n=13), lymphocytic thyroiditis (n=6), follicular adenoma (n=10), subacute thyroiditis (n=2), thyroid fibro-hyaline nodule (n=5), focal fibroplasia (n=1), and granulomatous inflammation (n=1). Three pathological types were found in 121 malignant nodules, including papillary thyroid carcinoma (PTC) (n=115) and follicular thyroid carcinoma (FTC) (n=6) (Table 8). The diagnostic accuracy of S-Detect for PTC was higher than that of ACR TI-RADS and C-TIRADS (P=0.021, P=0.003, respectively), while for other pathological types, there was no significant difference in the diagnostic results of the three methods (all P>0.05).
Discussion
At present, the main method to diagnose suspicious TNs is US-guided FNA combined with TBSRTC. However, CITN is a gray zone of cytological diagnosis and remains a challenge in clinical practice (30). For further diagnosis, S-Detect and the two TI-RADS classifications were used for preoperative assessment of CITNs in this study. S-Detect is a highly recognized US thyroid imaging platform based on AI-CADS. The system include two outputs: TI-RADS based on scoring and dichotomous predictions. The prediction is a independent diagnosis based on convolutional neural network deep learning techniques (31, 32). S-Detect has the ability to evaluate TNs in real time, which could reduce subjectivity to a certain extent (23, 24). ACR TI-RADS and C-TIRADS are widely used and have important value in determining the nature of nodules (5, 6). In this study, the diagnostic efficacy of the two TI-RADS classifications for CITNs was similar without significant difference.
The diagnostic difference between S-Detect and the two TI-RADS classifications in each Bethesda category
In this study, the malignant risk for Bethesda III, IV, and V TNs increased with grade, similar to the ACR TI-RADS and C-TIRADS, but benign and malignant nodules were found in each category. In all the CITNs, there were significant differences between S-Detect and the two TI-RADS classifications. However, there was no significant difference in the diagnosis results of S-Detect and the two TI-RADS classifications in each category (Bethesda III, IV, and V). The nodules with Bethesda V had a high malignant risk, so it was understandable that S-Detect and the two TI-RADS classifications had similar diagnostic efficacy. For Bethesda III and IV nodules, the reason might be related to the low number of cases, especially for Bethesda IV nodules, which needed to be further confirmed in future studies. In addition, malignancy rates of Bethesda III, IV, and V in the study population were much higher than those of the general population (33), which maybe associated with bases on surgical cohorts selection bias.
The diagnostic efficacy of S-Detect and the two TI-RADS classifications in the all enrolled CITNs
The diagnostic performance of S-Detect in discriminating CITNs was similar to ACR TI-RADS and superior to C-TIRADS. In terms of sensitivity and accuracy, S-Detect was higher than the two TI-RADS classifications. According to previous studies (34, 35), S-Detect analyzed and weighted US features of many malignant nodules to identify suspicious TNs, such as composition, echogenicity, shape, margin, calcification and extrathyroidal extension. So S-Detect was more detailed and objective in the identification process. While the malignant features judged by two TI-RADS classifications were relatively limited, which could not comprehensively and reliably predict the nature of TNs (5, 6, 36). Moreover, the interpretation process depends on visual observation and personal experience, which also affected the diagnostic efficiency (19). In terms of specificity and PPV, S-Detect needed to be improved, which was consistent with the results of many studies (21, 28, 37). This could be attributed to the three factors. First, some TI-RADS classifications related to the development of S-Detect software, such as Kwak TI-RADS, itself have high sensitivity and low specificity (26). Secondly, S-Detect is not accurate enough to recognize calcifications (22). For S-Detect, benign lesions with calcification are likely to show false-positive results, whereas, the smaller lesions without calcifications were likely to show false-negative results (38). Furthermore, in this study, all the enrolled TNs were suspected by US, the TI-RADS grade was relatively higher, and the number of benign nodules was relatively small, which could easily lead to biased results. However, some studies believed that S-Detect still had higher specificity and PPV compared with different TI-RADS classifications (27). Therefore, it is generally accepted that the diagnostic efficacy of S-Detect is different from that of different TI-RADS classifications. Of course, radiologists with different experience can also give different TI-RADS results for the same thyroid nodule. Chung et al. (28) believed that the diagnostic efficacy of S-Detect had obvious advantages for radiologists with less experience in the diagnosis of thyroid nodule.
The diagnostic efficacy of S-Detect and the two TI-RADS classifications in different pathological types
PTC is the highest incidence of thyroid malignant tumors (1, 2), so the training sets of S-Detect are mainly based on the US features of PTC (39). Therefore, S-Detect has satisfactory diagnostic performance for PTC (40), which was also confirmed in this study. The diagnostic accuracy of S-Detect for PTC was significantly higher than that of the two TI-RADS classifications. However, for other malignant pathological types, such as FTC, there was no significant difference in the diagnostic results, which may be related to the limited number of cases, and it is necessary to increase the number of cases for further research in the future. Of course, the preoperative diagnosis of follicular thyroid neoplasm has been challenging. It is difficult to distinguish a benign or malignant TN based on cytological examination of follicular neoplasm (41). The presence of tumor capsule invasion and angioinvasion could be assessed only by postoperative histological pathology (42). Preoperative TI-RADS had been used to predict malignancy for Bethesda IV category TNs in previous studies (17, 18, 43), and the results showed that the risk of malignancy was 50.0% (18). This is similar to present study and indicated that the follicular neoplasm presented a confusing US characteristics. FTC could be found with a larger size, ovoid shape, and well-defined margins, which were regarded as benign features (44), so the diagnostic value of AI-CADS was also limited. Although the diagnostic accuracy of S-Detect in this study was 66.7%, which was higher than that of ACR TI-RADS (33.3%) and C-TIRADS (16.7%), there was no significant difference between the diagnostic results.
The relationship between size and nature of CITNs
In addition, the maximum diameter of TNs may also be used as an auxiliary diagnostic method. This has been proven in some studies (40, 45). With the resolution of ultrasonic diagnostic instrument continuously improve and residents’ health consciousness enhancement, malignant nodules in small size can often be detected in the regular health examination, so as to obtain intervention treatment. Whereas benign nodules are generally not intervened, and have the opportunity to increase in size over time. So the maximum diameter of malignant nodules is often smaller than that of benign nodules (40), which is consistent with the results in this study. Of course, there are a small number of malignant nodules that are detected with large size, which is often associated with less differentiated nodules or patients who have not undergone a health examination for a long time.
Limitations
There are several limitations in this study. Firstly, the images used in this study are static two-dimensional images, and sonographers may have missed some important information when analyzing TNs images. Secondly, the number of TNs with Bethesda III and IV categories was relatively small. Similarly, PTC was the mainly pathological type in this study, and the number of other pathological types was relatively short. These may cause sampling bias to a certain extent. Thirdly, the S-Detect used in this study focused on PTC mainly. There is relatively little training for other types of thyroid tumors. This may lead to poor diagnostic ability of S-Detect for other tumors.
Conclusion
S-Detect based on US images could be used to predict malignant risk of CITNs after FNA. Although its performance cannot replace the pathological results, it was similar to ACR TI-RADS and superior to C-TIRADS in this study. Especially when the pathological type was PTC, S-Detect had the opportunity to become an alternative to TI-RADS.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving human participants were reviewed and approved by the Ethics Committee of Sir Run Run Shaw Hospital, School of Medicine, Zhejiang University. The patients/participants provided their written informed consent to participate in this study. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.
Author contributions
LZ proposed the research ideas, designed the method and wrote the original draft. S-YL put forward some suggestions on the research ideas and reviewed and revised the paper. H-FW and L-LZ collected ultrasound images and obtained the diagnosis results of S-Detect. C-JZ, QL and L-LX were responsible for the interpretation of ultrasound image features. L-LZ counted the counting results of ultrasound image features. M-RZ was responsible for the diagnosis of pathological results. G-FH collected the patients’ clinical data. EG-A finally proofread the paper. All authors contributed to the article and approved the submitted version.
Funding
This study is supported by the Fund of Natural Science Foundation of Zhejiang Province (LY20H180005), which is sponsored by S-YL.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Wang J, Yu F, Shang Y, Ping Z, Liu L. Thyroid cancer: incidence and mortality trends in China, 2005-2015. Endocrine. (2020) 68:163–73. doi: 10.1007/s12020-020-02207-6
2. Yan KL, Li S, Tseng C-H, Kim J, Nguyen DT, Dawood NB, et al. Rising incidence and incidence-based mortality of thyroid cancer in California, 2000-2017. J Clin Endocrinol Metab (2020) 105:dgaa121. doi: 10.1210/clinem/dgaa121
3. Dighe M, Barr R, Bojunga J, Cantisani V, Chammas MC, Cosgrove D, et al. Thyroid ultrasound: State of the art. part 2 - focal thyroid lesions. Med Ultrason. (2017) 19:195–210. doi: 10.11152/mu-999
4. Burman KD, Wartofsky L. CLINICAL PRACTICE. thyroid nodules. N Engl J Med (2015) 373:2347–56. doi: 10.1056/NEJMcp1415786
5. Tessler FN, Middleton WD, Grant EG, Hoang JK, Berland LL, Teefey SA, et al. ACR thyroid imaging, reporting and data system (TI-RADS): White paper of the ACR TI-RADS committee. J Am Coll Radiol (2017) 14:587–95. doi: 10.1016/j.jacr.2017.01.046
6. Zhou J, Yin L, Wei X, Zhang S, Song Y, Luo B, et al. Superficial organ and vascular ultrasound group of the society of ultrasound in medicine of the Chinese medical association; Chinese artificial intelligence alliance for thyroid and breast ultrasound. 2020 Chinese guidelines for ultrasound malignancy risk stratification of thyroid nodules: The c-TIRADS. Endocrine. (2020) 70:256–79. doi: 10.1007/s12020-020-02441-y
7. Cantisani V, David E, Grazhdani H, Rubini A, Radzina M, Dietrich CF, et al. Prospective evaluation of semiquantitative strain ratio and quantitative 2D ultrasound shear wave elastography (SWE) in association with TIRADS classification for thyroid nodule characterization. Ultraschall Med (2019) 40:495–503. doi: 10.1055/a-0853-1821
8. Dietrich CF, Müller T, Bojunga J, Dong Y, Mauri G, Radzina M, et al. Statement and recommendations on interventional ultrasound as a thyroid diagnostic and treatment procedure. Ultrasound Med Biol (2018) 44:14–36. doi: 10.1016/j.ultrasmedbio.2017.08.1889
9. Oliveira CM, Costa RA, Patrício M, Estêvão A, Graça B, Caseiro-Alves F. Sonographic criteria predictive of malignant thyroid nodules: Which lesions should be biopsied? Acad Radiol (2018) 25:213–8. doi: 10.1016/j.acra.2017.09.006
10. Joo L, Na DG, Kim JH, Seo H. Comparison of core needle biopsy and repeat fine-needle aspiration in avoiding diagnostic surgery for thyroid nodules initially diagnosed as Atypia/Follicular lesion of undetermined significance. Korean J Radiol (2022) 23:280–8. doi: 10.3348/kjr.2021.0619
11. Huhtamella R, Kholová I. Thyroid Bethesda category AUS/FLUS in our microscopes: Three-Year-Experience and cyto-histological correlation. Cancers (Basel). (2019) 11:1670. doi: 10.3390/cancers11111670
12. Cibas ES, Ali SZ. The 2017 Bethesda system for reporting thyroid cytopathology. Thyroid. (2017) 27:1341–6. doi: 10.1089/thy.2017.0500
13. Ahn HS, Kim HJ, Welch HG. Korea's thyroid-cancer "epidemic"–screening and overdiagnosis. N Engl J Med (2014) 371:1765–7. doi: 10.1056/NEJMp1409841
14. Misiakos EP, Margari N, Meristoudis C, Machairas N, Schizas D, Petropoulos K, et al. Cytopathologic diagnosis of fine needle aspiration biopsies of thyroid nodules. World J Clin Cases. (2016) 4:38–48. doi: 10.12998/wjcc.v4.i2.38
15. Bongiovanni M, Bellevicine C, Troncone G, Sykiotis GP. Approach to cytological indeterminate thyroid nodules. Gland Surg (2019) 8:S98–S104. doi: 10.21037/gs.2018.12.06
16. Chirayath SR, Pavithran PV, Abraham N, Nair V, Bhavani N, Kumar H, et al. Prospective study of Bethesda categories III and IV thyroid nodules: Outcomes and predictive value of BRAFV600E mutation. Indian J Endocrinol Metab (2019) 23:278–81. doi: 10.4103/ijem.IJEM_635_18
17. Barbosa TLM, Junior COM, Graf H, Cavalvanti T, Trippia MA, da Silveira Ugino RT, et al. ACR TI-RADS and ATA US scores are helpful for the management of thyroid nodules with indeterminate cytology. BMC Endocr Disord (2019) 19:112. doi: 10.1186/s12902-019-0429-5
18. Ahmadi S, Herbst R, Oyekunle T, X' J, Strickland K, Roman S, et al. USING THE ATA AND ACR TI-RADS SONOGRAPHIC CLASSIFICATIONS AS ADJUNCTIVE PREDICTORS OF MALIGNANCY FOR INDETERMINATE THYROID NODULES. Endocr Pract (2019) 25:908–17. doi: 10.4158/EP-2018-0559
19. Jin Z, Zhu Y, Zhang S, Xie F, Zhang M, Zhang Y, et al. Ultrasound computer-aided diagnosis (CAD) based on the thyroid imaging reporting and data system (TI-RADS) to distinguish benign from malignant thyroid nodules and the diagnostic performance of radiologists with different diagnostic experience. Med Sci Monit (2020) 26:e918452. doi: 10.12659/MSM.918452
20. Sun C, Zhang Y, Chang Q, Liu T, Zhang S, Wang X, et al. Evaluation of a deep learning-based computer-aided diagnosis system for distinguishing benign from malignant thyroid nodules in ultrasound images. Med Phys (2020) 47:3952–60. doi: 10.1002/mp.14301
21. Choi JH, Kang BJ, Baek JE, Lee HS, Kim SH. Application of computer-aided diagnosis in breast ultrasound interpretation: Improvements in diagnostic performance according to reader experience. Ultrasonography. (2018) 37:217–25. doi: 10.14366/usg.17046
22. Kim HL, Ha EJ, Han M. Real-world performance of computer-aided diagnosis system for thyroid nodules using Ultrasonography. Ultrasound Med Biol (2019) 45:2672–8. doi: 10.1016/j.ultrasmedbio.2019.05.032
23. Buda M, Wildman-Tobriner B, Hoang JK, Thayer D, Tessler FN, Middleton WD, et al. Management of thyroid nodules seen on US images: Deep learning may match performance of radiologists. Radiology. (2019) 292:695–701. doi: 10.1148/radiol.2019181343
24. Jeong EY, Kim HL, Ha EJ, Park SY, Cho YJ, Han M. Computer-aided diagnosis system for thyroid nodules on ultrasonography: Diagnostic performance and reproducibility based on the experience level of operators. Eur Radiol (2019) 29:1978–85. doi: 10.1007/s00330-018-5772-9
25. Wei Q, Zeng SE, Wang LP, Yan YJ, Wang T, Xu JW, et al. The value of s-detect in improving the diagnostic performance of radiologists for the differential diagnosis of thyroid nodules. Med Ultrason. (2020) 22:415–23. doi: 10.11152/mu-2501
26. Szczepanek-Parulska E, Wolinski K, Dobruch-Sobczak K, Antosik P, Ostalowska A, Krauze A, et al. S-detect software vs. EU-TIRADS classification: A dual-center validation of diagnostic performance in differentiation of thyroid nodules. J Clin Med (2020) 9:2495. doi: 10.3390/jcm9082495
27. Han M, Ha EJ, Park JH. Computer-aided diagnostic system for thyroid nodules on ultrasonography: Diagnostic performance based on the thyroid imaging reporting and data system classification and dichotomous outcomes. AJNR Am J Neuroradiol (2021) 42:559–65. doi: 10.3174/ajnr.A6922
28. Chung SR, Baek JH, Lee MK, Ahn Y, Choi YJ, Sung TY, et al. Computer-aided diagnosis system for the evaluation of thyroid nodules on ultrasonography: Prospective non-inferiority study according to the experience level of radiologists. Korean J Radiol (2020) 21:369–76. doi: 10.3348/kjr.2019.0581
29. DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. (1988) 44:837–45. doi: 10.2307/2531595
30. Almquist M, Muth A. Surgical management of cytologically indeterminate thyroid nodules. Gland Surg (2019) 8:S105–11. doi: 10.21037/gs.2019.01.03
31. Ha EJ, Baek JH, Na DG. Risk stratification of thyroid nodules on ultrasonography: Current status and perspectives. Thyroid. (2017) 27:1463–8. doi: 10.1089/thy.2016.0654
32. Haugen BR. 2015 American thyroid association management guidelines for adult patients with thyroid nodules and differentiated thyroid cancer: What is new and what has changed? Cancer. (2017) 123:372–81. doi: 10.1002/cncr.30360
33. Belovarac B, Zhou F, Modi L, Sun W, Shafizadeh N, Negron R, et al. Evaluation of ACR TI-RADS cytologically indeterminate thyroid nodules and molecular profiles: a single-institutional experience. J Am Soc Cytopathol. (2022) 11:165–72. doi: 10.1016/j.jasc.2022.01.002
34. Shin I, Kim YJ, Han K, Lee E, Kim HJ, Shin JH, et al. Application of machine learning to ultrasound images to differentiate follicular neoplasms of the thyroid gland. Ultrasonography. (2020) 39:257–65. doi: 10.14366/usg.19069
35. Yu Q, Jiang T, Zhou A, Zhang L, Zhang C, Xu P. Computer-aided diagnosis of malignant or benign thyroid nodes based on ultrasound images. Eur Arch Otorhinolaryngol (2017) 274:2891–7. doi: 10.1007/s00405-017-4562-3
36. Russ G, Bonnema SJ, Erdogan MF, Durante C, Ngu R, Leenhardt L. European Thyroid association guidelines for ultrasound malignancy risk stratification of thyroid nodules in adults: The EU-TIRADS. Eur Thyroid J (2017) 6:225–37. doi: 10.1159/000478927
37. Gao L, Liu R, Jiang Y, Song W, Wang Y, Liu J, et al. Computer-aided system for diagnosing thyroid nodules on ultrasound: A comparison with radiologist-based clinical assessments. Head Neck. (2018) 40:778–83. doi: 10.1002/hed.25049
38. Wu JY, Zhao ZZ, Zhang WY, Liang M, Ou B, Yang HY, et al. Computer-aided diagnosis of solid breast lesions with ultrasound: Factors associated with false-negative and false-positive results. J Ultrasound Med (2019) 38:3193–202. doi: 10.1002/jum.15020
39. Zhao WJ, Fu LR, Huang ZM, Zhu JQ, Ma BY. Effectiveness evaluation of computer-aided diagnosis system for the diagnosis of thyroid nodules on ultrasound: A systematic review and meta-analysis. Med (Baltimore). (2019) 98:e16379. doi: 10.1097/MD.0000000000016379
40. Xia S, Yao J, Zhou W, Dong Y, Xu S, Zhou J, et al. A computer-aided diagnosing system in the evaluation of thyroid nodules-experience in a specialized thyroid center. World J Surg Oncol (2019) 17:210. doi: 10.1186/s12957-019-1752-z
41. Tepeoğlu M, Bilezikçi B, Bayraktar SG. A histological assessment of the Bethesda system for reporting thyroid cytopathology (2010) abnormal categories: a series of 219 consecutive cases. Cytopathology. (2014) 25:39–44. doi: 10.1111/cyt.12051
42. Stanek-Widera A, Biskup-Frużyńska M, Zembala-Nożyńska E, Półtorak S, Śnietura M, Lange D. Suspicious for follicular neoplasm or follicular neoplasm? dilemma pathologist surgeon. Endokrynol Pol (2016) 67:17–22. doi: 10.5603/EP.2016.0003
43. Zhang WB, Li JJ, Chen XY, He BL, Shen RH, Liu H, et al. SWE combined with ACR TI-RADS categories for malignancy risk stratification of thyroid nodules with indeterminate FNA cytology. Clin Hemorheol Microcirc (2020) 76:381–90. doi: 10.3233/CH-200893
44. Jeong SH, Hong HS, Lee EH. Can nodular hyperplasia of the thyroid gland be differentiated from follicular adenoma and follicular carcinoma by ultrasonography? Ultrasound Q (2016) 32:349–55. doi: 10.1097/RUQ.0000000000000246
Keywords: computer-aided diagnosis, thyroid imaging reporting and data system, ultrasound, diagnosis, cytologically indeterminate thyroid nodule
Citation: Zhou L, Zheng L-l, Zhang C-j, Wei H-f, Xu L-l, Zhang M-r, Li Q, He G-f, Ghamor-Amegavi EP and Li S-y (2023) Comparison of S-Detect and thyroid imaging reporting and data system classifications in the diagnosis of cytologically indeterminate thyroid nodules. Front. Endocrinol. 14:1098031. doi: 10.3389/fendo.2023.1098031
Received: 14 November 2022; Accepted: 11 January 2023;
Published: 24 January 2023.
Edited by:
Sijie Fang, Shanghai Jiao Tong University, ChinaReviewed by:
Pia Pace-Asciak, University of Toronto, CanadaDong Xu, University of Chinese Academy of Sciences, China
Yong Wang, Chinese Academy of Medical Sciences and Peking Union Medical College, China
Copyright © 2023 Zhou, Zheng, Zhang, Wei, Xu, Zhang, Li, He, Ghamor-Amegavi and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Shi-yan Li, bGlzaGl5YW5Aemp1LmVkdS5jbg==