A comprehensive comparative assessment of eight risk stratification systems for thyroid nodules in the elderly population

Ma, Xiao; Yu, Jing; Huang, Yuanjing; Cui, Yiyang; Cui, Kefei

doi:10.3389/fonc.2023.1265973

ORIGINAL RESEARCH article

Front. Oncol., 15 November 2023

Sec. Head and Neck Cancer

Volume 13 - 2023 | https://doi.org/10.3389/fonc.2023.1265973

Xiao Ma

Jing Yu^*

Yuanjing Huang

Yiyang Cui

Kefei Cui^*

Department of Ultrasound, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China

Objective: This study aims to investigate the diagnostic value of eight risk stratification systems (RSSs) for thyroid nodules in the elderly and to explore the reasons for any differences from a younger group.

Methods: Cases of thyroid nodules that underwent ultrasound examination with thyroidectomy or fine-needle aspiration (FNA) at our hospital between August 2013 and March 2023 were collected. The patients were categorized into two groups: an elderly group (aged ≥60) and a younger group (aged <60). Eight RSSs were applied to evaluate these nodules respectively.

Results: The malignant rate in the elderly group was significantly lower than that in the younger group (28.2% vs. 49.6%, P<0.001). There were statistically significant differences in nodule diameter, multiplicity, composition, echogenicity, orientation, margin, and echogenic foci between the elderly and younger groups (P<0.05). Among the eight RSSs evaluated in elderly adults, the artificial intelligence-based Thyroid Imaging Reporting and Data System (AI-TIRADS) demonstrated the highest overall diagnostic efficacy, but with a relatively high unnecessary FNA rate (UFR) and missed cancer rate (MCR) of 55.0% and 51.3%, respectively. After the size thresholds were modified, the modified AI-TIRADS achieved the lowest UFR and MCR while maintaining nearly the lowest FNA rate (FNAR) among all the RSSs (P=0.172 and 0.162 compared with the ACR and original AI, respectively; P<0.05 compared with the other six RSSs).

Conclusion: Among the eight RSSs, AI demonstrated the highest diagnostic efficacy in the elderly population. However, its size thresholds for FNA need to be adjusted.

1 Introduction

With the widespread application of imaging techniques, the prevalence of thyroid nodules in adults has reached approximately 19%–68%, and it tends to increase with age (1, 2). To aid clinicians in determining suitable management strategies for the growing number of thyroid nodules, various ultrasound (US)-based risk stratification systems (RSSs) have been developed in recent years. The commonly used RSSs can be broadly classified into two groups: “point-based” systems and “pattern-based” systems. The point-based systems comprise the Thyroid Imaging Reporting and Data System (TIRADS) established by Kwak et al. (Kwak) (3), the American College of Radiology TIRADS (ACR) (4), the artificial intelligence-based revision of ACR proposed by Wildman-Tobriner et al. (AI) (5), and the Chinese TIRADS (C-TIRADS) (6). The pattern-based systems comprise the American Thyroid Association (ATA) guideline (2), the American Association of Clinical Endocrinologists, American College of Endocrinology, and Associazione Medici Endocrinologi (AACE/ACE/AME) guideline (7), the European Thyroid Association TIRADS (EU-TIRADS) (8), and the Korean Society of Thyroid Radiology TIRADS (K-TIRADS) (9). All these systems have exhibited excellent diagnostic performance (10–14). However, studies have shown that age is a confounding factor that cannot be overlooked (15, 16).

Increasing age is associated with a higher incidence of thyroid nodules, a lower malignancy rate, and a higher proportion of invasive nodules (17). This implies that RSSs designed for the general population may not necessarily be applicable to older patients. To the best of our knowledge, there is currently no comparative study of these eight systems specifically focusing on thyroid nodules in the elderly population. This study aims to analyze thyroid nodules in elderly patients using the eight RSSs, identify the optimal diagnostic system, and explore whether the established biopsy thresholds are applicable to older individuals.

2 Materials and methods

2.1 Patients

The Scientific Research and Clinical Trials Ethics Committee of the First Affiliated Hospital of Zhengzhou University of China granted approval for this retrospective study and waived the requirement for written informed consent for data use. The study covered the period from August 2013 to March 2023 and started from a cohort of 5473 thyroid nodules in 3685 patients who received thyroid US exams and thyroid surgery or fine-needle aspiration (FNA) at our hospital. A total of 3914 thyroid nodules in 2638 patients were included after the exclusion criteria were applied. The nodules were then divided into two groups according to patient age: an elderly group (≥60 years old, 794 nodules in 504 patients) and a younger group (<60 years old, 3120 nodules in 2134 patients) (Figure 1). The choice of 60 years as the age cutoff was based on our country’s regulations, medical situation, and previous literature (18). The exclusion criteria were as follows: (I) Age < 18 years. (II) Incomplete ultrasound images. (III) Inconclusive pathological results. If surgery had been performed, the postoperative pathology result prevailed. If no surgery was done, the FNA result was used. Cytology was classified according to the Bethesda System (19). Bethesda V and VI were considered malignant, and Bethesda II was considered benign. Bethesda classes I, III, and IV were excluded as uncertain outcomes. In the elderly group, 456 nodules were confirmed by postoperative pathology, consisting of 271 benign and 185 malignant cases. Additionally, 338 nodules were confirmed through FNA, with 299 benign and 39 malignant nodules. In the younger group, 2011 nodules were confirmed by postoperative pathology, comprising 713 benign and 1298 malignant cases. Additionally, 1109 nodules were confirmed through FNA, including 861 benign and 248 malignant cases.
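The reference-standard rule described above (surgical pathology prevails, otherwise Bethesda V/VI is malignant and Bethesda II is benign) can be written down as a small sketch. The function names, argument names, and string labels below are illustrative assumptions and are not taken from the study's actual data pipeline.

```python
from typing import Optional

# Bethesda categories treated as malignant / benign in Section 2.1.
MALIGNANT_BETHESDA = {"V", "VI"}
BENIGN_BETHESDA = {"II"}


def reference_label(surgical_pathology: Optional[str],
                    bethesda: Optional[str]) -> Optional[str]:
    """Return 'benign', 'malignant', or None when the nodule is excluded.

    surgical_pathology: 'benign' or 'malignant' if surgery was done, else None.
    bethesda: Roman-numeral Bethesda category if FNA was done, else None.
    (Both encodings are hypothetical, chosen only for this example.)
    """
    if surgical_pathology is not None:
        # Postoperative pathology prevails over cytology.
        return surgical_pathology
    if bethesda in MALIGNANT_BETHESDA:
        return "malignant"
    if bethesda in BENIGN_BETHESDA:
        return "benign"
    # Bethesda I, III, IV (or no result) count as uncertain outcomes.
    return None


def age_group(age_years: int) -> Optional[str]:
    """Return 'elderly' (>=60), 'younger' (18-59), or None if under 18."""
    if age_years < 18:
        return None
    return "elderly" if age_years >= 60 else "younger"
```

The group sizes reported in this section are internally consistent with this rule: 456 + 338 = 794 elderly nodules, of which 185 + 39 = 224 (28.2%) were malignant.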

Figure 1 Flowchart of study subject inclusion. US, ultrasound; FNA, fine needle aspiration; n, number.

2.2 Thyroid ultrasound examination and image interpretation

One of two US specialists, with 33 and 11 years of experience in thyroid US, performed each examination using an Aplio 300 or 500 system (Toshiba Corporation, Tokyo, Japan) equipped with a 5–12 MHz linear array transducer. Two superficial-organ sonographers (with 8 and 12 years of experience in analyzing thyroid US images), blinded to the biopsy results and the final pathological diagnoses, assessed the ultrasonic features of the nodules and classified them according to the ATA guidelines, ACR, AI, Kwak, EU, AACE/ACE/AME, C-TIRADS, and K-TIRADS. US features included the size (the maximal diameter on US), composition (solid or almost solid, mixed cystic and solid, cystic, spongiform), echogenicity (hyperechoic, isoechoic, hypoechoic, markedly hypoechoic, anechoic), orientation (taller-than-wide, wider-than-tall), margin (smooth, ill-defined, irregular or lobulated, extrathyroidal extension), and echogenic foci (punctate echogenic foci, peripheral calcifications, macrocalcification, comet-tail artifacts). Comet-tail artifacts were recorded only in the absence of microcalcification; other types of calcification could be selected simultaneously. When the two readers disagreed, a third expert with 33 years of thyroid imaging experience joined a discussion to reach a final decision. Before the ultrasonic features were assessed, an interactive case-based training session was conducted using 30 representative thyroid nodules not included in this study. FNA recommendations were then determined based on the size thresholds of each guideline. Notably, nodules that could not be classified under the ATA guidelines were grouped with the intermediate-suspicion category, as in previous reports (20–23).

2.3 Statistical analysis

The FNA rate (FNAR) was determined by calculating the percentage of nodules recommended for FNA out of the total nodules. The unnecessary FNA rate (UFR) was computed by determining the ratio of benign nodules among the nodules that were advised to undergo FNA. The missed cancer rate (MCR) was derived by calculating the proportion of malignant nodules that were not recommended for FNA out of all malignant nodules.
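The three definitions above can be made concrete with a short sketch. The function name and the toy nodule list are hypothetical and serve only to illustrate which denominator each rate uses; they are not the study data.

```python
from typing import Dict, Iterable, Tuple


def fna_rates(nodules: Iterable[Tuple[bool, bool]]) -> Dict[str, float]:
    """Compute FNAR, UFR and MCR as fractions.

    Each nodule is a pair (fna_recommended, malignant).
    Assumes at least one nodule, one FNA recommendation and one cancer,
    so that no denominator is zero.
    """
    rows = list(nodules)
    n_total = len(rows)
    n_fna = sum(1 for rec, _ in rows if rec)
    n_fna_benign = sum(1 for rec, mal in rows if rec and not mal)
    n_malignant = sum(1 for _, mal in rows if mal)
    n_missed = sum(1 for rec, mal in rows if mal and not rec)
    return {
        "FNAR": n_fna / n_total,       # recommended / all nodules
        "UFR": n_fna_benign / n_fna,   # benign among recommended
        "MCR": n_missed / n_malignant, # cancers not recommended / all cancers
    }


# Hypothetical toy cohort of 10 nodules.
toy = (
    [(True, True)] * 2      # FNA recommended, malignant
    + [(True, False)] * 2   # FNA recommended, benign (unnecessary)
    + [(False, True)] * 1   # not recommended, malignant (missed)
    + [(False, False)] * 5  # not recommended, benign
)
rates = fna_rates(toy)
print(rates)  # FNAR = 0.4, UFR = 0.5, MCR = 1/3
```

Note that UFR is conditioned on the recommended nodules and MCR on the malignant nodules, so the two rates can move independently when size thresholds change.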

Statistical analysis was conducted using SPSS 26.0 (IBM Corp., Armonk, NY, USA) and MedCalc 18.2.1 (MedCalc Software Ltd, Ostend, Belgium). Continuous data were presented as mean ± standard deviation (SD) and compared using the independent two-sample t-test. Categorical data were compared using the Chi-square test or Fisher’s exact test. Receiver operating characteristic (ROC) curves were constructed, and the areas under the ROC curve (AUCs) were compared using the DeLong method or Z-test. Sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), accuracy, FNAR, UFR, and MCR with 95% confidence intervals (CI) were evaluated for the RSSs and compared using the McNemar or Chi-square test. A two-sided P-value of <0.05 was considered statistically significant.
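For readers less familiar with the performance measures listed above, the following sketch computes them from a 2×2 table. It is not the SPSS or MedCalc procedure used in the study; the function name and the example counts are hypothetical, and confidence intervals are omitted.

```python
from typing import Dict


def diagnostic_metrics(tp: int, fp: int, fn: int, tn: int) -> Dict[str, float]:
    """Standard 2x2 diagnostic measures, returned as fractions.

    tp/fn: malignant nodules classified positive/negative by the RSS.
    fp/tn: benign nodules classified positive/negative by the RSS.
    """
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "PPV": tp / (tp + fp),
        "NPV": tn / (tn + fn),
        "accuracy": (tp + tn) / (tp + fp + fn + tn),
    }


# Hypothetical counts, chosen only to illustrate the arithmetic.
m = diagnostic_metrics(tp=80, fp=30, fn=20, tn=170)
for name, value in m.items():
    print(f"{name}: {value:.3f}")
```

PPV and NPV depend on the malignancy rate of the cohort, which is one reason the same ultrasound feature can show different predictive values in groups with different prevalence.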

3 Results

3.1 Basic characteristics

The malignant rate in the elderly group was significantly lower than that in the younger group (P<0.001). There was no statistically significant difference in gender between the two groups (P=0.119). However, the elderly group had a higher prevalence of multiple nodules and larger nodules, with a higher proportion of nodules measuring ≥20 mm and a lower proportion of nodules measuring <10 mm compared to the younger group (P<0.05) (Table 1).

Table 1 Basic characteristics of thyroid nodules according to age group.

3.2 Comparison of ultrasound features between elderly and younger groups

There were statistically significant differences in composition, echogenicity, orientation, margin, and echogenic foci between the elderly group and the younger group (Table 2). In the elderly group, the proportion of solid nodules was lower and the proportion of mixed cystic and solid nodules was higher than in the younger group. Hyperechoic, isoechoic, and unclassifiable echogenicity were more frequent in the elderly group, whereas hypoechoic and markedly hypoechoic nodules were less frequent. A taller-than-wide orientation was less common in the elderly group. Smooth, ill-defined, and unclassifiable margins were more common in the elderly group, whereas irregular margins and extrathyroidal extension were less common. Peripheral calcifications, macrocalcifications, and the absence of calcification were more common in the elderly group, whereas punctate echogenic foci were less common.

Table 2 Ultrasound features of thyroid nodules according to age group.

3.3 Diagnostic efficacy of suspicious ultrasound features

As shown in Table 3, the elderly group had lower sensitivity than the younger group for hypoechoic nodules, extrathyroidal extension, and punctate echogenic foci (P=0.042, 0.028, and <0.001, respectively), but higher sensitivity for ill-defined margins (P=0.035). The elderly group had higher specificity for hypoechoic nodules (P=0.036) but lower specificity for ill-defined margins (P=0.003). All ultrasound features in Table 3, except for markedly hypoechoic nodules, showed lower PPV in the elderly group than in the younger group (P<0.05). However, all ultrasound features in Table 3 exhibited higher NPV in the elderly group than in the younger group (P<0.05).

Table 3 Diagnostic performance of partial suspicious ultrasound features.

3.4 Diagnostic efficacy of RSSs in elderly and younger groups

The AUCs for ACR, AI, Kwak, C-TIRADS, ATA, EU, AACE/ACE/AME, and K-TIRADS in the elderly group were 0.854, 0.871, 0.861, 0.837, 0.832, 0.810, 0.795, and 0.859, respectively (Figure 2A). In the younger group, the AUCs were 0.869, 0.882, 0.887, 0.867, 0.855, 0.837, 0.828, and 0.880, respectively (Figure 2B). The AUCs of the elderly group were lower than those of the younger group for all eight RSSs; the difference was statistically significant for Kwak, C-TIRADS, and AACE/ACE/AME (P<0.05) but not for the other five RSSs (P>0.05).

Figure 2 The ROC curves of eight RSSs for elderly patients and younger patients. (A) The ROC curves of eight RSSs for elderly patients. (B) The ROC curves of eight RSSs for younger patients. ROC, receiver operating characteristic; ACR-TIRADS, American College of Radiology Thyroid Imaging Reporting and Data System; Kwak-TIRADS, TIRADS issued by Kwak et al.; C-TIRADS, Chinese-TIRADS; ATA guideline, American Thyroid Association guideline; EU-TIRADS, European TIRADS; AACE/ACE/AME, American Association of Clinical Endocrinologists, American College of Endocrinology, and Associazione Medici Endocrinologi guideline; K-TIRADS, Korean TIRADS.

3.5 Comparison of diagnostic efficacy among different RSSs for elderly patients

The ROC analysis showed that the cutoff value for C-TIRADS was 4A, for Kwak was 4C, for AACE/ACE/AME was 3, and for the remaining systems was 5. The highest area under the ROC curve was observed for AI, followed by Kwak, with AACE/ACE/AME exhibiting the lowest value. In terms of sensitivity, C-TIRADS demonstrated the highest level, followed by EU and AACE/ACE/AME, whereas K-TIRADS showed the lowest sensitivity. K-TIRADS displayed the highest specificity, followed by AI, while C-TIRADS exhibited the lowest specificity. The highest PPV was associated with AI, followed by K-TIRADS, while C-TIRADS presented the lowest PPV. The maximum NPV was achieved by C-TIRADS, followed by AACE/ACE/AME and EU, whereas K-TIRADS and ACR had the lowest NPV. AI showed the best accuracy, K-TIRADS came in second, and C-TIRADS exhibited the lowest accuracy (Table 4).
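This section does not state how the cutoff category for each system was chosen from its ROC curve. One common convention is to pick the category that maximizes the Youden index (sensitivity + specificity − 1); the sketch below illustrates that convention under this assumption, using hypothetical ordinal scores rather than study data.

```python
from typing import Sequence, Tuple


def youden_cutoff(scores: Sequence[int], labels: Sequence[int]) -> Tuple[int, float]:
    """Return (cutoff, J) maximizing J = sensitivity + specificity - 1.

    A nodule counts as test-positive when its ordinal category score
    is >= cutoff. labels: 1 = malignant, 0 = benign.
    Ties are resolved in favor of the lowest cutoff.
    """
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    best_cutoff, best_j = None, float("-inf")
    for cutoff in sorted(set(scores)):
        tp = sum(1 for s, y in zip(scores, labels) if s >= cutoff and y == 1)
        tn = sum(1 for s, y in zip(scores, labels) if s < cutoff and y == 0)
        j = tp / n_pos + tn / n_neg - 1
        if j > best_j:
            best_cutoff, best_j = cutoff, j
    return best_cutoff, best_j


# Hypothetical category scores (1-5) and pathology labels for 10 nodules.
scores = [1, 2, 2, 3, 3, 4, 4, 5, 5, 5]
labels = [0, 0, 0, 0, 0, 0, 1, 1, 1, 1]
cutoff, j = youden_cutoff(scores, labels)
print(cutoff, round(j, 3))  # 4 0.833
```

Because the optimal cutoff depends on how scores are distributed among benign and malignant nodules, the same system can yield a different cutoff in a population whose feature distribution differs.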

Table 4 Diagnostic performance of eight RSSs for elderly patients.

3.6 Comparison of diagnostic efficacy for elderly patients based on size thresholds

After considering the size thresholds for FNA from various guidelines, we found that the FNAR for the eight different RSSs ranged from 30.5% to 59.7%, with AI having the lowest rate and K-TIRADS the highest. The UFR ranged from 55.0% to 74.5%, with AI having the lowest rate and K-TIRADS the highest. The MCR ranged from 22.8% to 51.8%, with C-TIRADS having the lowest rate and ACR the highest (Table 5).

Table 5 Comparison of therapeutic performance for elderly patients based on size thresholds.

3.7 The modified version of AI-TIRADS with adjusted size thresholds

Table 5 shows that, once the size thresholds for FNA were incorporated, all eight RSSs performed poorly. Among them, AI had the lowest FNAR and UFR, while C-TIRADS had the lowest MCR. Taken together, AI showed the best diagnostic efficacy among older adults, but its size thresholds for FNA needed to be adjusted. Considering the specific characteristics of nodules in elderly patients, we modified the size thresholds as follows: category 3 from ≥25 mm to no FNA at any size, category 4 from ≥15 mm to ≥25 mm, and category 5 from ≥10 mm to ≥5 mm. With these changes, the UFR of the modified AI-TIRADS decreased from 55.0% to 34.3% and the MCR decreased from 51.3% to 21.4%, while the FNAR increased only from 30.5% to 33.8%. The modified AI demonstrated the lowest UFR (P<0.05 compared to all eight RSSs) and MCR (P=0.733 compared to C-TIRADS and P<0.05 compared to the other seven RSSs). Additionally, it achieved nearly the lowest FNAR (P=0.172 and 0.162 compared to the ACR and original AI, respectively, but P<0.05 compared to the other six RSSs).
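The original and modified size thresholds can be expressed as a lookup from category to minimum diameter. This is a minimal sketch: the dictionary and function names are hypothetical, and treating categories 1 and 2 as never recommended for FNA is an assumption carried over from ACR TI-RADS rather than something stated in this section.

```python
from typing import Dict, Optional

# Minimum maximal diameter (mm) at which FNA is recommended, per category.
# None means "no FNA at any size".
ORIGINAL_THRESHOLDS_MM: Dict[int, Optional[float]] = {3: 25.0, 4: 15.0, 5: 10.0}
MODIFIED_THRESHOLDS_MM: Dict[int, Optional[float]] = {3: None, 4: 25.0, 5: 5.0}


def fna_recommended(category: int, max_diameter_mm: float,
                    thresholds: Dict[int, Optional[float]]) -> bool:
    """Return True if a nodule of this category and size meets the FNA threshold.

    Categories missing from the table (here 1 and 2) are treated as
    never recommended for FNA, which is an assumption of this sketch.
    """
    threshold = thresholds.get(category)
    if threshold is None:
        return False
    return max_diameter_mm >= threshold


# A 30 mm category 3 nodule: FNA under the original rule, follow-up under the modified rule.
print(fna_recommended(3, 30.0, ORIGINAL_THRESHOLDS_MM))  # True
print(fna_recommended(3, 30.0, MODIFIED_THRESHOLDS_MM))  # False
# A 7 mm category 5 nodule: missed by the original rule, sampled under the modified rule.
print(fna_recommended(5, 7.0, ORIGINAL_THRESHOLDS_MM))   # False
print(fna_recommended(5, 7.0, MODIFIED_THRESHOLDS_MM))   # True
```

Keeping the thresholds in a table separate from the decision function makes it straightforward to evaluate alternative threshold sets on the same cohort.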

4 Discussion

Among the various US-based RSSs, AI-TIRADS demonstrated the best overall performance, with the largest AUC, the highest PPV and accuracy, nearly the highest specificity, and relatively high sensitivity and NPV, which suggests that AI-TIRADS is the most suitable for elderly individuals. AI-TIRADS is a simplified version of ACR based on artificial intelligence algorithms. It shares the same risk stratification and the same FNA thresholds as ACR, thus maintaining its excellent diagnostic efficacy while reducing the UFR. Furthermore, it excludes the scoring of several ultrasound indicators, which simplifies the evaluation process and improves user-friendliness. Previous studies have confirmed that AI has similar or even higher diagnostic value compared to ACR (24, 25), which is consistent with our findings.

After the size thresholds for FNA were applied, the FNAR for the various guidelines ranged from 30.5% to 59.7%, the UFR ranged from 55.0% to 74.5%, and the MCR ranged from 22.8% to 51.8%. Although AI demonstrated the best performance, its UFR and MCR were as high as 55.0% and 51.3%, respectively. This indicates that the current size thresholds in existing guidelines are not suitable for the elderly, including those of ACR/AI, which had been reported to have the lowest UFR in previous literature (12, 14). One possible reason is that thyroid nodules in the elderly population are generally larger and more often benign, which the results of this study also corroborate. One study reported that the malignancy rate of thyroid nodules in individuals aged 20-49 was 17.1-22.9%, but it decreased to only 12.6% in those aged 70 and above (17). Hence, the size thresholds that are suitable for the general population may be relatively low for elderly individuals, resulting in a higher UFR. For elderly patients, surgical indications should be considered carefully, because surgery in this age group is not only a treatment but also carries a significant risk of morbidity, particularly for frail individuals (26). Advancing patient age should be a factor to consider when dealing with thyroid nodules (27). Because AI demonstrated the best overall diagnostic value for elderly individuals but had a high UFR and MCR, we adjusted its size thresholds for FNA. In this study, the malignancy rate of category 3 nodules in the elderly group was only 7.6% (9/119), and among the malignant nodules, only 33.3% (3/9) had a size of ≥25 mm. These nodules could be adequately monitored through follow-up (28). Therefore, we recommend follow-up instead of FNA for category 3 nodules. For category 4, the malignancy rate was 19.6% (31/158), and we recommend adjusting the size threshold from 15 mm to 25 mm. With these changes, we reduced the number of FNA nodules by 60.7% (from 122 to 48), while also avoiding unnecessary FNA for 68.3% of benign nodules (from 101 to 32). There was only a slight increase of 5 missed diagnoses (from 19 to 24).

However, for category 5 nodules, given the high malignancy rate and the higher likelihood of aggressive cancer in elderly individuals, which accounts for almost all thyroid-related deaths (26, 29), we lowered the size threshold from 10 mm to 5 mm, thus avoiding 82.8% of missed cancers (from 87 to 15). With all the adjustments implemented, the modified AI-TIRADS showed a significant decrease in the UFR and MCR (UFR before vs. after adjustment: 55.0% vs. 34.3%; MCR before vs. after adjustment: 51.3% vs. 21.4%; both P<0.001). Although the FNAR increased slightly, there was no statistically significant difference compared to the ACR and original AI (P=0.172 and 0.162, respectively), and it remained lower than that of the other six RSSs.
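The percentages quoted in the preceding two paragraphs follow directly from the reported counts. The short check below recomputes them; the helper names are hypothetical and the inputs are the counts given in the text.

```python
def percent(numerator: int, denominator: int) -> float:
    """Proportion expressed as a percentage, rounded to one decimal place."""
    return round(numerator / denominator * 100, 1)


def percent_reduction(before: int, after: int) -> float:
    """Relative reduction from `before` to `after`, as a percentage."""
    return percent(before - after, before)


# Malignancy rates in the elderly group by AI category.
print(percent(9, 119))    # category 3 -> 7.6
print(percent(31, 158))   # category 4 -> 19.6

# Effect of the category 3 and 4 threshold changes.
print(percent_reduction(122, 48))  # FNA nodules -> 60.7
print(percent_reduction(101, 32))  # unnecessary FNAs -> 68.3

# Effect of lowering the category 5 threshold from 10 mm to 5 mm.
print(percent_reduction(87, 15))   # missed cancers -> 82.8
```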

In this study, the ROC analysis yielded a diagnostic threshold of 4A for C-TIRADS, which differed from the 4B threshold previously used in the general population (30). This difference in threshold contributed to the higher sensitivity and lower specificity observed in this research. A possible reason is that C-TIRADS uses only a few key suspicious US signs, including solid composition, marked hypoechogenicity, ill-defined/irregular margin or extrathyroidal extension, vertical orientation, and microcalcifications. These features were generally less sensitive in the elderly population, especially microcalcifications. As a result, C-TIRADS tended to yield lower scores in the elderly population, leading to a lower diagnostic cutoff than in previous studies. Additionally, C-TIRADS does not account for the highly sensitive feature of hypoechoic nodules, which partly explains the superior diagnostic performance observed for Kwak (3), a system with a classification approach similar to that of C-TIRADS. Moreover, C-TIRADS assigns 1 point for an ill-defined margin, whereas in the elderly population an ill-defined margin exhibited low specificity and PPV (66.7% and 18.8%, respectively), which also contributed to the divergence between C-TIRADS and Kwak.

In this study, nodules that could not be classified under the ATA guidelines were grouped with the intermediate-suspicion category, which is similar to the classification method of K-TIRADS (9). However, the AUC of K-TIRADS was higher than that of ATA. Analysis of the data showed that the difference arose in mixed cystic and solid nodules with suspicious US features. ATA categorized these nodules into the high-suspicion category (TR-5). In contrast, K-TIRADS classified them, along with isoechoic nodules with suspicious US features, into TR-4, with only solid hypoechoic nodules with malignant features classified into TR-5. This emphasizes the predictive ability of solid hypoechoic nodules for malignancy, which the data from this study also confirmed. In the elderly group, the PPV of solid nodules was 43.1%, whereas that of mixed cystic and solid nodules was only 5.8%. The PPV of hypoechoic nodules, although lower in the elderly group than in the younger group, still reached 44.1%, while that of hyperechoic or isoechoic nodules was only 8.4%. This indicates that, in clinical practice, particular weight should be given to the predictive ability of solid hypoechoic nodules for malignancy.

This study had several limitations. Firstly, it was conducted at a single center, which may limit the generalizability of the findings. Multi-center studies will be necessary to validate and strengthen the results. Secondly, due to the limited number of patients aged 80 and above, the study did not compare different age groups within the elderly population. Thirdly, as this study was retrospective in nature, there may have been some limitations in image interpretation. Further prospective studies will be essential to establish more definitive conclusions.

5 Conclusion

All eight RSSs showed acceptable diagnostic efficacy in elderly patients, albeit lower than in younger patients. Among these RSSs, AI demonstrated the highest overall diagnostic efficacy. With adjusted size thresholds, the modified AI-TIRADS achieved the lowest UFR and MCR and nearly the lowest FNAR, thus offering enhanced guidance for clinical practice.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by the scientific research and clinical trials ethics committee of the First Affiliated Hospital of Zhengzhou University. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because this study was retrospective and only necessary clinical data were collected.

Author contributions

XM: Data curation, Investigation, Project administration, Writing – original draft. JY: Conceptualization, Investigation, Methodology, Project administration, Writing – review & editing, Writing – original draft. YH: Conceptualization, Formal Analysis, Investigation, Supervision, Writing – review & editing. YC: Data curation, Formal Analysis, Investigation, Software, Validation, Writing – review & editing. KC: Conceptualization, Supervision, Validation, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.

Acknowledgments

We would like to express our gratitude to all our patients and colleagues who provided invaluable assistance and support throughout this work.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Gharib H, Papini E, Paschke R, Duick D, Valcavi R, Hegedüs L, et al. American association of clinical endocrinologists, associazione medici endocrinologi, and european thyroid association medical guidelines for clinical practice for the diagnosis and management of thyroid nodules: executive summary of recommendations. J Endocrinol Invest (2010) 33:51–6. doi: 10.4158/10024.GL

2. Haugen BR, Alexander EK, Bible KC, Doherty GM, Mandel SJ, Nikiforov YE, et al. 2015 American thyroid association management guidelines for adult patients with thyroid nodules and differentiated thyroid cancer: the american thyroid association guidelines task force on thyroid nodules and differentiated thyroid cancer. Thyroid Off J Am Thyroid Assoc (2016) 26(1):1–133. doi: 10.1089/thy.2015.0020

3. Kwak JY, Han KH, Yoon JH, Moon HJ, Son EJ, Park SH, et al. Thyroid imaging reporting and data system for US features of nodules: a step in establishing better stratification of cancer risk. Radiology (2011) 260(3):892–9. doi: 10.1148/radiol.11110206

4. Tessler FN, Middleton WD, Grant EG, Hoang JK, Berland LL, Teefey SA, et al. ACR thyroid imaging, reporting and data system (TI-RADS): white paper of the ACR TI-RADS committee. J Am Coll Radiol (2017) 14(5):587–95. doi: 10.1016/j.jacr.2017.01.046

5. Wildman-Tobriner B, Buda M, Hoang JK, Middleton WD, Thayer D, Short RG, et al. Using artificial intelligence to revise ACR TI-RADS risk stratification of thyroid nodules: diagnostic accuracy and utility. Radiology (2019) 292(1):112–9. doi: 10.1148/radiol.2019182128

6. Zhou J, Yin L, Wei X, Zhang S, Song Y, Luo B, et al. Chinese guidelines for ultrasound malignancy risk stratification of thyroid nodules: the C-TIRADS. Endocrine (2020) 70(2):256–79. doi: 10.1007/s12020-020-02441-y

7. Gharib H, Papini E, Garber JR, Duick DS, Harrell RM, Hegedüs L, et al. American association of clinical endocrinologists, american college of endocrinology, and associazione medici endocrinologi medical guidelines for clinical practice for the diagnosis and management of thyroid nodules–2016 update. Endocrine Pract Off J Am Coll Endocrinol Am Assoc Clin Endocrinol (2016) 22(5):622–39. doi: 10.4158/EP161208.GL

8. Russ G, Bonnema Steen J, Erdogan Murat F, Durante C, Ngu R, Leenhardt L. European thyroid association guidelines for ultrasound Malignancy risk stratification of thyroid nodules in adults: the EU-TIRADS. Eur Thyroid J (2017) 6(5):225–37. doi: 10.1159/000478927

9. Shin JH, Baek JH, Chung J, Ha EJ, Kim JH, Lee YH, et al. Ultrasonography diagnosis and imaging-based management of thyroid nodules: revised korean society of thyroid radiology consensus statement and recommendations. Korean J Radiol (2016) 17(3):370–95. doi: 10.3348/kjr.2016.17.3.370

10. Ha S, Baek J, Na D, Suh C, Chung S, Choi Y, et al. Diagnostic performance of practice guidelines for thyroid nodules: thyroid nodule size versus biopsy rates. Radiology (2019) 291(1):92–9. doi: 10.1148/radiol.2019181723

11. Hoang J, Asadollahi S, Durante C, Hegedüs L, Papini E, Tessler FN. An international survey on utilization of five thyroid nodule risk stratification systems: A needs assessment with future implications. Thyroid (2022) 32(6):675–81. doi: 10.1089/thy.2021.0558

12. Grani G, Lamartina L, Ascoli V, Bosco D, Biffoni M, Giacomelli L, et al. Reducing the number of unnecessary thyroid biopsies while improving diagnostic accuracy: toward the "Right" TIRADS. J Clin Endocrinol Metab (2019) 104(1):95–102. doi: 10.1210/jc.2018-01674

13. Basha M, Alnaggar A, Refaat R, El-Maghraby A, Refaat M, Abd Elhamed M, et al. The validity and reproducibility of the thyroid imaging reporting and data system (TI-RADS) in categorization of thyroid nodules: Multicentre prospective study. Eur J Radiol (2019) 117:184–92. doi: 10.1016/j.ejrad.2019.06.015

14. Castellana M, Castellana C, Treglia G, Giorgino F, Giovanella L, Russ G, et al. Performance of five ultrasound risk stratification systems in selecting thyroid nodules for FNA. J Clin Endocrinol Metab (2020) 105(5):dgz170. doi: 10.1210/clinem/dgz170

15. Sorrenti S, Baldini E, Tartaglia F, Catania A, Arcieri S, Pironi D, et al. Nodular thyroid disease in the elderly: novel molecular approaches for the diagnosis of Malignancy. Aging Clin Exp Res (2017) 29:7–13. doi: 10.1007/s40520-016-0654-y

16. Liang Y, Li X, Wang F, Yan Z, Sang Y, Yuan Y, et al. Detection of thyroid nodule prevalence and associated risk factors in southwest China: A study of 45,023 individuals undergoing physical examinations. Diabetes Metab Syndr Obes (2023) 16:1697–707. doi: 10.2147/DMSO.S412567

17. Kwong N, Medici M, Angell T, Liu X, Marqusee E, Cibas E, et al. The influence of patient age on thyroid nodule formation, multinodularity, and thyroid cancer risk. J Clin Endocrinol Metab (2015) 100(12):4434–40. doi: 10.1210/jc.2015-3100

18. Zhang J, Li D, Gao J. Health disparities between the rural and urban elderly in China: A cross-sectional study. Int J Environ Res Public Health (2021) 18(15):8056. doi: 10.3390/ijerph18158056

19. Kakudo K, Higuchi M, Hirokawa M, Satoh S, Jung CK, Bychkov A. Thyroid FNA cytology in Asian practice-Active surveillance for indeterminate thyroid nodules reduces overtreatment of thyroid carcinomas. Cytopathol: Off J Br Soc Clin Cytol (2017) 28(6):455–66. doi: 10.1111/cyt.12491

20. Rosario P, da Silva A, Nunes M, Ribeiro Borges M, Mourão G, Calsolari MJE. Risk of malignancy in 1502 solid thyroid nodules >1 cm using the new ultrasonographic classification of the American Thyroid Association. Endocrine (2017) 56(2):442–5. doi: 10.1007/s12020-016-1163-7

21. Ha E, Na D, Moon W, Lee Y, Choi N. Diagnostic performance of ultrasound-based risk-stratification systems for thyroid nodules: comparison of the 2015 American Thyroid Association guidelines with the 2016 Korean Thyroid Association/Korean Society of Thyroid Radiology and 2017 American College of Radiology guidelines. Thyroid (2018) 28(11):1532–7. doi: 10.1089/thy.2018.0094

22. Yim Y, Na D, Ha E, Baek J, Sung J, Kim J, et al. Concordance of three international guidelines for thyroid nodules classified by ultrasonography and diagnostic performance of biopsy criteria. Korean J Radiol (2020) 21(1):108–16. doi: 10.3348/kjr.2019.0215

23. Yoon J, Lee H, Kim E, Moon H, Kwak J. Malignancy risk stratification of thyroid nodules: comparison between the Thyroid Imaging Reporting and Data System and the 2014 American Thyroid Association management guidelines. Radiology (2016) 278(3):917–24. doi: 10.1148/radiol.2015150056

24. Liu Y, Li X, Yan C, Liu L, Liao Y, Zeng H, et al. Comparison of diagnostic accuracy and utility of artificial intelligence-optimized ACR TI-RADS and original ACR TI-RADS: a multi-center validation study based on 2061 thyroid nodules. Eur Radiol (2022) 32(11):7733–42. doi: 10.1007/s00330-022-08827-y

25. Watkins L, O'Neill G, Young D, McArthur C. Comparison of British Thyroid Association, American College of Radiology TIRADS and artificial intelligence TIRADS with histological correlation: diagnostic performance for predicting thyroid malignancy and unnecessary fine needle aspiration rate. Br J Radiol (2021) 94(1123):20201444. doi: 10.1259/bjr.20201444

26. Wang Z, Vyas C, Van Benschoten O, Nehs M, Moore F, Marqusee E, et al. Quantitative analysis of the benefits and risk of thyroid nodule evaluation in patients ≥70 years old. Thyroid (2018) 28(4):465–71. doi: 10.1089/thy.2017.0655

27. Ruel E, Thomas S, Dinan M, Perkins J, Roman S, Sosa J. Adjuvant radioactive iodine therapy is associated with improved survival for patients with intermediate-risk papillary thyroid cancer. J Clin Endocrinol Metab (2015) 100(4):1529–36. doi: 10.1210/jc.2014-4332

28. Ito Y, Uruno T, Nakano K, Takamura Y, Miya A, Kobayashi K, et al. An observation trial without surgical treatment in patients with papillary microcarcinoma of the thyroid. Thyroid (2003) 13(4):381–7. doi: 10.1089/105072503321669875

29. Toniato A, Bernardi C, Piotto A, Rubello D, Pelizzo MR. Features of papillary thyroid carcinoma in patients older than 75 years. Updates Surg (2011) 63(2):115–8. doi: 10.1007/s13304-011-0060-0

30. Zhu H, Yang Y, Wu S, Chen K, Luo H, Huang J, et al. Diagnostic performance of US-based FNAB criteria of the 2020 Chinese guideline for malignant thyroid nodules: comparison with the 2017 American College of Radiology guideline, the 2015 American Thyroid Association guideline, and the 2016 Korean Thyroid Association guideline. Quant Imaging Med Surg (2021) 11(8):3604–18. doi: 10.21037/qims-20-1365

Keywords: thyroid nodules, risk stratification systems, elderly, ultrasonography, Thyroid Imaging Reporting and Data System

Citation: Ma X, Yu J, Huang Y, Cui Y and Cui K (2023) A comprehensive comparative assessment of eight risk stratification systems for thyroid nodules in the elderly population. Front. Oncol. 13:1265973. doi: 10.3389/fonc.2023.1265973

Received: 26 July 2023; Accepted: 30 October 2023;
Published: 15 November 2023.

Edited by:

Nerina Denaro, IRCCS Ca' Granda Foundation Maggiore Policlinico Hospital, Italy

Reviewed by:

Hwa Young Ahn, Chung-Ang University, Republic of Korea
Pia Pace-Asciak, University of Toronto, Canada

Copyright © 2023 Ma, Yu, Huang, Cui and Cui. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jing Yu, yujing822@163.com; Kefei Cui, cuikefei2010@126.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.