Skip to main content

ORIGINAL RESEARCH article

Front. Endocrinol., 26 September 2024
Sec. Thyroid Endocrinology
This article is part of the Research Topic Advances in precision medicine in the management of thyroid nodules and thyroid cancer View all 35 articles

Development and validation of a predictive model for assessing the risk of follicular carcinoma in thyroid nodules identified as suspicious by intraoperative frozen section

Cheng Li*Cheng Li1*Yong LuoYong Luo1Yongli GanYongli Gan2Yan JiangYan Jiang3Qi LiQi Li1Jin HuangJin Huang4
  • 1Department of Thyroid Surgery, Ningbo Medical Center Lihuili Hospital, Ningbo, China
  • 2Ningbo Clinical Diagnostic Pathology Center, Ningbo, China
  • 3Department of Ultrasound, Ningbo Medical Center Lihuili Hospital, Ningbo, China
  • 4Department of Surgery, The Second Hospital of Ninghai County, Ningbo, China

Introduction: Follicular thyroid carcinoma (FTC) is the second most common thyroid malignancy and is characterized by a higher risk of distant metastasis compared to papillary thyroid cancer. Intraoperative frozen section (IOFS) diagnosis of FTC is challenging due to its limited sensitivity and accuracy, leading to uncertainty in intraoperative surgical decision-making. In response, we developed a predictive model to assess the risk of follicular carcinoma in thyroid nodules identified as suspicious for follicular neoplasm by IOFS.

Methods: This model was derived from preoperative clinical and ultrasound data of 493 patients who underwent thyroid surgery at Ningbo Medical Center Lihuili Hospital. It identified five significant predictors of follicular carcinoma: nodule size, thyroglobulin (Tg) level, hypoechogenicity, lobulated or irregular margins, and thick halo.

Results: The model demonstrated robust discrimination and calibration, with an area under the curve (AUC) of 0.83 (95% CI: 0.77-0.90) in the training set and 0.78 (95% CI: 0.68-0.88) in the validation set. In addition, it achieved a sensitivity of 81.63% (95% CI: 69.39-91.84) and 68.00% (95% CI: 48.00--4.00), a specificity of 77.42% (95% CI: 72.18-82.66) and 72.51% (95% CI: 65.50-78.96), an accuracy of 78.1% (95% CI: 73.4-82.4) and 71.9% (95% CI: 65.3-78.6), a positive predictive value (PPV) of 41. 67% (95% CI: 35.65-48.84) and 26.79% (95% CI: 19.40-34.33), respectively, and a negative predictive value (NPV) of 95.61% (95% CI: 92.86-97.99) and 94.07% (95% CI: 90.44-97.08) in the training and validation sets, respectively.

Conclusion: The model can accurately rule out FTC in low-risk nodules, thereby providing surgeons with a practical tool to determine the necessary extent of surgical intervention for nodules flagged as suspicious by IOFS.

1 Introduction

Thyroid cancer is the most common endocrine malignancy worldwide, with an estimated 586,202 new cases and 43,646 deaths in 2020, making it the ninth most common cancer worldwide (1). Follicular thyroid carcinoma (FTC), which accounts for 10-15% of all thyroid cancers (2), is the second most common type after papillary thyroid carcinoma (PTC). FTC is associated with a significantly higher risk of distant metastasis than PTC, with metastasis rates ranging from 19% to 33.7% in FTC patients—significantly higher than the 2.3% to 7% observed in PTC patients (38). In response to this risk, the American Thyroid Association (ATA) guidelines recommend total thyroidectomy for widely invasive FTC or minimally invasive FTC with high-risk factors (9). This surgical strategy not only allows for radioactive iodine ablation of potential residual disease postoperatively but also improves detection of recurrence by facilitating monitoring of postoperative thyroglobulin (Tg) levels, a marker for differentiated thyroid carcinoma, thus reducing the likelihood of recurrence (10). Therefore, accurate preoperative and intraoperative diagnosis of FTC is critical to guide surgical decision-making and determine the extent of surgical intervention required.

However, the preoperative and intraoperative diagnosis of FTC remains challenging. A definitive pathologic diagnosis of FTC requires evidence of vascular or capsular invasion, typically confirmed by postoperative paraffin pathology (11). As a result, preoperative fine needle aspiration (FNA) cytology is inadequate for the diagnosis of FTC and often yields indeterminate results, such as the Bethesda System for Reporting Thyroid Cytopathology (TBSRTC) categories III or IV (12). For nodules with indeterminate FNA results, intraoperative frozen section (IOFS) can help differentiate the malignant from benign lesions and guide surgical decision-making (13, 14). However, the sensitivity of IOFS for the diagnosis of FTC is also limited, with only 3.4% to 10.5% of cases being accurately identified (15, 16); Most of the remainder are initially reported as suspicious of follicular neoplasm (17), requiring definitive diagnosis by paraffin pathology postoperatively. This diagnostic uncertainty may result in some patients returning to surgery for contralateral thyroidectomy after a diagnosis of FTC, while others may have undergone total thyroidectomy only to later discover that the nodule was benign, such as a follicular adenoma.

In response to this diagnostic challenge, researchers have been actively developing methods to predict FTC. Various studies have linked specific clinical characteristics (18) and ultrasound features (1922) to the likelihood of FTC. In addition, predictive models have been developed to distinguish FTC from follicular adenoma in thyroid nodules diagnosed by postoperative paraffin pathology (2325). Despite these advances, there is currently no model designed to assess the risk of FTC intraoperatively in nodules identified as suspicious for follicular neoplasm by IOFS.

Therefore, this study aims to develop an intraoperative predictive model to assess the risk of FTC in thyroid nodules identified as suspicious for follicular neoplasm by IOFS. The model will utilize preoperative clinical and ultrasound data to improve the accuracy of intraoperative diagnoses, particularly in accurately ruling out FTC in low-risk nodules. This enhancement will aid surgical decision-making and guide the selection of appropriate surgical interventions.

2 Methods

2.1 Ethical approval and study design

This retrospective cohort study was approved by the Ethics Committee of Ningbo Medical Center Lihuili Hospital (approval number: KY2024SL082-01). The study adhered to the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) statement for model development, validation, and reporting (26). The study included patients who underwent thyroid surgery at Ningbo Medical Center Lihuili Hospital between January 1, 2019 and December 31, 2023. The hospital has two separate campuses: the Eastern Campus, located on the eastern side of the city, and the Xingning Campus, located in the center of the city. Each campus has independent laboratories, ultrasound facilities, operating rooms, and medical teams. Both campuses utilize the same Hospital Information System (HIS). All pathology diagnostic services, including cytology, molecular biology, IOFS, and postoperative paraffin pathology, are provided by the Ningbo Clinical Pathology Diagnostic Center in a uniform manner.

The diagnosis and treatment of thyroid nodules at the hospital follow the guidelines of ATA (9). Patients with thyroid nodules underwent an initial evaluation, including physical examination, thyroid function tests, and ultrasound. Nodules classified as Thyroid Imaging Reporting and Data System (TI-RADS) (27) category 4A or higher on ultrasound were recommended for FNA to facilitate cytologic examination. In addition to FNA Bethesda VI results and BRAF V600E mutation, our institution’s surgical indications for nodules without definitive preoperative malignancy evidence (such as those included in this study) include Bethesda categories IV or V, persistent Bethesda III after repeated FNA, compressive symptoms, cosmetic concerns, rapid growth, or other clinical symptoms necessitating surgery. Before surgery, all patients were tested for Tg and thyroglobulin antibody (TgAb) levels. IOFS was routinely performed during surgery to guide the surgical approach for nodules with indeterminate FNA results or those that had not undergone FNA.

2.2 Study cohort establishment and dataset partitioning

The study cohort was established by enrolling patients who met the following criteria. Inclusion criteria were (all conditions had to be met): 1) Patients underwent surgery for FNA TBSRTC IV or V, or repeated FNA results of TBSRTC III, or nodules causing compressive symptoms, cosmetic concerns, rapid growth, or other clinical symptoms necessitating surgery; 2) IOFS of the thyroid nodule is consistent with a follicular cell-derived neoplasm; 3) IOFS of the nodule indicates suspicion of a follicular neoplasm, but without definitive evidence of vascular or capsular invasion, requiring postoperative paraffin pathology for definitive diagnosis; 4) IOFS cannot diagnose the nodule as another type of malignancy. Exclusion criteria were (any of the following): 1) incomplete clinical or imaging data; 2) history of partial or total thyroid surgery; 3) presence of another nodule diagnosed as malignant by FNA or IOFS, or detection of BRAF V600E mutation by preoperative molecular testing, in the same or contralateral lobe; 4) nodules with inconclusive pathologic type by postoperative paraffin pathology; 5) preoperative confirmation of central (level VI) or lateral neck (levels II-V) lymph node metastasis; 6) preoperative confirmation of distant metastasis of thyroid cancer.

Patients meeting these criteria were identified from the hospital’s HIS to form the study cohort. Patients admitted to the Eastern Campus were assigned to the training set, while those admitted to the Xingning Campus were assigned to the validation set.

2.3 Candidate predictor selection and data collection

A comprehensive literature review identified several clinical and ultrasound features as potential predictors of FTC. These included age (19, 28), gender (19), smoking status (28), serum Tg levels (16, 29, 30), and serum TgAb levels (31). Thyroid nodule size (18, 19, 32), along with ultrasound features such as echogenicity (1921, 23, 24, 33), margin (20, 23, 33, 34), halo (19, 21, 23, 24), and calcification (18, 20, 21, 23, 35, 36) were also relevant. Additional ultrasound descriptors from the TI-RADS, including nodule composition and shape, were employed in this analysis.

At our institution, preoperative serum Tg and TgAb levels were measured by immunochemiluminescent assay using the Siemens ADVIA Centaur XP Immunoassay System. The reference ranges are 0.973 – 29.58 µg/L for Tg and 0 – 60.0 IU/mL for TgAb. Accordingly, cut-off values for Tg and TgAb in this study were set at 29.58 µg/L and 60.0 IU/mL, respectively. Test results below these cut-offs were considered “negative,” and those above were considered “positive.” Nodule size was determined by measuring the maximum diameter on ultrasound. Ultrasound features were graded according to the TI-RADS criteria. A halo, defined as a thin rim of decreased echogenicity surrounding the nodule (19), was categorized by thickness as absent, thin (< 2 mm), or thick (≥ 2 mm) (37).

Data on these predictors were collected from the hospital’s HIS and recorded in a Microsoft ACCESS database. To minimize the potential for human error, data entry was performed by two researchers and automatically checked for discrepancies by ACCESS. For ultrasound features, one researcher entered data based on diagnostic reports, while another sonographer, blinded to the patient’s final diagnosis, entered data based on the original images. Any discrepancies were resolved by a senior sonographer. IOFS and paraffin pathology data were obtained based on reports from the hospital pathology database. Any ambiguous cases were resolved by a senior pathologist.

2.4 Model construction and visualization

The model was constructed using the training set. The dependent variable was the paraffin pathology diagnosis of the nodule, which was classified into two categories: FTC and non-FTC (pathological types other than FTC). The independent variables were the candidate predictors identified in the literature review. To select the predictors for the model, we employed the Least Absolute Shrinkage and Selection Operator (LASSO) regression (38), which is particularly useful when the number of predictors is large relative to the number of observations (39). LASSO is a regression analysis method that performs variable selection by shrinking the coefficients of less influential predictors to zero via increasing the value of a penalty parameter, λ. The optimal value of λ was selected based on the one-standard-error (1-SE) criterion (40). This criterion identifies the optimal λ as the largest value that places the LASSO’s binomial deviance, a measure of model fit calculated using 10-fold cross-validation, within 1-SE of the minimum binomial deviance. The predictors with non-zero coefficients at this optimal λ were identified as the final predictors. A logistic regression model was then constructed on the training set using these predictors. The optimal threshold for the model was determined as the value that maximized the Youden index in the training set. To facilitate the clinical application of the model, we developed a nomogram based on the logistic regression model.

2.5 Model validation

The model was validated on both the training and validation sets. The discrimination of the model was assessed using the receiver operating characteristic (ROC) curve and the area under the curve (AUC). Calibration of the model was assessed using the calibration curve, which plots the predicted probabilities against the observed incidence of FTC. A perfectly calibrated model would lie on the diagonal reference line. The clinical utility of the model was determined by decision curve analysis (DCA), which calculates the net benefit by comparing the model’s performance against the treat-all strategy (assuming all nodules are FTC) and the treat-none strategy (assuming none of the nodules are FTC) (41). If the model provides a higher net benefit than both strategies over a range of threshold probabilities, it is considered clinically useful. The diagnostic performance of the model was evaluated by calculating sensitivity, specificity, accuracy, positive predictive value (PPV), and negative predictive value (NPV) at the optimal threshold in both the training and validation sets.

2.6 Statistical analysis

All modeling procedures were performed using the R software (42) (https://www.R-project.org/), version 4.3.3. the LASSO regression was performed using the package “glmnet” (43). The logistic regression model and nomogram were constructed using the package “rms” (44). The ROC curve, calibration curve, and DCA were generated using the packages “pROC” (45), “probably” (46), and “rmda” (47), respectively. Plots were generated using the package “ggplot2” (48). Continuous variables were presented as mean ± standard deviation (SD) and compared using the t-test. Categorical variables were presented as frequencies and percentages and compared using the chi-squared test or Fisher’s exact test. A p-value of less than 0.05 was considered statistically significant.

3 Results

A total of 8,997 patients who underwent thyroid surgery at Ningbo Medical Center Lihuili Hospital between January 1, 2019, and December 31, 2023, were identified from the hospital’s HIS. Of these patients, 5,681 were treated at the Eastern Campus and 3,316 at the Xingning Campus. A total of 493 patients met the inclusion and exclusion criteria and were included in the study cohort. Of these, 297 patients from the Eastern Campus were designated as the training set, while 196 patients from the Xingning Campus were designated as the validation set, as detailed in Figure 1.

Figure 1
www.frontiersin.org

Figure 1. Flowchart of study cohort establishment and dataset partitioning. IOFS, intraoperative frozen section; FNA, fine-needle aspiration. The number of thyroid nodules is equal to the number of patients.

Table 1 presents the baseline characteristics of the training and validation sets. In the training set (n = 297), 49 patients (16.5%) were diagnosed with FTC, while 248 (83.5%) had non-FTC diagnoses. The non-FTC group comprised 8 (2.7%) with PTC, 95 (32.0%) with follicular tumor of uncertain malignant potential, 34 (11.4%) with follicular adenoma, 107 (36.0%) with follicular nodular disease, and 4 (1.3%) with other benign types. The validation set (n = 196) included 25 patients (12.8%) with FTC and 171 (87.2%) with non-FTC diagnoses, consisting of 10 (5.1%) with PTC, 72 (36.7%) with follicular tumor of uncertain malignant potential, 12 (6.1%) with follicular adenoma, 76 (38.8%) with follicular nodular disease, and 1 (0.5%) with other benign types.

Table 1
www.frontiersin.org

Table 1. Baseline characteristics and comparison of data distribution across training and validation sets.

Females constituted 71.4% of the total cohort (352/493 patients), with similar proportions in the training (70.0%, 208/297) and validation (73.5%, 144/196) sets. The mean age of the entire cohort was 50.8 ± 13.8 years, with comparable ages in the training (50.0 ± 13.9 years) and validation (52.0 ± 13.7 years) sets. Non-smokers accounted for 87.6% of the cohort (432/493 patients), with similar distributions in the training (86.5%, 257/297) and validation (89.3%, 175/196) sets. The mean nodule size was 36.1 ± 15.4 mm overall, with slight variations between the training (37.0 ± 16.1 mm) and validation (34.7 ± 14.1 mm) sets.

Additional clinical characteristics, including Tg and TgAb test results, and ultrasound features such as nodule composition, shape, echogenicity, margin, and calcification characteristics, are detailed in Table 1. All baseline characteristics were comparable between the training and validation sets, with no significant differences observed (all p > 0.05), thus ensuring a reliable assessment of the model’s performance and its potential generalizability to new patient populations.

Univariate analysis in the training set revealed six predictors significantly associated with FTC: nodule size, Tg levels, smoking status, and ultrasound features of echogenicity, margin, and halo (all p < 0.05). These results are detailed in Table 2. The LASSO regression identified the optimal value of λ based on the 1-SE criterion, as illustrated in Figure 2A. At the optimal value of λ, five coefficients of the predictors remained non-zero: nodule size, Tg levels, and ultrasound features of echogenicity, margin, and halo, as illustrated in Figure 2B. A logistic regression model constructed using these predictors in the training set demonstrated a good fit, with a χ2 value of 67.69 (p < 0.01) and a pseudo-R2 of 0.34. The model demonstrated that larger nodule size (odds ratio (OR): 1.03, 95% confidence interval (CI): 1.01-1.06), positive Tg levels (OR: 2.47, 95% CI: 1.18-5.16), hypoechogenicity (OR: 4.90, 95% CI: 1.01-23.69), lobulated or irregular margins (OR: 7. 70, 95% CI: 2.23-26.54) and thick halo (OR: 13.84, 95% CI: 4.73-40.53) significantly increased the risk of FTC compared to their respective reference categories (all p < 0.05), as detailed in Table 3.

Table 2
www.frontiersin.org

Table 2. Univariate analysis of potential predictors for follicular carcinoma in the training set.

Figure 2
www.frontiersin.org

Figure 2. Selection of predictors for follicular thyroid carcinoma risk model using LASSO regression. (A) Optimal value of λ selection by 10-fold cross-validation. The “Binomial deviance” on the y-axis is a measure of the goodness of fit of the model. The optimal value of λ was determined by the one-standard-error (1-SE) of the minimum binomial deviance (marked by the red dotted vertical line). (B) The LASSO coefficients were shrunk by the increasing value of λ. At the optimal value of λ(marked by the red dotted vertical line), five variables with non-zero coefficients remained. The five variables were selected as predictors for the model. LASSO, least absolute shrinkage and selection operator. SE, standard error.

Table 3
www.frontiersin.org

Table 3. Logistic regression model for the prediction of follicular thyroid carcinoma.

A nomogram was developed based on this logistic regression model, as shown in Figure 3. In the training set, the maximum Youden index was 0.59, corresponding to a model probability of 0.175 and a nomogram score of 173 points, which served as the optimal threshold. Consequently, nodules with a nomogram score of 173 points or above were classified as high-risk for FTC, while those with scores below this threshold were classified as low-risk.

Figure 3
www.frontiersin.org

Figure 3. Nomogram to predict follicular thyroid carcinoma risk. To calculate the risk of follicular thyroid carcinoma, locate the features of the nodule on the corresponding axes, then draw a vertical line from each location to the “Points” axis to determine the points for that feature. Add the points of all the features and place the total on the “Total Points” axis, which corresponds to the risk probability of follicular thyroid carcinoma on the “Risk” axis. A total of 173 points(risk probability of 0.175) or more indicates a high risk of follicular thyroid carcinoma. Tg, thyroglobulin; US, ultrasound.

The ROC curve of the model, depicted in Figure 4, demonstrated satisfactory discrimination, with an AUC of 0.83 (95% CI: 0.77-0.90) in the training set and 0.78 (95% CI: 0.68-0.88) in the validation set. The calibration curves, depicted in Figure 5, demonstrated a satisfactory agreement between the predicted probabilities and the observed incidences of FTC in both sets, indicating an accurate calibration. The DCA, illustrated in Figure 6, revealed that the model provided a higher net benefit than both the treat-all and treat-none strategies over a wide range of threshold probabilities, underscoring its clinical utility in both the training and validation sets.

Figure 4
www.frontiersin.org

Figure 4. ROC curves evaluating the discrimination of the follicular thyroid carcinoma risk model in training (A) and validation sets (B). The light blue shaded area represents the 95% confidence interval of the ROC curves. AUC, area under the curve; ROC, receiver operating characteristic; CI, confidence interval.

Figure 5
www.frontiersin.org

Figure 5. Calibration curves evaluating the calibration of the follicular thyroid carcinoma risk model in training (A) and validation sets (B). The calibration curves illustrate the concordance between the predicted probabilities and the actual observed incidence of follicular thyroid carcinoma in the training and validation sets. The diagonal dashed lines serve as a benchmark for perfect prediction. The light blue shaded area represents the 95% confidence interval (CI) of the calibration curves.

Figure 6
www.frontiersin.org

Figure 6. Decision curve analysis (DCA) evaluating the clinical utility of the follicular thyroid carcinoma risk model in training and validation sets. The line labeled “treat-all” represents the net benefit of assuming that all nodules are follicular thyroid carcinoma, and the line labeled “treat-none” represents the net benefit of assuming that no nodules are follicular thyroid carcinoma.

At the optimal threshold, the model achieved a sensitivity of 81.63% (95% CI: 69.39–91.84), a specificity of 77.42% (95% CI: 72.18-82.66), an accuracy of 78.1% (95% CI: 73.4 – 82.4), a PPV of 41.67% (95% CI: 35.65–48.84), and an NPV of 95.61% (95% CI: 92.86–97.99) in the training set. In the validation set, the model achieved a sensitivity of 68.00% (95% CI: 48.00–84.00), a specificity of 72.51% (95% CI: 65.50–78.96), an accuracy of 71.9% (95% CI: 65.3 – 78.6), a PPV of 26.79% (95% CI: 19.40–34.33), and an NPV of 94.07% (95% CI: 90.44-97.08), as detailed in Table 4.

Table 4
www.frontiersin.org

Table 4. Performance of the predictive model for predicting follicular carcinoma in training and validation setsa.

4 Discussion

In this study, we developed an intraoperative predictive model to assess the risk of FTC in thyroid nodules identified by IOFS as suspicious for follicular neoplasm. The model exhibited a high NPV, with 95.61% in the training set and 94.07% in the validation set. This enables surgeons to confidently rule out FTC in low-risk nodules. This level of diagnostic confidence encourages surgeons to adopt more conservative surgical approaches, such as preferring lobectomy over total thyroidectomy. This reduces the number of unnecessary surgeries and the associated risks, including hypoparathyroidism and recurrent laryngeal nerve injury. Ultimately, this enhances patient outcomes and quality of life.

Furthermore, the model has demonstrated strong discrimination, calibration, and clinical utility in both the training and validation sets. This underscores its potential for integration into clinical practice. The predictors incorporated in the model, including nodule size, Tg levels, and ultrasound features such as echogenicity, margin, and halo, are commonly available in clinical settings, facilitating their routine use. The accompanying nomogram serves as a practical tool, aiding surgeons in making informed decisions regarding the necessary extent of surgical intervention.

To our knowledge, this is the first model specifically designed to predict the risk of FTC in this setting. Existing models, such as the one by Yu et al. (16), aimed to predict FTC in all patients undergoing thyroid surgery, but showed a low PPV of 11.1% due to the low prevalence of FTC in unselected thyroid surgery patients (only 61 out of 3,588 cases in the study). Another model by Macias et al. (28) was designed to predict malignancy in nodules with a cytologic diagnosis of follicular neoplasm (Bethesda IV) on FNA. While a significant proportion of these nodules was malignant (33.8% of 151 patients, of which 29.4% were FTC), this model was not exclusively focused on identifying FTC. Furthermore, it did not include cases of FTC that were misclassified as Bethesda I, III, or V due to FNA sampling error or cytologic limitations, which often require further confirmation by IOFS. Therefore, our model addresses a critical need by providing a targeted approach to predict the risk of FTC in nodules during intraoperative examination.

Predictors identified in the model are consistent with previous studies. Larger nodule size, positive Tg levels, ultrasound features of hypoechogenicity, lobulated or irregular margin, and thick halo have been reported as predictors of FTC in several studies (16, 18, 19, 21, 21, 23, 24, 29, 30, 3234). However, other predictors such as older age (19), male sex (19), the presence of calcifications (20, 23, 35) or microcalcifications (21, 36) on ultrasound, did not show a significant association in our analysis. Furthermore, although smoking showed a significant association in our univariate analysis, it was not included in the final model. This discrepancy may be due to the unique characteristics of our study population or the limited sample size. Further studies with larger cohorts are needed to validate these findings.

Despite its strengths, this study has several limitations. First, due to the study design, some malignancies other than FTC may have been inadvertently included in the non-FTC group. Despite the application of strict inclusion and exclusion criteria, a total of 18 (3.7%) patients diagnosed with PTC were included in this group, mainly because some patients in this study did not undergo preoperative FNA or BRAF V600E mutation testing. This inclusion may affect the performance and generalizability of the model. However, with the increasing use of these diagnostic tools in clinical practice, we anticipate that the impact of this limitation will be minimized. Second, the model showed a relatively low PPV of 41.67% (95% CI: 35.65-48.84) in the training set and 26.79% (95% CI: 19.40-34.33) in the validation set, indicating a high false-positive rate. However, the high NPV of the model indicates that it is more effective in excluding FTC than in confirming it. This still represents a significant clinical benefit. Third, the retrospective nature of the study may introduce selection bias, especially since some patients who did not undergo IOFS were not included in the study. Furthermore, the relatively small number of subjects and an even smaller number of FTC cases compared to the number of predictors could lead to overfitting of the model, although LASSO regression was used to mitigate this risk. Finally, although the model was validated using an independent dataset from another campus of the hospital, it may still be subject to institutional bias and may not generalize well to other populations, underscoring the need for further multicenter studies to validate the model’s performance in different clinical settings.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by Ethics Committee of Ningbo Medical Center Lihuili Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

CL: Conceptualization, Formal analysis, Funding acquisition, Methodology, Writing – original draft, Writing – review & editing. YL: Conceptualization, Formal analysis, Methodology, Writing – review & editing. YG: Data curation, Supervision, Writing – review & editing. YJ: Data curation, Supervision, Writing – review & editing. QL: Data curation, Supervision, Writing – review & editing. JH: Data curation, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This research is funded by the Medical and Health Research Project of Zhejiang Province (2024KY1484).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: A Cancer J Clin. (2021) 71:209–49. doi: 10.3322/caac.21660

Crossref Full Text | Google Scholar

2. Ashorobi D, Lopez PP. Follicular thyroid cancer. In: StatPearls. StatPearls Publishing, Treasure Island (FL (2024).

Google Scholar

3. Ruegemer JJ, Hay ID, Bergstralh EJ, Ryan JJ, Offord KP, Gorman CA. Distant metastases in differentiated thyroid carcinoma: A multivariate analysis of prognostic variables*. J Clin Endocrinol Metab. (1988) 67:501–8. doi: 10.1210/jcem-67-3-501

PubMed Abstract | Crossref Full Text | Google Scholar

4. Benbassat C, Mechlis-Frish S, Hirsch D. Clinicopathological characteristics and long-term outcome in patients with distant metastases from differentiated thyroid cancer. World J Surg. (2006) 30:1088–95. doi: 10.1007/s00268-005-0472-4

PubMed Abstract | Crossref Full Text | Google Scholar

5. Lin JD, Huang MJ, Juang JH, Chao TC, Huang BY, Chen KW, et al. Factors related to the survival of papillary and follicular thyroid carcinoma patients with distant metastases. Thyroid. (1999) 9:1227–35. doi: 10.1089/thy.1999.9.1227

PubMed Abstract | Crossref Full Text | Google Scholar

6. Lin JD, Chao TC, Chou SC, Hsueh C. Papillary thyroid carcinomas with lung metastases. Thyroid. (2004) 14:1091–6. doi: 10.1089/thy.2004.14.1091

PubMed Abstract | Crossref Full Text | Google Scholar

7. Avci İ, Başkurt O, Şeker S, Çelik SE. Late-onset metastasis of intracranial papillary thyroid carcinoma. World Neurosurg. (2019) 130:7–9. doi: 10.1016/j.wneu.2019.06.164

PubMed Abstract | Crossref Full Text | Google Scholar

8. Shaha AR, Shah JP, Loree TR. Differentiated thyroid cancer presenting initially with distant metastasis. Am J Surg. (1997) 174:474–6. doi: 10.1016/S0002-9610(97)00158-X

PubMed Abstract | Crossref Full Text | Google Scholar

9. Haugen BR, Alexander EK, Bible KC, Doherty GM, Mandel SJ, Nikiforov YE, et al. 2015 American thyroid association management guidelines for adult patients with thyroid nodules and differentiated thyroid cancer: the american thyroid association guidelines task force on thyroid nodules and differentiated thyroid cancer. Thyroid: Off J Am Thyroid Assoc. (2016) 26:1–133. doi: 10.1089/thy.2015.0020

Crossref Full Text | Google Scholar

10. Gaafar H, Hefny M, Refaat D, Metwalli AE. TOTAL VERSUS COMPLETION THYROIDECTOMY IN MANAGEMENT OF SUSPECTED THYROID CARCINOMA. Zagazig Univ Med J. (2013) 19:1–8. doi: 10.21608/zumj.2013.4335

Crossref Full Text | Google Scholar

11. WHO. Classification of tumours editorial board. In: WHO Classification of Tumours: Endocrine and Neuroendocrine Tumours, 5 edn, vol. 8. International Agency for Research on Cancer, Lyon, France (2022).

Google Scholar

12. Ali SZ, Baloch ZW, Cochand-Priollet B, Schmitt FC, Vielh P, VanderLaan PA. The 2023 bethesda system for reporting thyroid cytopathology. Thyroid: Off J Am Thyroid Assoc. (2023) 33:1039–44. doi: 10.1089/thy.2023.0141

Crossref Full Text | Google Scholar

13. Posillico SE, Wilhelm SM, McHenry CR. The utility of frozen section examination for determining the extent of thyroidectomy in patients with a thyroid nodule and “atypia/follicular lesion of undetermined significance. Am J Surg. (2015) 209:552–6. doi: 10.1016/j.amjsurg.2014.09.026

PubMed Abstract | Crossref Full Text | Google Scholar

14. Mao Z, Ding Y, Wen L, Zhang Y, Wu G, You Q, et al. Combined fine-needle aspiration and selective intraoperative frozen section to optimize prediction of Malignant thyroid nodules: A retrospective cohort study of more than 3000 patients. Front Endocrinol. (2023) 14:1091200. doi: 10.3389/fendo.2023.1091200

Crossref Full Text | Google Scholar

15. Udelsman R, Westra WH, Donovan PI, Sohn TA, Cameron JL. Randomized prospective evaluation of frozen-section analysis for follicular neoplasms of the thyroid. Ann Surg. (2001) 233:716–22. doi: 10.1097/00000658-200105000-00016

PubMed Abstract | Crossref Full Text | Google Scholar

16. Yu Q, Liu K, Xie C, Ma D, Wu Y, Jiang H, et al. Development and validation of a preoperative prediction model for follicular thyroid carcinoma. Clin Endocrinol. (2019) 91:348–55. doi: 10.1111/cen.14002

Crossref Full Text | Google Scholar

17. Emerick GT, Duh QY, Siperstein AE, Burrow GN, Clark OH. Diagnosis, treatment, and outcome of follicular thyroid carcinoma. Cancer. (1993) 72:3287–95. doi: 10.1002/1097-0142(19931201)72:11<3287::aid-cncr2820721126>3.0.co;2-5

PubMed Abstract | Crossref Full Text | Google Scholar

18. Lee JH, Ha EJ, Lee DH, Han M, Park JH, Kim JH. Clinicoradiological characteristics in the differential diagnosis of follicular-patterned lesions of the thyroid: A multicenter cohort study. Korean J Radiol. (2022) 23:763–72. doi: 10.3348/kjr.2022.0079

PubMed Abstract | Crossref Full Text | Google Scholar

19. Sillery JC, Reading CC, Charboneau JW, Henrichsen TL, Hay ID, Mandrekar JN. Thyroid follicular carcinoma: sonographic features of 50 cases. Am J Roentgenol. (2010) 194:44–54. doi: 10.2214/AJR.09.3195

Crossref Full Text | Google Scholar

20. Ahn HS, Kim HS, Hong MJ. Ultrasonographic and cytologic assessments of follicular neoplasms of the thyroid: Predictive features differentiating follicular carcinoma from follicular adenoma. PloS One. (2022) 17:e0271437. doi: 10.1371/journal.pone.0271437

PubMed Abstract | Crossref Full Text | Google Scholar

21. Borowczyk M, Woliński K, Więckowska B, Jodłowska-Siewert E, Szczepanek-Parulska E, Verburg FA, et al. Sonographic features differentiating follicular thyroid cancer from follicular adenoma-A meta-analysis. Cancers. (2021) 13:938. doi: 10.3390/cancers13050938

PubMed Abstract | Crossref Full Text | Google Scholar

22. Li J, Li C, Zhou X, Huang J, Yang P, Cang Y, et al. US risk stratification system for follicular thyroid neoplasms. Radiology. (2023) 309:e230949. doi: 10.1148/radiol.230949

PubMed Abstract | Crossref Full Text | Google Scholar

23. Huang Q, Xie L, Huang L, Wei W, Li H, Zhuang Y, et al. Development and validation of an ultrasonic diagnostic model for differentiating follicular thyroid carcinoma from follicular adenoma. Int J Gen Med. (2021) 14:5069–78. doi: 10.2147/IJGM.S331338

PubMed Abstract | Crossref Full Text | Google Scholar

24. Wu Q, Qu Y, Li Y, Liu Y, Shen J, Wang Y. Logistic regression analysis of contrast-enhanced ultrasound and conventional ultrasound of follicular thyroid carcinoma and follicular adenoma. Gland Surg. (2021) 10:2890–900. doi: 10.21037/gs-21-535

PubMed Abstract | Crossref Full Text | Google Scholar

25. Tang P, Ren C, Shen L, Zhou Z. Development and validation of a diagnostic nomogram for the preoperative differentiation between follicular thyroid carcinoma and follicular thyroid adenomas. J Comput Assisted Tomogr. (2021) 45:128. doi: 10.1097/RCT.0000000000001078

Crossref Full Text | Google Scholar

26. Collins GS, Reitsma JB, Altman DG, Moons K. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD Statement. BMC Med.. (2015) 13(1):1. doi: 10.1186/s12916-014-0241-z

PubMed Abstract | Crossref Full Text | Google Scholar

27. Tessler FN, Middleton WD, Grant EG, Hoang JK, Berland LL, Teefey SA, et al. ACR thyroid imaging, reporting and data system (TI-RADS): white paper of the ACR TI-RADS committee. J Am Coll Radiol. (2017) 14:587–95. doi: 10.1016/j.jacr.2017.01.046

PubMed Abstract | Crossref Full Text | Google Scholar

28. Macias CA, Arumugam D, Arlow RL, Eng OS, Lu SE, Javidian P, et al. A risk model to determine surgical treatment in patients with thyroid nodules with indeterminate cytology. Ann Surg Oncol. (2015) 22:1527–32. doi: 10.1245/s10434-014-4190-8

PubMed Abstract | Crossref Full Text | Google Scholar

29. Lee SH, Baek JS, Lee JY, Lim JA, Cho SY, Lee TH, et al. Predictive factors of Malignancy in thyroid nodules with a cytological diagnosis of follicular neoplasm. Endocrine Pathol. (2013) 24:177–83. doi: 10.1007/s12022-013-9263-x

Crossref Full Text | Google Scholar

30. Kim HJ, Mok JO, Kim CH, Kim YJ, Kim SJ, Park HK, et al. Preoperative serum thyroglobulin and changes in serum thyroglobulin during TSH suppression independently predict follicular thyroid carcinoma in thyroid nodules with a cytological diagnosis of follicular lesion. Endocrine Res. (2017) 42:154–62. doi: 10.1080/07435800.2016.1262395

Crossref Full Text | Google Scholar

31. Chen Z, Lin Y, Lai S, Wang P, Li J, Wang L, et al. The utility of serum anti-thyroglobulin antibody and thyroglobulin in the preoperative differential diagnosis of thyroid follicular neoplasms. Endocrine. (2022) 76:369–76. doi: 10.1007/s12020-022-02993-1

PubMed Abstract | Crossref Full Text | Google Scholar

32. Staubitz JI, Musholt PB, Musholt TJ. The surgical dilemma of primary surgery for follicular thyroid neoplasms. Best Pract Res Clin Endocrinol Metab. (2019) 33:101292. doi: 10.1016/j.beem.2019.101292

PubMed Abstract | Crossref Full Text | Google Scholar

33. Liu BJ, Zhang YF, Zhao CK, Wang HX, Li MX, Xu HX. Conventional ultrasound characteristics, TIRADS category and shear wave speed measurement between follicular adenoma and follicular thyroid carcinoma. Clin Hemorheol Microcirculation. (2020) 75:291–301. doi: 10.3233/CH-190750

Crossref Full Text | Google Scholar

34. Kim DH, Baek JW, Lee YJ, Baek HJ. Korean Thyroid Imaging Reporting and Data System features of follicular thyroid adenoma and carcinoma: A single-center study. Ultrasonography. (2017) 36:349–54. doi: 10.14366/usg.17020

PubMed Abstract | Crossref Full Text | Google Scholar

35. Kuo TC, Wu MH, Chen KY, Hsieh MS, Chen A, Chen CN. Ultrasonographic features for differentiating follicular thyroid carcinoma and follicular adenoma. Asian J Surg. (2020) 43:339–46. doi: 10.1016/j.asjsur.2019.04.016

PubMed Abstract | Crossref Full Text | Google Scholar

36. Rago T, Di Coscio G, Basolo F, Scutari M, Elisei R, Berti P, et al. Combined clinical, thyroid ultrasound and cytological features help to predict thyroid Malignancy in follicular and Hürthle cell thyroid lesions: Results from a series of 505 consecutive patients. Clin Endocrinol. (2007) 66:13–20. doi: 10.1111/j.1365-2265.2006.02677.x

Crossref Full Text | Google Scholar

37. Wu SJ, Tan L, Ruan JL, Qiu Y, Hao SY, Yang HY, et al. ACR TI-RADS classification combined with number of nodules, halo features optimizes diagnosis and prediction of follicular thyroid cancer. Clin Hemorheol Microcirculation. (2022) 82:323–34. doi: 10.3233/CH-221507

Crossref Full Text | Google Scholar

38. Tibshirani R. Regression shrinkage and selection via the lasso. J R Stat Society: Ser B (Methodological). (1996) 58:267–88. doi: 10.1111/j.2517-6161.1996.tb02080.x

Crossref Full Text | Google Scholar

39. Scherr S, Zhou J. Automatically identifying relevant variables for linear regression with the lasso method: A methodological primer for its application with R and a performance contrast simulation with alternative selection strategies. Communication Methods Measures. (2020) 14:204–11. doi: 10.1080/19312458.2019.1677882

Crossref Full Text | Google Scholar

40. Maranzano P, Otto P, Fassò A. Adaptive LASSO estimation for functional hidden dynamic geostatistical models. Stochastic Environ Res Risk Assess. (2023) 37(9):3615–37. doi: 10.1007/s00477-023-02466-5

Crossref Full Text | Google Scholar

41. Vickers AJ, Elkin EB. Decision curve analysis: A novel method for evaluating prediction models. Med Decision Making: Int J Soc Med Decision Making. (2006) 26:565–74. doi: 10.1177/0272989X06295361

Crossref Full Text | Google Scholar

42. R Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing (2024).

Google Scholar

43. Tay JK, Narasimhan B, Hastie T. Elastic net regularization paths for all generalized linear models. J Stat Software. (2023) 106:1–31. doi: 10.18637/jss.v106.i01

Crossref Full Text | Google Scholar

44. Harrell FE Jr. Rms: Regression Modeling Strategies. (2023). Available at: https://CRAN.R-project.org/package=rms

Google Scholar

45. Robin X, Turck N, Hainard A, Tiberti N, Lisacek F, Sanchez JC, et al. pROC: An open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinf. (2011) 12:77. doi: 10.1186/1471-2105-12-77

Crossref Full Text | Google Scholar

46. Kuhn M, Vaughan D, Ruiz E. Probably: Tools for Post-Processing Predicted Values. (2024). Available at: https://github.com/tidymodels/probably

Google Scholar

47. Brown M. Rmda: Risk Model Decision Analysis. (2018). Available at: https://CRAN.R-project.org/package=rmda

Google Scholar

48. Wickham H. Ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag New York (2016). Available at: https://ggplot2.tidyverse.org

Google Scholar

Keywords: predictive modeling, intraoperative frozen section, thyroid nodules, follicular carcinoma, risk assessment, ultrasound features, nomogram

Citation: Li C, Luo Y, Gan Y, Jiang Y, Li Q and Huang J (2024) Development and validation of a predictive model for assessing the risk of follicular carcinoma in thyroid nodules identified as suspicious by intraoperative frozen section. Front. Endocrinol. 15:1431247. doi: 10.3389/fendo.2024.1431247

Received: 11 May 2024; Accepted: 02 September 2024;
Published: 26 September 2024.

Edited by:

Horatiu Silaghi, University of Medicine and Pharmacy Iuliu Hatieganu, Romania

Reviewed by:

Jeehee Yoon, Chonnam National University Bitgoeul Hospital, Republic of Korea
Anupam Kotwal, University of Nebraska Medical Center, United States

Copyright © 2024 Li, Luo, Gan, Jiang, Li and Huang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Cheng Li, ZHIuY2hlbmdfbGlAb3V0bG9vay5jb20=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.