AUTHOR=Cao Yuan , Yang Yixian , Chen Yunchao , Luan Mengqi , Hu Yan , Zhang Lu , Zhan Weiwei , Zhou Wei TITLE=Optimizing thyroid AUS nodules malignancy prediction: a comprehensive study of logistic regression and machine learning models JOURNAL=Frontiers in Endocrinology VOLUME=15 YEAR=2024 URL=https://www.frontiersin.org/journals/endocrinology/articles/10.3389/fendo.2024.1366687 DOI=10.3389/fendo.2024.1366687 ISSN=1664-2392 ABSTRACT=Background

The accurate diagnosis of thyroid nodules with indeterminate cytology, particularly in the atypia of undetermined significance (AUS) category, remains challenging. This study aims to predict the risk of malignancy in AUS nodules by comparing two machine learning (ML) and three conventional logistic regression (LR) models.

Methods

A retrospective study on 356 AUS nodules in 342 individuals from 6728 patients who underwent thyroid surgery in 2021. All the clinical, ultrasonographic, and molecular data were collected and randomly separated into training and validation cohorts at a ratio of 7: 3. ML (random forest and XGBoost) and LR (lasso regression, best subset selection, and backward stepwise regression) models were constructed and evaluated using area under the curve (AUC), calibration, and clinical utility metrics.

Results

Approximately 90% (321/356) of the AUS nodules were malignant, predominantly papillary thyroid carcinoma with 68.6% BRAF V600E mutations. The final LR prediction model based on backward stepwise regression exhibited superior discrimination with AUC values of 0.83 (95% CI: 0.73-0.92) and 0.80 (95% CI: 0.67-0.94) in training and validation, respectively. Well calibration, and clinical utility were also confirmed. The ML models showed moderate performance. A nomogram was developed on the final LR model.

Conclusions

The LR model developed using the backward stepwise regression, outperformed ML models in predicting malignancy in AUS thyroid nodules. The corresponding nomogram based on this model provides a valuable and practical tool for personalized risk assessment, potentially reducing unnecessary surgeries and enhancing clinical decision-making.