AUTHOR=Liu Jianning , Feng Zhuoying , Gao Ru , Liu Peng , Meng Fangang , Fan Lijun , Liu Lixiang , Du Yang TITLE=Establishment and validation of a multivariate logistic model for risk factors of thyroid nodules using lasso regression screening JOURNAL=Frontiers in Endocrinology VOLUME=15 YEAR=2024 URL=https://www.frontiersin.org/journals/endocrinology/articles/10.3389/fendo.2024.1346284 DOI=10.3389/fendo.2024.1346284 ISSN=1664-2392 ABSTRACT=Objective

This study aims to analyze the association between the occurrence of thyroid nodules and various factors and to establish a risk factor model for thyroid nodules.

Methods

The study population was divided into two groups: a group with thyroid nodules and a group without thyroid nodules. Regression with the least absolute shrinkage and selection operator (Lasso) was applied to the complete dataset for variable selection. Binary logistic regression was used to analyze the relationship between various influencing factors and the prevalence of thyroid nodules.

Results

Based on the screening results of Lasso regression and the subsequent establishment of the Binary Logistic Regression Model on the training dataset, it was found that advanced age (OR=1.046, 95% CI: 1.033-1.060), females (OR = 1.709, 95% CI: 1.342-2.181), overweight individuals (OR = 1.546, 95% CI: 1.165-2.058), individuals with impaired fasting glucose (OR = 1.590, 95% CI: 1.193-2.122), and those with dyslipidemia (OR = 1.588, 95% CI: 1.197-2.112) were potential risk factors for thyroid nodule disease (p<0.05). The area under the curve (AUC) of the receiver operating characteristic (ROC) curve for the Binary Logistic Regression Model is 0.68 (95% CI: 0.64-0.72).

Conclusions

advanced age, females, overweight individuals, those with impaired fasting glucose, and individuals with dyslipidemia are potential risk factors for thyroid nodule disease.