Skip to main content

ORIGINAL RESEARCH article

Front. Med.
Sec. Ophthalmology
Volume 11 - 2024 | doi: 10.3389/fmed.2024.1452756
This article is part of the Research Topic Predictive and Diagnostic Approaches for Systemic Disorders Using Ocular Assessment View all 8 articles

Development and Construction of a Cataract Risk Prediction Model Based on Biochemical Indices:The National Health and Nutrition Examination Survey, 2005 to 2008

Provisionally accepted
  • 1 Xinjiang Medical University, Ürümqi, China
  • 2 First Affiliated Hospital of Xinjiang Medical University, Urumqi, Xinjiang Uyghur Region, China

The final, formatted version of the article will be published soon.

    The aim of this study is to develop and validate a novel multivariable prediction model capable of accurately estimating the probability of cataract development, utilizing parameters such as blood biochemical markers and age. Design: This population-based cross-sectional study comprised 9,566 participants drawn from the National Health and Nutrition Examination Survey (NHANES) across the 2005 to 2008 cycles. Methods: Demographic information and laboratory test results from the patients were collected and analyzed using LASSO regression and multivariate logistic regression to accurately capture the influence of biochemical indicators on the outcomes. The SHAP (Shapley Additive Explanations) scale was employed to assess the importance of each clinical feature, excluding age. A multivariate logistic regression model was then developed and visualized as a nomogram. To assess the model's performance, its discrimination, calibration, and clinical utility were evaluated using receiver operating characteristic (ROC) curves, 10-fold cross-validation, Hosmer-Lemeshow calibration curves, and decision curve analysis (DCA), respectively. Results: Logistic regression analysis identified age, erythrocyte folate (nmol/L), blood glucose (mmol/L), and blood urea nitrogen (mmol/L) as independent risk factors for cataract, and these variables were incorporated into a multivariate logistic regression-based nomogram for cataract risk prediction. The area under the receiver operating characteristic (ROC) curve (AUC) for cataract risk prediction was 0.917 (95% CI: 0.9067-0.9273) in the training cohort, and 0.9148 (95% CI: 0.8979-0.9316) in the validation cohort. The Hosmer-Lemeshow calibration curve demonstrated a good fit, indicating strong model calibration. Ten-fold cross-validation confirmed the logistic regression model's robust predictive performance and stability during external validation. Decision curve analysis (DCA) demonstrated that the nomogram prediction model provided greater clinical benefit for predicting cataract risk when the patient's threshold probability ranged from 0.10 to 0.90. Conclusions: This study identified blood urea nitrogen (mmol/L), serum glucose (mmol/L), and erythrocyte folate (mmol/L) as significant risk factors for cataract. A risk prediction model was developed, demonstrating strong predictive accuracy and clinical utility, offering clinicians a reliable tool for early and effective diagnosis. Cataract development may be delayed by reducing levels of blood urea nitrogen, serum glucose, and erythrocyte folate through lifestyle improvements and dietary modifications.

    Keywords: Cataract, Blood biochemical indicators, Prediction model, nomogram, machine learning

    Received: 21 Jun 2024; Accepted: 02 Oct 2024.

    Copyright: © 2024 Guoqing and Yi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: Xiang-Long Yi, First Affiliated Hospital of Xinjiang Medical University, Urumqi, Xinjiang Uyghur Region, China

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.