ORIGINAL RESEARCH article

Front. Cardiovasc. Med.

Sec. Cardiovascular Epidemiology and Prevention

Volume 12 - 2025 | doi: 10.3389/fcvm.2025.1490961

Development and validation of a carotid plaque risk prediction model for coal miners

Provisionally accepted
Yichun  LiYichun Li1Tieru  ZhangTieru Zhang1Chaoqun  CuiChaoqun Cui1Fan  ZhangFan Zhang1Yutong  YangYutong Yang1Haiwang  GaoHaiwang Gao2Yingbo  LiuYingbo Liu2Jianguang  HaoJianguang Hao3Jianru  WangJianru Wang3Jiao  WuJiao Wu3Mingzhong  LuoMingzhong Luo3Lijian  LeiLijian Lei1*
  • 1Department of Epidemiology, School of Public Health, Shanxi Medical University, Taiyuan, China
  • 2Peking University Medical Lu 'an Hospital Health Management Center, Changzhi, Shanxi, China
  • 3The Second People’s Hospital of Shanxi Province, Taiyuan, Shanxi, China

The final, formatted version of the article will be published soon.

Objective: Carotid plaque represents an independent risk factor for cardiovascular disease and a significant threat to human health. The aim of the study is to develop an accurate and interpretable predictive model for early detection the occurrence of carotid plaque.A cross-sectional study was conducted by selecting coal miners who participated in medical examinations from October 2021 to January 2022 at a hospital in North China. The features were initially screened using extreme gradient boosting (XGBoost), random forest, and LASSO regression, and the model was subsequently constructed using logistic regression. The three models were then compared, and the optimum model was identified. Finally, a nomogram was plotted to increase its interpretability.The XGBoost algorithm demonstrated superior performance in feature screening, identifying the top five features as follows: age, systolic blood pressure, low-density lipoprotein cholesterol, white blood cell count, and body mass index (BMI). The area under the curve (AUC), sensitivity, and specificity of the model constructed based on the XGBoost algorithm were 0.846, 0.867, and 0.702, respectively.Conclusions: It is possible to predict the presence of carotid plaque using machine learning. The model has high application value and can better predict the risk of carotid artery plaque in coal miners. Furthermore, it provides a theoretical basis for the health management of coal miners.

Keywords: XGBoost1, nomogram2, Machine Learning3, Coal miners4, Carotid plaque5

Received: 12 Sep 2024; Accepted: 24 Apr 2025.

Copyright: © 2025 Li, Zhang, Cui, Zhang, Yang, Gao, Liu, Hao, Wang, Wu, Luo and Lei. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Lijian Lei, Department of Epidemiology, School of Public Health, Shanxi Medical University, Taiyuan, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.