AUTHOR=Yao Jun , Du Zhilin , Yang Fuyue , Duan Ran , Feng Tong TITLE=The relationship between heavy metals and metabolic syndrome using machine learning JOURNAL=Frontiers in Public Health VOLUME=12 YEAR=2024 URL=https://www.frontiersin.org/journals/public-health/articles/10.3389/fpubh.2024.1378041 DOI=10.3389/fpubh.2024.1378041 ISSN=2296-2565 ABSTRACT=Background

Exposure to high levels of heavy metals has been widely recognized as an important risk factor for metabolic syndrome (MetS). The main purpose of this study is to assess the associations between the level of heavy metal exposure and Mets using machine learning (ML) method.

Methods

The data used in this study are from the national health and nutrition examination survey 2003–2018. According to the demographic information and heavy metal exposure level of participants, a total of 22 variables were included. Lasso was used to screen out the key variables, and 9 commonly used ML models were selected to establish the associations with the 5-fold cross validation method. Finally, we choose the SHapley Additive exPlanations (SHAP) method to explain the prediction results of Adaboost model.

Results

11,667 eligible individuals were randomly divided into two groups to train and verify the prediction model. Through lasso, characteristic variables were selected from 24 variables as predictors. The AUC (area under curve) of the models selected in this study were all greater than 0.7, and AdaBoost was the best model. The AUC value of AdaBoost was 0.807, the accuracy was 0.720, and the sensitivity was 0.792. It is noteworthy that higher levels of cadmium, body mass index, cesium, being female, and increasing age were associated with an increased probability of MetS. Conversely, lower levels of cobalt and molybdenum were linked to a decrease in the estimated probability of MetS.

Conclusion

Our study highlights the AdaBoost model proved to be highly effective, precise, and resilient in detecting a correlation between exposure to heavy metals and MetS. Through the use of interpretable methods, we identified cadmium, molybdenum, cobalt, cesium, uranium, and barium as prominent contributors within the predictive model.