AUTHOR=Cao Shunshun , Hu Yangyang TITLE=Creating machine learning models that interpretably link systemic inflammatory index, sex steroid hormones, and dietary antioxidants to identify gout using the SHAP (SHapley Additive exPlanations) method JOURNAL=Frontiers in Immunology VOLUME=15 YEAR=2024 URL=https://www.frontiersin.org/journals/immunology/articles/10.3389/fimmu.2024.1367340 DOI=10.3389/fimmu.2024.1367340 ISSN=1664-3224 ABSTRACT=Background

The relationship between systemic inflammatory index (SII), sex steroid hormones, dietary antioxidants (DA), and gout has not been determined. We aim to develop a reliable and interpretable machine learning (ML) model that links SII, sex steroid hormones, and DA to gout identification.

Methods

The dataset we used to study the relationship between SII, sex steroid hormones, DA, and gout was from the National Health and Nutrition Examination Survey (NHANES). Six ML models were developed to identify gout by SII, sex steroid hormones, and DA. The seven performance discriminative features of each model were summarized, and the eXtreme Gradient Boosting (XGBoost) model with the best overall performance was selected to identify gout. We used the SHapley Additive exPlanation (SHAP) method to explain the XGBoost model and its decision-making process.

Results

An initial survey of 20,146 participants resulted in 8,550 being included in the study. Selecting the best performing XGBoost model associated with SII, sex steroid hormones, and DA to identify gout (male: AUC: 0.795, 95% CI: 0.746- 0.843, accuracy: 98.7%; female: AUC: 0.822, 95% CI: 0.754- 0.883, accuracy: 99.2%). In the male group, The SHAP values showed that the lower feature values of lutein + zeaxanthin (LZ), vitamin C (VitC), lycopene, zinc, total testosterone (TT), vitamin E (VitE), and vitamin A (VitA), the greater the positive effect on the model output. In the female group, SHAP values showed that lower feature values of E2, zinc, lycopene, LZ, TT, and selenium had a greater positive effect on model output.

Conclusion

The interpretable XGBoost model demonstrated accuracy, efficiency, and robustness in identifying associations between SII, sex steroid hormones, DA, and gout in participants. Decreased TT in males and decreased E2 in females may be associated with gout, and increased DA intake and decreased SII may reduce the potential risk of gout.