
94% of researchers rate our articles as excellent or good
Learn more about the work of our research integrity team to safeguard the quality of each article we publish.
Find out more
ORIGINAL RESEARCH article
Front. Immunol.
Sec. Systems Immunology
Volume 16 - 2025 | doi: 10.3389/fimmu.2025.1528046
The final, formatted version of the article will be published soon.
You have multiple emails registered with Frontiers:
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
The "gut-skin axis" has been proposed to play an important role in the development and symptoms of atopic dermatitis. So, we construct an interpretable machine learning framework to screen key gut flora quantificationally.The 16s dataset with centered log-ratio transformation was explored using five different machine learning models, including random forest, light gradient boosting machine, extreme gradient boosting, support vector machine with radial kernel and logistic regression. Interpretable methods based on machine learning, such as SHAP value, were used to estimate significant features associated with atopic dermatitis.Results: Random forest performed better than the other "tree" models in the validation partitions. The SHAP global dependency plot screened that Bifidobacterium ranked as the strongest predictive factor for all prediction horizons, though the SHAP values of some features were still higher in support vector machine and logistic regression. The SHAP partial dependency plot for "tree" models indicated that best segmentation point for the Bifidobacterium was further away from the origin than other features in respective models, quantitatively reflecting the difference of gut microbiota.The machine learning models combined with SHAP could be used to screen key gut flora quantificationally in atopic dermatitis patients and give doctors an intuitive knowledge of 16s rRNA sequencing data to support precision medicine in care and recovery.
Keywords: machine learning, random forest, Light gradient boosting machine, Extreme gradient boosting, SHAP value, Partial dependence plot, Interpretable machine learning
Received: 15 Nov 2024; Accepted: 07 Apr 2025.
Copyright: © 2025 Ma, fang, li, Zeng, Chen, Li, Ji, Yang and wu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence:
Jingtai Ma, NMPA Key Laboratory for Safety Evaluation of Cosmetics, Guangdong Provincial Key Laboratory of Tropical Disease Research, School of Public Health, Southern Medical University, Guangzhou, China
yiting fang, NMPA Key Laboratory for Safety Evaluation of Cosmetics, Guangdong Provincial Key Laboratory of Tropical Disease Research, School of Public Health, Southern Medical University, Guangzhou, China
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
Research integrity at Frontiers
Learn more about the work of our research integrity team to safeguard the quality of each article we publish.