Skip to main content

ORIGINAL RESEARCH article

Front. Med.

Sec. Pulmonary Medicine

Volume 12 - 2025 | doi: 10.3389/fmed.2025.1529993

This article is part of the Research Topic Bridging Tradition and Future: Cutting-edge Exploration and Application of Artificial Intelligence in Comprehensive Diagnosis and Treatment of Lung Diseases View all 4 articles

Explainable Machine Learning Model and Nomogram for Predicting the Efficacy of Traditional Chinese Medicine in Treating Long COVID: A Retrospective Study

Provisionally accepted
Jisheng Zhang Jisheng Zhang 1Yang Chen Yang Chen 1Aijun Zhang Aijun Zhang 2Yi Yang Yi Yang 3Liqian Ma Liqian Ma 1Hangqi Meng Hangqi Meng 1Jintao Wu Jintao Wu 1Kean Zhu Kean Zhu 1Jiangsong Zhang Jiangsong Zhang 1Ke Lin Ke Lin 4Xianming Lin Xianming Lin 1*
  • 1 Third Affiliated Hospital of Zhejiang Chinese Medical University, Hangzhou, Zhejiang Province, China
  • 2 First Hospital of Jiaxing, Jiaxing, Zhejiang Province, China
  • 3 Haining People's Hospital, Jiaxing, China
  • 4 First College of Clinical Medicine, Zhejiang Chinese Medical University, Hangzhou, Zhejiang Province, China

The final, formatted version of the article will be published soon.

    Introduction: Long COVID significantly affects patients' quality of life, yet no standardized treatment has been established. Traditional Chinese Medicine (TCM) presents a promising potential approach with targeted therapeutic strategies. This study aims to develop an explainable machine learning (ML) model and nomogram to identify Long COVID patients who may benefit from TCM, enhancing clinical decision-making. Methods: We analyzed data from 1,331 Long COVID patients treated with TCM between December 2022 and February 2024 at three hospitals in Zhejiang, China. Effectiveness was defined as improvement in two or more symptoms or a minimum 2-point increase in the Traditional Chinese Medicine Syndrome Score (TCMSS). Data included 11 patient and disease characteristics, 18 clinical symptoms and syndrome scores, and 12 auxiliary examination indicators. The least absolute shrinkage and selection operator (LASSO) method identified features linked to TCM efficacy. Data from 1,204 patients served as the training set, while 127 patients formed the testing set. Results: We employed five ML algorithms: Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbors (KNN), Extreme Gradient Boosting (XGBoost), and Neural Network (NN). The XGBoost model achieved an Area Under the Curve (AUC) of 0.9957 and an F1 score of 0.9852 in the training set, demonstrating superior performance in the testing set with an AUC of 0.9059 and F1 score of 0.9027. Key features identified through SHapley Additive exPlanations (SHAP) included chest tightness, aversion to cold, age, TCMSS, Short Form (36) Health Survey (SF-36), C-reactive protein (CRP), and lymphocyte ratio. The logistic regression-based nomogram demonstrated an AUC of 0.9479 and F1 score of 0.9384 in the testing set. Conclusion: This study utilized multicenter data and multiple ML algorithms to create a ML model for predicting TCM efficacy in Long COVID treatment. Furthermore, a logistic regression-based nomogram was developed to assist the model and improve decision-making efficiency in TCM applications for Long COVID management.

    Keywords: machine learning, Shapley additive explanations, nomogram, Traditional Chinese Medicine, Long Covid, efficacy

    Received: 18 Nov 2024; Accepted: 03 Mar 2025.

    Copyright: © 2025 Zhang, Chen, Zhang, Yang, Ma, Meng, Wu, Zhu, Zhang, Lin and Lin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: Xianming Lin, Third Affiliated Hospital of Zhejiang Chinese Medical University, Hangzhou, 310053, Zhejiang Province, China

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

    Research integrity at Frontiers

    Man ultramarathon runner in the mountains he trains at sunset

    94% of researchers rate our articles as excellent or good

    Learn more about the work of our research integrity team to safeguard the quality of each article we publish.


    Find out more