Skip to main content

ORIGINAL RESEARCH article

Front. Cell. Infect. Microbiol.

Sec. Clinical Infectious Diseases

Volume 15 - 2025 | doi: 10.3389/fcimb.2025.1550933

This article is part of the Research Topic Unravelling Host-Pathogen Interactions in Bacterial Infection: Insights from Omics and Machine Learning View all 3 articles

Enhancing Fever of unknown origin Diagnosis: Machine Learning Approaches to Predict Metagenomic next-generation sequencing Positivity

Provisionally accepted
  • Second Xiangya Hospital, Central South University, Changsha, Hunan Province, China

The final, formatted version of the article will be published soon.

    Metagenomic next-generation sequencing (mNGS) can potentially detect various pathogenic microorganisms without bias to improve the diagnostic rate of fever of unknown origin (FUO), but there are no effective methods to predict mNGS-positive results. This study aimed to develop an interpretable machine learning algorithm for the effective prediction of mNGS results in patients with FUO.A clinical dataset from a large medical institution was used to develop and compare the performance of several predictive models, namely eXtreme Gradient Boosting (XGBoost), Light Gradient-Boosting Machine (LightGBM), and Random Forest, and the Shapley additive explanation (SHAP) method was employed to interpret and analyze the results.The mNGS-positive rate among 284 patients with FUO reached 64.1%. Overall, the LightGBM-based model exhibited the best comprehensive predictive performance, with areas under the curve of 0.84 and 0.93 for the training and validation sets, respectively. Using the SHAP method, the five most important factors for predicting mNGS-positive results were albumin, procalcitonin, blood culture, disease type, and sample type.The validated LightGBM-based predictive model could have practical clinical value in enhancing the application of mNGS in the etiological diagnosis of FUO, representing a powerful tool to optimize the timing of mNGS.

    Keywords: Metagenomic next-generation sequencing (mNGS), Fever of unknown origin (FUO), machine learning algorithms, Light Gradient-Boosting Machine (LightGBM), Predictive Modeling

    Received: 24 Dec 2024; Accepted: 21 Mar 2025.

    Copyright: © 2025 Gao, JIANG, Chen, Wang, Liu and Ma. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: Jing Ma, Second Xiangya Hospital, Central South University, Changsha, Hunan Province, China

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

    Research integrity at Frontiers

    Man ultramarathon runner in the mountains he trains at sunset

    94% of researchers rate our articles as excellent or good

    Learn more about the work of our research integrity team to safeguard the quality of each article we publish.


    Find out more