The final, formatted version of the article will be published soon.
ORIGINAL RESEARCH article
Front. Public Health
Sec. Infectious Diseases: Epidemiology and Prevention
Volume 13 - 2025 |
doi: 10.3389/fpubh.2025.1470072
Multimodal machine learning-based model for differentiating nontuberculous mycobacteria from mycobacterium tuberculosis
Provisionally accepted- Zhoushan Hospital, Zhoushan, China
Objective: To develop and evaluate the effectiveness of multimodal machine learning approach for the differentiation of NTM from MTB. Methods: The clinical data and CT images of 175 patients were retrospectively obtained. We established clinical data-based model, radiomics-based model, and multimodal (clinical plus radiomics) model gradually using 5 machine learning algorithms (Logistic, XGBoost, AdaBoost, RandomForest, and LightGBM). Optimal algorithm in each model was selected after evaluating the differentiation performance both in training and validation sets. The model performance was further verified using external new MTB and NTM patient data. Performance was also compared with the existing approaches and model. Results: The clinical data-based model contained age, gender, and IL-6, and the RandomForest algorithm achieved the optimal learning model. Two key radiomics features of CT images were identified and then used to establish the radiomics model, finding that model from Logistic algorithm was the optimal. The multimodal model contained age, IL-6, and the 2 radiomics features, and the optimal model was from LightGBM algorithm. The optimal multimodal model had the highest AUC value, accuracy, sensitivity, and negative predictive value compared with the optimal clinical or radiomics models, and its’ favorable performance was also verified in the external test dataset (accuracy=0.745, sensitivity=0.900). Additionally, the performance of multimodal model was better than that of the radiologist, NGS detection, and existing machine learning model, with an increased accuracy of 26%, 4%, and 6%, respectively.Conclusion: This is the first study to establish multimodal model to distinguish NTM from MTB and it performs well in differentiating them, which has the potential to aid clinical decision-making for experienced radiologists.
Keywords: Nontuberculous mycobacterium, Mycobacterium tuberculosis, deep learning, CT images, Multimodal model
Received: 25 Jul 2024; Accepted: 06 Feb 2025.
Copyright: © 2025 Li, Zhi, Liu, Wang and Yu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence:
Hong-ling Li, Zhoushan Hospital, Zhoushan, China
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.