Skip to main content

ORIGINAL RESEARCH article

Front. Artif. Intell.

Sec. Medicine and Public Health

Volume 8 - 2025 | doi: 10.3389/frai.2025.1446876

Development of machine learning algorithms to predict viral load suppression among HIV patients in Conakry (Guinea)

Provisionally accepted
Degninou Yehadji Degninou Yehadji 1*Geraldine Geraldine.Gray@tudublin.ie Gray Geraldine Geraldine.Gray@tudublin.ie Gray 2Carlos Arias Vicente Carlos Arias Vicente 1Petros Isaakidis Petros Isaakidis 3,4Abdourahime Diallo Abdourahime Diallo 1Saa Andre Kamano Saa Andre Kamano 1Thierno Saidou Diallo Thierno Saidou Diallo 5
  • 1 Medecins Sans Frontieres Belgique, Guinea Mission, Conakry, Guinea
  • 2 Technological University Dublin, Dublin, Ireland
  • 3 Medecins Sans Frontieres, Southern Africa Medical Unit, Cape Town, South Africa
  • 4 Department of Hygiene and Epidemiology, Faculty of Medicine, School of Health Sciences, University of Ioannina, Ioannina, Greece
  • 5 National HIV and Hepatitis Control Program (PNLSH), Ministry of Health and Public Hygiene, Conakry, Guinea

The final, formatted version of the article will be published soon.

    Background Viral load (VL) suppression is key to ending the global HIV epidemic, and predicting it is critical for healthcare providers and people living with HIV (PLHIV). Traditional research has focused on statistical analysis, but machine learning (ML) is gradually influencing HIV clinical care. While ML has been used in various settings, there's a lack of research supporting antiretroviral therapy (ART) programs, especially in resource-limited settings like Guinea. This study aims to identify the most predictive variables of VL suppression and develop ML models for PLHIV in Conakry (Guinea). Methods Anonymized data from HIV patients in eight Conakry health facilities were pre-processed, including variable recoding, record removal, missing value imputation, grouping small categories, creating dummy variables, and oversampling the smallest target class. Support vector machine (SVM), logistic regression (LR), naïve Bayes (NB), random forest (RF), and four stacked models were developed. Optimal parameters were determined through two cross-validation loops using a grid search approach. Sensitivity, specificity, predictive positive value (PPV), predictive negative value (PNV), F-score, and area under the curve (AUC) were computed on unseen data to assess model performance. RF was used to determine the most predictive variables. Results RF (94% F-score, 82% AUC) and NB (89% F-score, 82% AUC) were the most optimal models to detect VL suppression and non-suppression when applied to unseen data. The optimal parameters for RF were 1,000 estimators and no maximum depth (Random state = 40), and it identified Regimen schedule_6-Month, Duration on ART (months), Last ART CD4, Regimen schedule_Regular, and Last Pre-ART CD4 as top predictors for VL suppression. Conclusion This study demonstrated the capability to predict VL suppression but has some limitations. The results are dependent on the quality of the data and are specific to the Guinea context and thus, there may be limitations with generalizability. Future studies may be to conduct a similar study in a different context and develop the most optimal model into an application that can be tested in a clinical context. Keywords: HIV, antiretroviral therapy, viral load, machine learning, prediction, classification, algorithm.

    Keywords: HIV, antiretroviral therapy, Viral Load, machine learning, prediction, Classification, algorithm

    Received: 10 Jun 2024; Accepted: 14 Feb 2025.

    Copyright: © 2025 Yehadji, Gray, Vicente, Isaakidis, Diallo, Kamano and Diallo. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: Degninou Yehadji, Medecins Sans Frontieres Belgique, Guinea Mission, Conakry, Guinea

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

    Research integrity at Frontiers

    Man ultramarathon runner in the mountains he trains at sunset

    94% of researchers rate our articles as excellent or good

    Learn more about the work of our research integrity team to safeguard the quality of each article we publish.


    Find out more