AUTHOR=Chimbunde Emmanuel , Sigwadhi Lovemore N. , Tamuzi Jacques L. , Okango Elphas L. , Daramola Olawande , Ngah Veranyuy D. , Nyasulu Peter S. TITLE=Machine learning algorithms for predicting determinants of COVID-19 mortality in South Africa JOURNAL=Frontiers in Artificial Intelligence VOLUME=Volume 6 - 2023 YEAR=2023 URL=https://www.frontiersin.org/journals/artificial-intelligence/articles/10.3389/frai.2023.1171256 DOI=10.3389/frai.2023.1171256 ISSN=2624-8212 ABSTRACT=Background: COVID-19 has strained healthcare resources, necessitating efficient prognostication to triage patients effectively. This study quantified COVID-19 risk factors and predicted COVID-19 Intensive Care Unit (ICU) mortality in South Africa based on machine learning algorithms. Methods: Data for this study was on 392 COVID-19 ICU patients enrolled between 26 March 2020 and 10 February 2021. We used an Artificial Neural Network (ANN) and random forest (RF) to predict mortality among ICU patients and a semi-parametric logistic regression with nine covariates including a grouping variable based on k-means clustering. Further evaluation of the algorithms was performed using sensitivity, accuracy, specificity, and Cohen’s k statistics. Results: From the semi-parametric logistic regression and ANN variable importance, age, gender, cluster, presence of severe symptoms, being on the ventilator, and comorbidities of asthma significantly contributed to ICU death. In particular, the odds of mortality were 6 times higher among asthmatic patients than non-asthmatic patients. In univariable and multivariate regression, advanced age, PF1 and 2, FiO2, severe symptoms, asthma, oxygen saturation, and Clusters 4 were strongly predictive of mortality. The RF model revealed that intubation status, age, cluster, diabetes, and hypertension were the top five significant predictors of mortality. The ANN performed well with accuracy of 71%, a precision of 83%, F1 score of 100%, MCC) of 100% and a recall of 88%. In addition, a Cohen's k value of 0.75 verified the most extreme discriminative power of the ANN. In comparison, the RF model provided a 76% recall, an 87% precision, and a 65% Matthews correlation coefficient (MCC). Conclusion: Based on the findings, we can conclude that both ANN and RF can predict COVID-19 mortality in the ICU with accuracy. The proposed models accurately predict the prognosis of COVID-19 patients after diagnosis. The models can be used to prioritize COVID-19 patients with a high mortality risk in resource-constrained ICUs.