AUTHOR=Ford Colby T. , Alemayehu Gezahegn Solomon , Blackburn Kayla , Lopez Karen , Dieng Cheikh Cambel , Golassa Lemu , Lo Eugenia , Janies Daniel TITLE=Modeling Plasmodium falciparum Diagnostic Test Sensitivity Using Machine Learning With Histidine-Rich Protein 2 Variants JOURNAL=Frontiers in Tropical Diseases VOLUME=2 YEAR=2021 URL=https://www.frontiersin.org/journals/tropical-diseases/articles/10.3389/fitd.2021.707313 DOI=10.3389/fitd.2021.707313 ISSN=2673-7515 ABSTRACT=

Malaria, predominantly caused by Plasmodium falciparum, poses one of largest and most durable health threats in the world. Previously, simplistic regression-based models have been created to characterize malaria rapid diagnostic test performance, though these models often only include a couple genetic factors. Specifically, the Baker et al., 2005 model uses two types of particular repeats in histidine-rich protein 2 (PfHRP2) to describe a P. falciparum infection, though the efficacy of this model has waned over recent years due to genetic mutations in the parasite. In this work, we use a dataset of 100 P. falciparum PfHRP2 genetic sequences collected in Ethiopia and derived a larger set of motif repeat matches for use in generating a series of diagnostic machine learning models. Here we show that the usage of additional and different motif repeats in more sophisticated machine learning methods proves effective in characterizing PfHRP2 diversity. Furthermore, we use machine learning model explainability methods to highlight which of the repeat types are most important with regards to rapid diagnostic test sensitivity, thereby showcasing a novel methodology for identifying potential targets for future versions of rapid diagnostic tests.