AUTHOR=Ford Colby T. , Alemayehu Gezahegn Solomon , Blackburn Kayla , Lopez Karen , Dieng Cheikh Cambel , Golassa Lemu , Lo Eugenia , Janies Daniel
TITLE=Modeling Plasmodium falciparum Diagnostic Test Sensitivity Using Machine Learning With Histidine-Rich Protein 2 Variants
JOURNAL=Frontiers in Tropical Diseases
VOLUME=2
YEAR=2021
URL=https://www.frontiersin.org/journals/tropical-diseases/articles/10.3389/fitd.2021.707313
DOI=10.3389/fitd.2021.707313
ISSN=2673-7515
ABSTRACT=
Malaria, predominantly caused by Plasmodium falciparum, poses one of the largest and most durable health threats in the world. Previously, simplistic regression-based models have been created to characterize malaria rapid diagnostic test performance, though these models often include only a couple of genetic factors. Specifically, the Baker et al. (2005) model uses two types of particular repeats in histidine-rich protein 2 (PfHRP2) to describe a P. falciparum infection, though the efficacy of this model has waned in recent years due to genetic mutations in the parasite. In this work, we use a dataset of 100 P. falciparum PfHRP2 genetic sequences collected in Ethiopia and derive a larger set of motif repeat matches for use in generating a series of diagnostic machine learning models. Here we show that using additional and different motif repeats in more sophisticated machine learning methods proves effective in characterizing PfHRP2 diversity. Furthermore, we use machine learning model explainability methods to highlight which of the repeat types are most important with regard to rapid diagnostic test sensitivity, thereby showcasing a novel methodology for identifying potential targets for future versions of rapid diagnostic tests.