ORIGINAL RESEARCH article

Front. Public Health, 25 June 2024
Sec. Infectious Diseases: Epidemiology and Prevention
This article is part of the Research Topic Breaking Barriers, Bridging Gaps: UN World AIDS Day 2023

Investigating the effects of cytokine biomarkers on HIV incidence: a case study for individuals randomized to pre-exposure prophylaxis vs. control

  • 1School of Mathematics, Statistics and Computer Science, University of KwaZulu-Natal, Pietermaritzburg, South Africa
  • 2School of Nursing and Public Health, University of KwaZulu-Natal, Pietermaritzburg, South Africa

Introduction: Understanding and identifying the immunological markers and clinical information linked with HIV acquisition is crucial for effectively implementing Pre-Exposure Prophylaxis (PrEP) to prevent HIV acquisition. Prior analyses of HIV incidence outcomes have predominantly employed proportional hazards (PH) models that adjust solely for baseline covariates. Models that integrate cytokine biomarkers, particularly as time-varying covariates, are therefore needed.

Methods: We first fitted a simple Cox PH model to investigate the impact of specific cytokine profiles on overall HIV incidence. Kaplan–Meier curves were then used to compare HIV incidence between the treatment and placebo groups and to assess the overall treatment effectiveness. Using stepwise regression, we developed a series of Cox PH models to analyze 48 longitudinally measured cytokine profiles. We considered three kinds of effects in the cytokine profile measurements: average, difference, and time-dependent covariate. These effects were combined with baseline covariates to explore their influence on predictors of HIV incidence.

Results: When the Cox PH models were compared using the Akaike information criterion (AIC), model 4 (the Cox PH model with time-dependent cytokines) outperformed the others. The results indicated that the cytokines interleukin (IL-2, IL-3, IL-5, IL-10, IL-16, IL-12P70, and IL-17 alpha), stem cell factor (SCF), beta nerve growth factor (B-NGF), tumor necrosis factor alpha (TNF-A), interferon (IFN) alpha-2, serum stem cell growth factor (SCGF)-beta, platelet-derived growth factor (PDGF)-BB, granulocyte macrophage colony-stimulating factor (GM-CSF), tumor necrosis factor-related apoptosis-inducing ligand (TRAIL), and cutaneous T-cell-attracting chemokine (CTACK) were significantly associated with HIV incidence. Baseline predictors significantly associated with HIV incidence when considering cytokine effects included: age of oldest sex partner, age at enrollment, salary, years with a stable partner, sex partner having any other sex partner, husband's income, other income source, age at debut, years lived in Durban, and sex in the last 30 days.

Discussion: Overall, the inclusion of cytokine effects enhanced the predictive performance of the models, and the PrEP group exhibited lower HIV incidence than the placebo group.

1 Introduction

HIV continues to be a serious worldwide health concern. South Africa has the world's largest HIV epidemic, with an estimated 8.45 million people living with HIV (1). The primary mode of transmission in this endemic setting is heterosexual intercourse, and women aged 18–40 years account for more than 60% of new infections, with young women bearing the greatest burden (2). It is well known that sex workers are at a greater risk of HIV acquisition (3). Significant effort has been made in South Africa over the last decade to search for new technologies that prevent sexually transmitted HIV infections in women, such as pre-exposure prophylaxis (PrEP) products. Initiatives have been undertaken to scale up education about and access to these products, for example tenofovir gel, an antiretroviral microbicide that can be applied to the vagina or rectum with the intention of reducing the acquisition of HIV (4).

The initial stages of HIV infection are characterized by inflammation and profound immune dysregulation in the gut mucosa (5, 6), and genital inflammation at this stage also correlates with an increased plasma viral load (7). Taken together, these findings indicate that inflammation is a key mediator of HIV pathogenesis. The levels of inflammatory cytokines and chemokines, which signal the presence of infection and recruit activated immune cells to the mucosa, are frequently used as biomarkers of inflammation in the female reproductive tract (FRT) (8). We therefore hypothesize that elevated mucosal cytokines are correlated with increased rates of HIV acquisition. Increased levels of pro-inflammatory cytokines are associated with increased rates of HIV acquisition (9), and the cytokine profile is a strong predictor of subsequent HIV acquisition. Understanding the interplay between cytokine biomarkers and HIV incidence by identifying specific cytokine profiles associated with increased or decreased HIV susceptibility is crucial for optimizing PrEP strategies.

Cytokines serve a vital role in maintaining immune system homoeostasis (10), and HIV infection causes dysregulation of the cytokine profile (11). Changes in the cytokine signature directly affect HIV disease progression (12), with an intense cytokine “storm” during acute HIV infection (13). T-helper type 1 (Th1) cytokines such as interleukin (IL)-2 and antiviral interferon (IFN)-gamma are generally decreased during HIV infection, whereas T-helper type 2 (Th2) cytokines such as IL-4, IL-10, pro-inflammatory cytokines (IL-1, IL-6, IL-8) and tumor necrosis factor (TNF)-alpha are increased (10). IFN-alpha, IFN-beta, and IL-16 are HIV-suppressive cytokines that inhibit HIV replication in T cells while IFN-gamma, IL-4, and granulocyte-macrophage colony-stimulating factor, for example, have been demonstrated to have both inhibitory and stimulatory effects on HIV (14).

In clinical research, covariate data are commonly collected longitudinally, and the covariates change over time during the follow-up period. For example, in a clinical trial to assess the safety and effectiveness of tenofovir gel, a vaginal microbicide, in sexually active women at risk for HIV, cytokine profiles were measured repeatedly up to infection or censoring (4). In many instances, when examining the relationship between time to HIV infection and covariates, investigators consider only the baseline covariates and leave out covariates that change over time, thereby failing to model the survival outcome as a function of the change in the time-dependent covariates (15). It appears natural and suitable to use time-varying covariate information in an appropriate statistical model. The Cox PH model can be used to link survival times with either fixed covariates whose values remain constant during the follow-up period or predictor variables that fluctuate over time (16). Such covariates can be entered into the Cox PH model as time-dependent covariates or incorporated as derived longitudinal variables, as further elaborated in Section 2.2.

Previous analyses by Abdool Karim et al. (4) and Mansoor et al. (17) investigated the effectiveness, safety and adherence in the CAPRISA 004 tenofovir gel microbicide trial. They used a proportional hazards (PH) regression model to calculate hazard ratios while adjusting for potentially important baseline covariates (age, site, anal sex history, contraceptive method, HSV-2 antibody status and condom use). They reported a hazard ratio of 0.63 (CI: 0.42, 0.94; p = 0.025). In their analysis, they neither included the cytokine profile nor reported the significant baseline covariates associated with HIV incidence. Masson et al. (18) used the same dataset to investigate whether genital inflammation influenced HIV acquisition in women. They selected 12 cytokines and employed conditional logistic regression and unsupervised hierarchical clustering in their statistical analysis.

Naranbhai et al. (19) investigated the role of immune activation in HIV acquisition in the CAPRISA 004 trial. They selected 13 cytokines and used logistic regression and principal component analysis (PCA) in their statistical analysis. On the other hand, Ngcobo et al. (20), in their study examining whether pre-infection plasma cytokine concentrations predicted the rate of HIV disease progression in the same study cohort, considered all 48 cytokines. They used linear regression to assess the impact of each cytokine on viral load (VL) and the CD4:CD8 ratio in both bivariate and multivariable models, adjusting for age, contraceptive use, HSV-2 status at baseline, study site, and study arm at randomization. Ignacio et al. (21) used the Sabes dataset and LASSO machine learning algorithms to study how dynamic immune markers predict HIV acquisition and strengthen associations with sociobehavioral factors related to HIV exposure. They selected 10 cytokines for their analysis. Other studies (22, 23) that used the CAPRISA 004 dataset to investigate HIV progression did not include the time-varying cytokine profile as a covariate in their analyses. To the best of our knowledge, the cytokine profile, either as a time-varying covariate or as a derived covariate, has not been used together with baseline covariates in previous studies to identify significant predictors of HIV incidence.

This study therefore seeks to investigate the effect of time-varying cytokine biomarkers in determining significant predictors of HIV incidence among individuals randomized to PrEP vs. control. We do so by building a series of Cox PH models that include different forms of the covariates that change over time, and we further assess the overall effectiveness of the tenofovir treatment by comparing the two groups using the Kaplan–Meier estimator and survival curves. Because variations in individual immune responses, particularly in cytokine profiles, may influence the efficacy of PrEP, this research aims to contribute to the development of personalized PrEP interventions tailored to individual immune responses.

2 Materials and methods

2.1 Dataset

The data were accessed from the Centre for the AIDS Programme of Research in South Africa (CAPRISA) 004 trial (4), a two-arm, double-blinded, randomized trial (placebo and tenofovir groups) conducted among HIV-negative, sexually active women aged 18–40 years in South Africa over a period of 30 months: an 18-month accrual period and 12 months of follow-up. The trial was conducted between May 2007 and March 2010, and the dataset consists of survival and longitudinal data. The variables considered in this study were baseline characteristics and longitudinally measured cytokine profiles, as described in Supplementary Table S1.

2.1.1 Cytokine measurement

Plasma samples and cervicovaginal lavage specimens from cases and controls were collected and stored for assessment. A total of 48 cytokines were measured in 812 women (tenofovir group = 405, placebo group = 407), among whom there were 96 HIV infections (tenofovir group = 37, placebo group = 59). The measurements were taken at irregular follow-up times, as shown in Figure 1; the majority of patients had their cytokine measurements recorded three times during the course of the study. The average interval between the first and second cytokine measurements was 12 months, while the interval between subsequent cytokine measurements was 6 months.

Figure 1. Total count for the frequency of cytokine profile measurements.

2.1.2 Data pre-processing

The data underwent pre-processing to prepare them for subsequent analysis. The pre-processing steps involved eliminating variables with excessive missing values (Figure 2), i.e., with more than 50% missingness, and categorical variables whose levels had very small frequency percentages. Additionally, to enhance the robustness of our statistical analysis, we combined certain levels of categorical variables and renamed the strings. This step is crucial because a categorical variable with too many levels can compromise the model's performance owing to small frequencies in some of the levels (24). Moreover, variables with only one level do not improve the model because of their very low variation, while levels that rarely occur have minimal chance of significantly affecting the model fit (25). These adjustments help ensure that our analysis accurately captures the relationships within the data. Furthermore, Figure 2 demonstrates the completeness of our dataset, with approximately 84.93% of variables containing no missing information, 10.46% missing income value data, and the remaining variables displaying other missing patterns.

Figure 2. Missing data aggregation plot. Proportion of missing values for all variables in the dataset, sorted by decreasing order (left). Combinations of missing values (right): yellow squares in a matrix entry denote the presence of missing values for the variable associated to the column in the samples corresponding to the row; the bars on the right show the cardinality of each set of points. The x-axis displays the variable names (not all variables are displayed due to limited space).

The data preparation and statistical analysis were done in R (version 4.3.2). The R code file for this analysis is available in Supplementary Table S2. After the pre-processing step, 24 baseline characteristics and 48 cytokine covariates were used as initial variables at the start of the analysis. The categorical variables were summarized using frequencies and percentages, as shown in Supplementary Table S2. The patients' baseline characteristics in relation to HIV status and treatment group are summarized in Table 1, where the number of years with a stable partner (p = 0.034) and receiving income from the husband (p = 0.026) were significantly associated with HIV status and treatment group. The statistical analysis was conducted on complete cases only, in two stages: first, survival analysis on the baseline covariates without the cytokine effects; then, survival analysis including the cytokine effects. Cytokine profiles are time-varying covariates since they change over the follow-up period. Therefore, the cytokine information was included in three ways: first, we averaged all measurements throughout the follow-up time to capture their average effects; second, we took the difference between the last and first measurements to model the effect of change; and last, we treated the cytokine as a time-dependent covariate.

Table 1. Summary description of patient's baseline characteristics stratified by HIV status and treatment group.

2.2 Statistical methods

Four separate Cox PH models were fitted in order of increasing complexity, according to how the cytokine effects are included. Model 1 (Equation 8) is a Cox regression model with baseline variables only; model 2 (Equation 9) adds cytokine effects using the mean value of the cytokine measurements as the covariate; model 3 (Equation 10) adds cytokine effects using the difference between the last observed cytokine value and the first value as the covariate; and model 4 (Equation 11) adds time-dependent cytokine effects to the baseline variables.

2.2.1 Kaplan–Meier survival curves

The Kaplan–Meier estimator is a non-parametric statistic that is used to estimate the survival function based on lifetime data (26). The estimate is frequently used in medical research to examine recovery rates, likelihood of deaths and whether or not a treatment was effective. Furthermore, it is used to compare two groups of subjects, the control group and treatment group (27). The Kaplan–Meier survival curve is a graphical representation of the survival function defined as the probability of surviving in a given length of time while considering time in many small intervals (28).

To estimate the survival function S(t) (the probability that the survival time is longer than t), we consider the ordered survival times $t_1 \le t_2 \le \dots \le t_n$, including censored observations, of a group of n subjects. The proportion of individuals, S(t), who survive beyond any follow-up time is estimated by (Equation 1)

$$S(t)=\prod_{t_i \le t}\frac{n_i-d_i}{n_i}=\prod_{t_i \le t}\left(1-\frac{d_i}{n_i}\right) \tag{1}$$

where $t_i$ is the largest survival time less than or equal to t, $n_i$ is the number of individuals uninfected just before time $t_i$ (the ith ordered survival time) and $d_i$ denotes the number who acquired HIV infection at time $t_i$ (29). Before the first HIV infection, $S(t_0) = 1$. The survival $S(t_i)$ at time $t_i$, given the number of infections $d_i$ and the number of uninfected patients $n_i$ just before $t_i$, is given by (Equation 2)

$$S(t_i)=\frac{n_i-d_i}{n_i}\times S(t_{i-1}). \tag{2}$$

Maximum likelihood estimation of the discrete hazard function $h_i$ (the probability of an individual experiencing an event at time $t_i$) yields the Kaplan–Meier estimator (Equation 3)

$$\hat{S}(t)=\prod_{i:\,t_i \le t}\left(1-\hat{h}_i\right)=\prod_{i:\,t_i \le t}\left(1-\frac{d_i}{n_i}\right). \tag{3}$$
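The product-limit calculation in Equations 2 and 3 can be illustrated with a minimal sketch. The function name `kaplan_meier` and the toy follow-up times below are hypothetical and are not taken from the CAPRISA 004 data; the study itself used R packages rather than this code.

```python
from collections import Counter

def kaplan_meier(times, events):
    """Return [(t_i, S(t_i))] at each distinct event time.

    times  : follow-up time of each subject
    events : 1 if HIV infection was observed, 0 if censored
    """
    infections = Counter(t for t, e in zip(times, events) if e == 1)
    surv, curve = 1.0, []
    for t in sorted(infections):
        n_i = sum(1 for u in times if u >= t)  # uninfected just before t
        d_i = infections[t]                    # infections at t
        surv *= (n_i - d_i) / n_i              # recursion of Equation 2
        curve.append((t, surv))
    return curve

# Hypothetical toy data (months); not from the trial.
toy_times = [2, 3, 3, 5, 8, 8, 12]
toy_events = [1, 1, 0, 1, 0, 1, 0]
km_curve = kaplan_meier(toy_times, toy_events)
for t, s in km_curve:
    print(f"t = {t:>2}  S(t) = {s:.4f}")
```

Censored subjects contribute to the risk set $n_i$ up to their censoring time but never to $d_i$, which is how the estimator uses partial follow-up.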

Moreover, the Kaplan–Meier estimator is a statistic, and its variance can be approximated by several estimators, such as Greenwood's formula (30), which gives (Equation 4)

$$\widehat{\mathrm{Var}}\left(\hat{S}(t)\right)=\hat{S}(t)^{2}\sum_{i:\,t_i \le t}\frac{d_i}{n_i(n_i-d_i)} \tag{4}$$
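As a worked illustration of Equation 4, the sketch below accumulates Greenwood's sum over a small risk table. The function name `greenwood_variance` and the $(n_i, d_i)$ pairs are hypothetical, and each $n_i$ is assumed to exceed $d_i$ so that no term divides by zero.

```python
import math

def greenwood_variance(risk_table):
    """Kaplan-Meier estimate and Greenwood variance at time t.

    risk_table : list of (n_i, d_i) pairs for the ordered event
                 times t_i <= t, with n_i > d_i in every pair.
    """
    surv, acc = 1.0, 0.0
    for n_i, d_i in risk_table:
        surv *= 1.0 - d_i / n_i            # product-limit estimate
        acc += d_i / (n_i * (n_i - d_i))   # Greenwood's sum
    return surv, surv ** 2 * acc

# Hypothetical risk table: (number at risk, number infected).
s_hat, var_hat = greenwood_variance([(7, 1), (6, 1), (4, 1)])
se_hat = math.sqrt(var_hat)
print(f"S = {s_hat:.4f}, Var = {var_hat:.5f}, SE = {se_hat:.4f}")
```

The standard error obtained this way is what is typically used to draw pointwise confidence bands around a Kaplan–Meier curve.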

The log-rank test is used to compare the hazards of two or more groups by testing the null hypothesis that the probability of an event at any time point does not differ between the populations. Thus, the log-rank test compares the survival functions of the groups (27). The null hypothesis is rejected if the p-value is <0.05.
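A minimal two-group version of the log-rank test can be sketched as follows, assuming the standard construction in which observed and expected event counts in one group are compared at every distinct event time. The function name `logrank_test` and the toy data are hypothetical; the p-value uses the identity that a chi-square tail probability with 1 degree of freedom equals `erfc(sqrt(x / 2))`.

```python
import math

def logrank_test(times, events, groups):
    """Two-sample log-rank test; groups holds 0/1 labels."""
    event_times = sorted({t for t, e in zip(times, events) if e == 1})
    obs_minus_exp, var = 0.0, 0.0
    for t in event_times:
        at_risk = [(g, u, e) for u, e, g in zip(times, events, groups) if u >= t]
        n = len(at_risk)
        n1 = sum(1 for g, _, _ in at_risk if g == 1)
        d = sum(1 for _, u, e in at_risk if u == t and e == 1)
        d1 = sum(1 for g, u, e in at_risk if g == 1 and u == t and e == 1)
        obs_minus_exp += d1 - d * n1 / n   # observed minus expected in group 1
        if n > 1:                          # hypergeometric variance term
            var += d * (n1 / n) * (1 - n1 / n) * (n - d) / (n - 1)
    chi2 = obs_minus_exp ** 2 / var
    p_value = math.erfc(math.sqrt(chi2 / 2))  # chi-square tail, df = 1
    return chi2, p_value

# Hypothetical toy data: group 0 = placebo-like, group 1 = treated-like.
toy_times = [2, 3, 5, 8, 4, 9, 12, 12]
toy_events = [1, 1, 1, 1, 0, 1, 0, 0]
toy_groups = [0, 0, 0, 0, 1, 1, 1, 1]
chi2_toy, p_toy = logrank_test(toy_times, toy_events, toy_groups)
print(f"chi-square = {chi2_toy:.3f}, p = {p_toy:.4f}")
```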

2.2.2 Stepwise Cox proportional hazard model (Cox PH)

Stepwise Cox proportional hazards regression is a method of selecting a subset of relevant variables for a Cox regression model from a larger set (31). Cox PH is the most widely used statistical method for analyzing the time-to-event data (16). The Cox PH model assesses the impact of multiple factors on survival simultaneously. Essentially, it enables one to investigate how specified predictors influence the rate of a specific event happening such as infection or death at a given point in time (32). This rate is commonly known as hazard rate.

To evaluate the association between the covariates (baseline and cytokine effects) and survival time, consider a sample of n patients indexed by k = 1, 2, ..., n, and let $C_k = (C_{k1}, C_{k2}, \dots, C_{kp})$ be the vector of p covariates (baseline plus cytokine effect covariates) of patient k in a given model. The survival data of the kth patient can be represented by $(T_k, \theta_k, C_k)$, where $T_k$ and $\theta_k$ are the survival time and censoring status, respectively. Mathematically, the general Cox PH model (33) is represented as (Equation 5)

$$h_k(t;C_k)=h_0(t)\,e^{\beta^{T}C_k} \tag{5}$$

where β is the vector of regression coefficients and $C_k$ is the covariate vector (baseline and cytokine effects). $h_0(t)$ is an unspecified baseline hazard function that corresponds to the value of the hazard when all elements of $C_k$ are equal to zero. The hazard ratio for two patients k and i (Equation 6) is

$$\frac{h_k(t;C_k)}{h_i(t;C_i)}=\frac{e^{\beta^{T}C_k}}{e^{\beta^{T}C_i}} \tag{6}$$

and is independent of time t. The Cox PH model parameters are estimated by maximizing the partial likelihood given below (Equation 7):

$$L(\beta)=\prod_{r\in E}\frac{e^{\beta^{T}C_r}}{\sum_{i\in R_r}e^{\beta^{T}C_i}} \tag{7}$$

where E is the set of indices of the patients who acquired HIV infection and $R_r$ is the set of indices of the individuals at risk at time $t_r$.
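The logarithm of the partial likelihood in Equation 7 can be evaluated directly for a given coefficient vector, as in the sketch below. The function name `cox_partial_loglik` and the toy data are hypothetical, and tied event times are treated in the simple Breslow manner here purely for brevity (the analysis in this study used the Efron approximation).

```python
import math

def cox_partial_loglik(beta, times, events, covariates):
    """Log partial likelihood of a Cox PH model.

    beta       : list of p regression coefficients
    covariates : list of covariate rows C_k, one per subject
    """
    # Linear predictor beta^T C_k for every subject.
    eta = [sum(b * c for b, c in zip(beta, row)) for row in covariates]
    loglik = 0.0
    for r, (t_r, e_r) in enumerate(zip(times, events)):
        if e_r != 1:
            continue  # censored subjects only appear in risk sets
        risk_sum = sum(math.exp(eta[i])
                       for i, t_i in enumerate(times) if t_i >= t_r)
        loglik += eta[r] - math.log(risk_sum)
    return loglik

# Hypothetical toy data with one covariate (e.g., a treatment indicator).
toy_times = [1, 2, 3, 4]
toy_events = [1, 1, 0, 1]
toy_cov = [[0.0], [1.0], [0.0], [1.0]]
print(cox_partial_loglik([0.0], toy_times, toy_events, toy_cov))
print(cox_partial_loglik([-0.5], toy_times, toy_events, toy_cov))
```

Maximizing this function over `beta`, for instance with Newton–Raphson, yields the coefficient estimates; the baseline hazard $h_0(t)$ cancels and never has to be specified.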

The stepwise Cox proportional hazards regression method adds or removes predictor variables from the model based on some criterion, such as the Akaike information criterion (AIC) or the Bayesian information criterion (BIC) (34). The AIC and BIC measure how well the model fits the data while penalizing models that have too many parameters. The lower the AIC or BIC, the better the model (35). Stepwise Cox PH regression can be performed using different procedures, such as forward selection, backward elimination, bidirectional selection, or score selection (36). Forward selection starts with an empty model and adds one variable at a time until it reaches a stopping criterion, such as a minimum AIC or BIC value. Backward elimination starts with a full model and removes one variable at a time until it reaches a stopping criterion. Bidirectional selection combines the two: it adds one variable at a time and, after each addition, checks whether any previously included variable should be removed, until it reaches a stopping criterion. Score selection compares candidate subsets of variables and retains those with the best value of the selection criterion.
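The idea of AIC-driven forward selection can be sketched in a few lines. This is an illustrative greedy search and not the algorithm implemented in the StepReg package; `loglik_of` is a hypothetical user-supplied function that returns the maximized partial log-likelihood of the model containing a given subset of variables, and the gains in the toy example are invented.

```python
def aic(loglik, n_params):
    """Akaike information criterion: 2k - 2 log L."""
    return 2 * n_params - 2 * loglik

def forward_select(candidates, loglik_of):
    """Greedy forward selection that minimizes the AIC."""
    selected = []
    best = aic(loglik_of(selected), 0)
    improved = True
    while improved:
        improved = False
        choice = None
        for var in [c for c in candidates if c not in selected]:
            trial = selected + [var]
            score = aic(loglik_of(trial), len(trial))
            if score < best:           # keep the best single addition
                best, choice, improved = score, var, True
        if improved:
            selected.append(choice)
    return selected, best

# Hypothetical log-likelihood gains per variable (illustration only).
gain = {"age": 3.0, "salary": 1.5, "site": 0.2}
toy_loglik = lambda subset: -100.0 + sum(gain[v] for v in subset)
chosen, best_aic = forward_select(list(gain), toy_loglik)
print(chosen, best_aic)
```

In this toy setting a variable is kept only if it raises the log-likelihood by more than 1, which is exactly the penalty of 2 AIC units per added parameter.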

We used the R package StepReg (37) to implement the stepwise regression and the survival and survminer packages to implement the Cox PH models. The specific Cox PH models (models 1–4), as described above, are formulated as follows:

2.2.3 Model 1

Model 1 includes the baseline covariates only; we call this the naive Cox PH model. The assumption is that the regression parameters remain constant over time (38). Consequently, the hazard ratio for any two individuals remains constant over time. The model is given by

$$h_k(t;X_k)=h_0(t)\times\exp(\beta X_k) \tag{8}$$

where $h_0(t)$ is the baseline hazard function and β is the vector of regression coefficients for the baseline covariates $X_k$.

2.2.4 Model 2

Model 2 includes the baseline covariates plus cytokine effects, using the mean value of the cytokine measurements as the covariate. Here the derived cytokine covariate is the average of all the cytokine measurements recorded for an individual patient at the different follow-up times. It models the average effect of the time-varying cytokine covariate (15). The Cox model is

$$h_k(t;X_k,G_k)=h_0(t)\times\exp\left(\beta X_k+\delta\bar{G}_k\right)=h_0(t)\times\exp\left(\beta X_k+\sum_{j=1}^{v}\delta_j\bar{G}_{kj}\right) \tag{9}$$

where $h_0(t)$ is the baseline hazard function, β is the regression coefficient vector for the time-invariant covariates $X_k$, and $\bar{G}_{kj}=\frac{1}{m_{kj}}\sum_{r=1}^{m_{kj}}G_{krj}$, for r = 1, 2, ..., $m_{kj}$, represents the average value of the cytokine level measured longitudinally for the kth subject with $m_{kj}$ observations for cytokine j (j = 1, 2, ..., v). The scalar $\delta_j\in\mathbb{R}$ is the parameter that links the average to the hazard.
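The derived covariate of model 2 is a per-subject average over however many visits that subject has. A minimal sketch, with a hypothetical function name `mean_cytokine`, invented subject identifiers and invented concentrations:

```python
from statistics import mean

def mean_cytokine(measurements):
    """Average cytokine level per subject.

    measurements : {subject_id: [G_k1j, ..., G_km j]} for one cytokine j;
                   subjects may have different numbers of visits.
    """
    return {k: mean(vals) for k, vals in measurements.items()}

# Hypothetical concentrations for a single cytokine.
toy_levels = {
    "P001": [4.0, 5.0, 9.0],   # three visits
    "P002": [2.5, 3.5],        # two visits
}
toy_means = mean_cytokine(toy_levels)
print(toy_means)
```

Because each subject contributes a single value, the averaged covariate can be merged with the baseline data and used in an ordinary Cox PH fit with one row per subject.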

2.2.5 Model 3

Model 3 includes the baseline covariates plus cytokine effects, using the difference between the last observed cytokine value and the first value as the covariate. It models the effect of change in the cytokine covariates (39). The Cox model is

$$h_k(t;X_k,D_k)=h_0(t)\times\exp\left(\beta X_k+\delta D_k\right)=h_0(t)\times\exp\left(\beta X_k+\sum_{j=1}^{v}\delta_j D_{kj}\right) \tag{10}$$

where $h_0(t)$ is the baseline hazard function, β is the regression coefficient vector for the time-invariant covariates $X_k$, and $D_{kj}=G_{k m_{kj} j}-G_{k1j}$ represents the difference between the last and first cytokine levels observed longitudinally for the kth subject, with $m_{kj}$ measurements for cytokine j (j = 1, 2, ..., v). The scalar $\delta_j\in\mathbb{R}$ is the parameter that links the change to the hazard. The model answers the question of whether a big or small change in a cytokine has an effect on HIV acquisition.
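The change covariate of model 3 can be derived in the same way, assuming that each subject's measurements are stored in visit order. The function name `cytokine_change` and the values below are hypothetical.

```python
def cytokine_change(measurements):
    """Last minus first cytokine level per subject.

    measurements : {subject_id: [G_k1j, ..., G_km j]} ordered by visit time.
    A subject with a single visit gets a change of zero.
    """
    return {k: vals[-1] - vals[0] for k, vals in measurements.items()}

# Hypothetical concentrations for a single cytokine.
toy_levels = {
    "P001": [4.0, 5.0, 9.0],   # rises by 5.0
    "P002": [3.5, 2.5],        # falls by 1.0
    "P003": [7.0],             # one visit only
}
toy_change = cytokine_change(toy_levels)
print(toy_change)
```

A positive value indicates a rise over follow-up and a negative value a fall, so the sign of $\delta_j$ shows which direction of change is associated with a higher hazard.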

2.2.6 Model 4

Model 4 includes the baseline covariates plus time-dependent cytokine effects. A covariate that changes over time throughout the follow-up period is referred to as a time-varying or time-dependent covariate (40). For such covariates it is critical to structure the data in a counting process format, in which the time-dependent covariate is coded using time intervals (41). The hazard is assumed to be proportional to the instantaneous probability of an event at a specific time conditional on the variables at that time (42). The interpretation of the results of this approach is more complicated than that of the naive baseline approach, as the covariate information changes over time. Here we consider a sample of n subjects with data $\{T_k, \theta_k, \{G_k(t), 0 \le t \le T_k\}\}$, k = 1, 2, ..., n, where $T_k$ is the time to event for the kth subject, $\theta_k$ is the event indicator and $G_k(t)$ is the time-varying covariate. The Cox PH model becomes

$$h_k(t;X_k,G_k)=h_0(t)\times\exp\left(\beta X_k+\delta G_k(t)\right)=h_0(t)\times\exp\left(\beta X_k+\sum_{j=1}^{v}\delta_j G_{kj}(t)\right) \tag{11}$$

where $h_0(t)$ is the baseline hazard function, β is the regression coefficient vector for the time-invariant covariates, and $G_{kj}(t)=[G_{k1j}(t), G_{k2j}(t), \dots, G_{k m_{kj} j}(t)]$ is the set of covariates for the $m_{kj}$ longitudinal measures of cytokine j (j = 1, 2, ..., v) for the kth subject. The scalar $\delta_j\in\mathbb{R}$ represents the parameter that links the time-dependent covariates to the hazard.
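The counting process format mentioned for model 4 splits each subject's follow-up into (start, stop] intervals, with the cytokine value held constant within an interval and the event recorded only in the last one. The sketch below shows one way to build such rows; the function name `to_counting_process`, the field names and the example visits are all hypothetical.

```python
def to_counting_process(subject_id, visit_times, values, end_time, event):
    """Split one subject's follow-up into (start, stop] rows.

    visit_times : increasing measurement times, starting at baseline
    values      : cytokine level recorded at each visit
    end_time    : time of HIV infection or censoring
    event       : 1 if infected at end_time, 0 if censored
    """
    rows = []
    for i, (start, value) in enumerate(zip(visit_times, values)):
        if start >= end_time:
            break  # visit at or after the end of follow-up is unused
        nxt = visit_times[i + 1] if i + 1 < len(visit_times) else end_time
        stop = min(nxt, end_time)
        rows.append({
            "id": subject_id, "start": start, "stop": stop,
            "cytokine": value,
            # The event can only occur in the final interval.
            "event": event if stop == end_time else 0,
        })
    return rows

# Hypothetical subject: visits at months 0, 12 and 18, infected at month 20.
toy_rows = to_counting_process("P001", [0, 12, 18], [4.1, 5.0, 7.3], 20, 1)
for row in toy_rows:
    print(row)
```

Rows in this layout correspond to the (start, stop, event) form that survival software generally expects for time-dependent covariates.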

3 Results

3.1 Kaplan–Meier survival curves analysis

Figure 3 shows the overall survival curve over time and the comparison of survival between the two treatment groups. Patients in the tenofovir arm have a better chance of surviving (a lower probability of HIV acquisition) than patients in the placebo group. The placebo curve has a steeper slope, indicating a higher HIV infection rate and therefore a worse survival prognosis. The curves plateau from the 24th month, indicating no further change in survival. The curves for the two treatment groups cross in the first few months and separate consistently afterwards. The log-rank test gives χ2 = 5.7, df = 1 and p-value = 0.02. Since the p-value is <0.05, we reject the null hypothesis and conclude that there is sufficient evidence that the two treatment groups differ significantly in terms of survival.
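The reported p-value can be reproduced from the test statistic alone. The sketch below assumes only that the log-rank statistic follows a chi-square distribution with 1 degree of freedom under the null hypothesis, in which case the tail probability equals `erfc(sqrt(x / 2))`; the helper name `chi2_sf_df1` is hypothetical.

```python
import math

def chi2_sf_df1(x):
    """Upper tail probability of a chi-square variable with df = 1."""
    return math.erfc(math.sqrt(x / 2.0))

p_logrank = chi2_sf_df1(5.7)   # statistic reported for Figure 3
print(f"p = {p_logrank:.4f}")  # about 0.017, i.e. 0.02 to two decimals
```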

Figure 3. Kaplan–Meier survival curves: left panel showing overall survival curve for all participants and right panel compares overall survival curves by treatment groups, placebo vs. tenofovir participants.

3.2 Stepwise Cox proportional hazard analysis

The results of the survival analysis based on the effects of cytokine biomarkers (mean, difference, and time-dependent effect) were obtained as follows. As a first step, we employed stepwise regression using the stepwiseCox function. Within the function we specified the following arguments: the model selection procedure as bidirectional, the model selection metric as the AIC, the significance level for entry into and exit from the model as 0.15, and the approximation method for ties as Efron. The bidirectional procedure was chosen because it allows variables to both enter and leave the model at each step. Moreover, backward selection produced the same results as bidirectional selection, while forward selection produced models with more covariates and a larger AIC. The AIC was used to determine the order in which effects enter and leave at each step of the bidirectional procedure. The value 0.15 is a commonly used p-value threshold, i.e., the significance level that a predictor variable must meet to be included or to stay in the model. Several approximation methods have been proposed to handle tied events in Cox regression, such as Breslow, Efron, and Exact, which differ in how the Cox partial likelihood is computed when events are tied. However, the Efron method performs better in terms of time, fit statistics, and differences in parameter estimates (43). We then tested the PH assumption for the selected covariates using the Schoenfeld residuals test (44) by applying the cox.zph function. The results for models 1–4 are shown in Tables 2–5.

Table 2. Multivariable Cox PH results for predictors of HIV survival among women aged 18–40 years (model 1).

Table 3. Multivariable Cox PH results for predictors of HIV survival among women aged 18–40 years (model 2).

Table 4. Multivariable Cox PH results for predictors of HIV survival among women aged 18–40 years (model 3).

Table 5. Multivariable Cox PH results for predictors of HIV survival among women aged 18–40 years (model 4).

The results of model 1 in Table 2 indicate that age at enrolment was the only significant baseline predictor of HIV hazard. The tenofovir treatment group had a lower hazard of HIV infection than the placebo treatment group (HR: 0.629, 95% CI: 0.405, 0.977). The adjusted hazard ratio for a 1-year increase in age at enrolment is 0.949 (95% CI: 0.902, 0.998). This implies that the hazard of HIV infection decreases with increasing age.
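Hazard ratios such as these are often restated as percentage changes in the hazard per unit increase in the covariate, using (HR − 1) × 100. The helper name `percent_change` below is hypothetical; the two hazard ratios are the model 1 values quoted above.

```python
def percent_change(hazard_ratio):
    """Percentage change in hazard per unit increase in the covariate.

    Negative values are decreases (HR < 1); positive values are
    increases (HR > 1).
    """
    return (hazard_ratio - 1.0) * 100.0

age_change = percent_change(0.949)        # age at enrolment, model 1
treatment_change = percent_change(0.629)  # tenofovir vs. placebo, model 1
print(f"age at enrolment: {age_change:.1f}% per year")
print(f"tenofovir vs. placebo: {treatment_change:.1f}%")
```

With these values, each additional year of age corresponds to roughly a 5.1% lower hazard, and assignment to tenofovir to roughly a 37.1% lower hazard.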

The model 2 results in Table 3 show that the tenofovir treatment group had a lower hazard of HIV infection than the placebo treatment group (HR: 0.486, 95% CI: 0.296, 0.798). For every unit increase in the average level of the cytokines IL-12P70, IL-16, B-NGF, SCGF-B, IL-17A and IL-3, there is a decrease in HIV hazard of 2.32% (HR: 0.977, 95% CI: 0.968, 0.986), 0.44% (HR: 0.996, 95% CI: 0.999, 0.999), 19.3% (HR: 0.807, 95% CI: 0.670, 0.973), 0.07% (HR: 0.995, 95% CI: 0.993, 0.999), 2.14% (HR: 0.979, 95% CI: 0.965, 0.992) and 1.1% (HR: 0.989, 95% CI: 0.982, 0.996), respectively. On the other hand, for every unit increase in the average level of the cytokines SCF, TNF-A, CTACK, IL-10, IL-5 and IFN-A2, there is an increase in HIV hazard of 11.31% (HR: 1.113, 95% CI: 1.077, 1.151), 1.77% (HR: 1.018, 95% CI: 1.009, 1.027), 3.94% (HR: 1.039, 95% CI: 1.016, 1.063), 4.9% (HR: 1.049, 95% CI: 1.013, 1.086), 10.6% (HR: 1.106, 95% CI: 1.004, 1.220), and 2.75% (HR: 1.028, 95% CI: 1.006, 1.050), respectively.

After including the mean value of the cytokine measurements as a covariate, the Cox model showed that age of the oldest sex partner, salary, years with a stable partner and whether the sex partner has other partners were significant baseline predictors associated with HIV infection. For every 1-year increase in the age of the oldest sex partner, the hazard of HIV decreases by 5.47% (HR: 0.945, 95% CI: 0.898, 0.995). Patients who earned a salary had a higher risk of HIV infection (HR: 2.474, 95% CI: 1.227, 4.987) than their counterparts who did not earn a salary. It was surprising to note that for every additional year with a stable partner there was about a 1.5-fold increase in the hazard of HIV infection (HR: 1.480, 95% CI: 1.023, 2.138). Moreover, the patients who did not know whether their sex partners had other sex partners had a higher HIV hazard (HR: 2.948, 95% CI: 1.241, 7.001) than those who knew their sex partners did not have other sex partners. Testing the PH assumption for the significant variables using the scaled Schoenfeld residual test indicated that CTACK (χ2: 4.710, df: 1, p: 0.03) did not meet the Cox PH assumption.

The results of model 3 in Table 4 show that the tenofovir treatment group had a lower hazard of HIV infection than the placebo treatment group (HR: 0.652, 95% CI: 0.238, 1.039). For every unit change (difference) in the cytokines B-NGF, IL-5, IL-16 and TRAIL, there is a decrease in the hazard of HIV infection of 10.38% (HR: 0.896, 95% CI: 0.851, 0.943), 7.7% (HR: 0.923, 95% CI: 0.882, 0.966), 0.08% (HR: 0.999, 95% CI: 0.998, 0.999) and 0.3% (HR: 0.997, 95% CI: 0.995, 0.999), respectively, while for the same change in the cytokines CTACK, IL-2 and PDGF-BB there is an increase in the hazard of HIV infection of 2.23% (HR: 1.022, 95% CI: 1.013, 1.032), 10.59% (HR: 1.106, 95% CI: 1.037, 1.179) and 0.49% (HR: 1.005, 95% CI: 1.002, 1.008), respectively. After including the difference (between the last observed and first values) of the cytokine measurements as a covariate, the Cox model showed that whether the sex partner has other partners, husband's income and age of the oldest sex partner were significant baseline predictors associated with HIV infection. For every 1-year increase in the age of the oldest sex partner, HIV risk decreases by 5.47% (HR: 0.945, 95% CI: 0.898, 0.995). Both patients who did not know whether their partners had other sex partners (HR: 2.948, 95% CI: 1.241, 7.001) and those who knew their sex partners had other partners (HR: 3.991, 95% CI: 1.535, 10.373) had a higher HIV hazard than those who knew their partners had no other sex partners. Additionally, those who received income from their husband (HR: 1.901, 95% CI: 1.023, 3.534) had a higher hazard of HIV than those who did not. Testing the PH assumption using the scaled Schoenfeld residual test indicated that all the significant variables in the model met the Cox PH assumption.

Table 5 presents the results of model 4, which indicate that the tenofovir treatment group had a lower hazard of HIV infection than the placebo group (HR: 0.652, 95% CI: 0.454, 0.938). The cytokines IL-15, SCGF-B, and GM-CSF were associated with an instantaneous decrease in HIV incidence of 9.11% (HR: 0.909, 95% CI: 0.851, 0.971), 0.07% (HR: 0.994, 95% CI: 0.992, 0.999), and 0.68% (HR: 0.993, 95% CI: 0.988, 0.998), respectively, at a particular time t. Conversely, SCF was associated with an instantaneous increase in HIV incidence of 4.02% (HR: 1.040, 95% CI: 1.022, 1.058) at a particular time t. With the cytokines as time-dependent covariates, the Cox PH analysis indicated that the significant baseline predictors were age of oldest sex partner, other source of income, age at debut, sex in the last 30 days, and years lived in Durban. For every one-year increase in the age of the oldest sex partner and in the patient's age at debut, the hazard of HIV infection decreased by 7.99% (HR: 0.920, 95% CI: 0.862, 0.982) and 17.63% (HR: 0.824, 95% CI: 0.704, 0.963), respectively. The less sex the patient had in the previous 30 days, the lower the patient's HIV risk, by 7.04% (HR: 0.930, 95% CI: 0.868, 0.995). Furthermore, for every additional year the patient lived in Durban, the hazard of HIV infection rose by 3.94% (HR: 1.039, 95% CI: 1.001, 1.080). Likewise, individuals with other sources of income had a 180.68% higher risk of HIV infection (HR: 2.807, 95% CI: 1.095, 7.197) than those without. Testing the PH assumption on the significant variables with the scaled Schoenfeld residual test showed that other sources of income (χ2: 5.288, df: 1, p: 0.022) and age of oldest sex partner (χ2: 5.426, df: 1, p: 0.020) violated the PH assumption.
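
For readers who want to check the Schoenfeld test results, a chi-square statistic with one degree of freedom can be converted to a p-value with the standard identity P(X > x) = erfc(√(x/2)). The sketch below applies it to the two statistics reported above; the function name is an illustrative assumption, and the results should land close to the reported p-values up to rounding.

```python
import math


def chi2_sf_df1(stat: float) -> float:
    """Upper-tail probability of a chi-square variable with 1 degree of freedom."""
    return math.erfc(math.sqrt(stat / 2.0))


# Test statistics quoted in the text for the two covariates flagged in model 4.
reported = [
    ("other income source", 5.288),
    ("age of oldest sex partner", 5.426),
]

for label, stat in reported:
    p_value = chi2_sf_df1(stat)
    print(f"{label}: chi2 = {stat}, p = {p_value:.3f}, flagged at 0.05: {p_value < 0.05}")
```

A p-value below 0.05 is read here as evidence against proportional hazards for that covariate.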

The overall performance of the models (models 1–4), shown in Table 6, indicates that model 4 had the lowest AIC, while model 1 had the highest. The overall survival for each model over time is depicted in Figure 4.

Table 6. Comparative tests to evaluate Cox PH model performances.

Figure 4. Comparative overall survival curves for model 1 (upper left panel), model 2 (upper right panel), model 3 (lower left panel) and model 4 (lower right panel).

The plot in Figure 5 shows how the effects of the covariates in model 4 (the model with the lowest fit scores in Table 6) change over time. The intercept of model 4 in Figure 5 had a smoothly increasing slope over time. The time-dependent cytokine covariates G-CSF, GM-CSF, IL-15, MIP-1B, and SCGF-B and the baseline covariates age at debut, sex in the last 30 days, age of oldest sex partner, and site had a decreasing slope over time. An increasing slope over time was observed for the time-dependent covariate SCF and the baseline covariates abnormal discharge, other income source, salary, and years lived in Durban. Table 7 indicates which cytokines overlap among models 2–4 and which are no longer significant in the subsequent models. Figure 6 illustrates the direction of change for the significant cytokines identified in the analysis of models 2–4.

Figure 5. Overall trend of the covariate effects (only significant baseline and cytokine covariates) for model 4 over time. Abbreviations of the baseline characteristics: agedebu, age at debut; p17v15_SEX_30DAYS, number of times had sex in 30 days; p17v26_AGE_OLDEST_SEX_PART, age of oldest sex partner; p19v14_ABNORMAL_DISCHARGEYes, abnormal discharge variable for yes category; p3v13_OTHER_INCOME_SOURCEYes, other income sources for category yes; p3v9_SALARYYes, salary variable for category yes; p5v9_LENGTH_IN_DBNVUL, number of years lived in Durban Vulindlela area; SiteVulindlela, site variable for category Vulindlela.

Table 7. Significant predictors of HIV survival among women aged 18–40 years for models 2–4.

Figure 6. Direction of change for the cytokines of interest confirmed by our study results.

4 Discussion

The global HIV pandemic remains a significant public health challenge, necessitating the continuous exploration of innovative preventive strategies (45). Pre-exposure prophylaxis (PrEP), particularly antiretroviral microbicide, has emerged as a promising intervention for individuals at high risk of HIV acquisition (4). However, variations in individual immune responses, particularly in cytokine profiles, may influence the efficacy of PrEP. Dynamic changes in immune states are known to be linked with HIV acquisition, and biomarkers add complementary detail on HIV risk to demographic and behavioral data (21). Recent research has highlighted the potential of cytokines as biomarkers in pre-exposure prophylaxis and has suggested them as potential predictors of HIV acquisition.

This study investigated the effect of individual cytokine biomarkers that change over time in determining HIV incidence among individuals randomized to PrEP vs. control exposure by building a series of Cox proportional hazards models. The Cox PH model is a regression method commonly used in medical research and other applications to investigate the association between survival time and one or more predictor variables (16). In its simple form, the Cox model handles time-fixed covariates. One of the strengths of the extended Cox model is its ability to incorporate covariates that change over time. This is practical because, at each event time, the Cox model compares the current covariate values of the subject experiencing the event with the current values of all other subjects who were at risk at that time (41). We incorporated stepwise regression in the Cox PH model to eliminate noisy variables and retain the best-fitting model (31).
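
To make the time-dependent formulation concrete, the following minimal sketch shows one common way of laying out longitudinal measurements for an extended Cox model: each participant's follow-up is split into (start, stop] intervals, each carrying the covariate value current during that interval. The function name, the example values, and the rule of carrying each measurement forward to the next visit are assumptions made for illustration; the source does not state how values between visits were handled.

```python
from typing import List, Tuple

Row = Tuple[float, float, float, int]  # (start, stop, cytokine value, event)


def to_intervals(visits: List[Tuple[float, float]],
                 follow_up_end: float,
                 had_event: bool) -> List[Row]:
    """Split one participant's follow-up into (start, stop] rows.

    Assumption: each measured value is carried forward until the next visit.
    """
    rows: List[Row] = []
    ordered = sorted(visits)
    for i, (start, value) in enumerate(ordered):
        if start >= follow_up_end:
            break  # measurement taken at or after the end of follow-up
        next_visit = ordered[i + 1][0] if i + 1 < len(ordered) else follow_up_end
        stop = min(next_visit, follow_up_end)
        if stop <= start:
            continue  # skip zero-length intervals from duplicate visit times
        # The event indicator is 1 only on the interval that ends follow-up.
        event = 1 if had_event and stop == follow_up_end else 0
        rows.append((start, stop, value, event))
    return rows


# Hypothetical participant: visits at months 0, 3 and 6, seroconversion at month 8.
rows = to_intervals([(0, 12.0), (3, 15.5), (6, 9.8)], follow_up_end=8, had_event=True)
for row in rows:
    print(row)
```

In this layout, the risk set at any event time consists of the rows whose interval contains that time, which is how the "current" covariate values of all at-risk subjects enter the comparison.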

The cytokine biomarkers in our data set change over time, i.e., they were measured longitudinally, indicating the presence of time-varying covariates. When such covariates exist, an analyst should consider taking them into account in survival modeling in order to improve estimation (15). The presence of time-dependent covariates in a model offers exciting opportunities for exploring associations and potentially causal mechanisms (46). However, using these variables is technically difficult with respect to the choice of covariate form, carries considerable potential for bias, and violates the assumption that the hazard ratio for any two individuals remains constant over time. We therefore improved the model fit by using cytokine variables derived from the longitudinal measurements. As a starting point, we fitted Model 1 (Equation 8), a traditional time-invariant (baseline covariates) Cox PH model. This model started with 24 variables, which were reduced to the seven that gave the best model fit, and it identified age at enrollment as the only significant predictor of HIV risk.

The first improved model (model 2, Equation 9) used baseline covariates plus the individual-level average of the cytokine measurements to better describe the average effect of the time-varying cytokine covariates. Through stepwise regression, the covariates were reduced from 72 to 30 in the final best-fit model. Comparing model 1 with model 2, we identified four additional baseline covariates (age of oldest sex partner, salary, years with stable partner, and sex partner having other sex partner) and twelve individual average cytokines (IL-3, IL-5, IL-10, IL-16, IL-17A, IL-12P70, CTACK, SCF, B-NGF, SCGF-B, TNF-A, IFN-A2) that were significantly associated with HIV risk. The predictive performance of model 2 was therefore better than that of model 1, with a lower AIC (919.3 vs. 1,064.5). This showed that not accounting for the cytokine effect in model 1 confounded the effect of other significant baseline characteristics.
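
The AIC comparison can be sketched as follows, using the usual definition AIC = 2k − 2ℓ, where k is the number of estimated parameters and ℓ is the maximized log-likelihood (for a Cox model, typically the partial log-likelihood). Only the two AIC values quoted above are used; the function names are illustrative assumptions.

```python
from typing import Dict, List, Tuple


def aic(log_likelihood: float, n_params: int) -> float:
    """Akaike information criterion; lower values indicate a better trade-off."""
    return 2.0 * n_params - 2.0 * log_likelihood


def rank_by_aic(aic_values: Dict[str, float]) -> List[Tuple[str, float, float]]:
    """Return (model, AIC, difference from the best AIC), sorted best first."""
    best = min(aic_values.values())
    ranked = [(name, value, value - best) for name, value in aic_values.items()]
    return sorted(ranked, key=lambda item: item[1])


# AIC values quoted in the text for models 1 and 2.
reported_aic = {"model 1": 1064.5, "model 2": 919.3}

for name, value, delta in rank_by_aic(reported_aic):
    print(f"{name}: AIC = {value:.1f}, delta = {delta:.1f}")
```

The difference column makes the size of the gap explicit, which is often easier to interpret than the raw AIC values.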

Model 3 (Equation 10) is the second improved Cox PH model, which consisted of baseline covariates plus the individual cytokine difference between the first and the last observed measurement. The final best-fit model had 19 covariates from an initial total of 72. Notably, the model had better predictive performance than model 1, as it had three additional baseline covariates (sex partner having other partners, husband's income, age of the oldest sex partner) and seven individual difference cytokines (IL-2, IL-5, IL-16, CTACK, PDGF-BB, B-NGF, and TRAIL) that were significant predictors of HIV infection. Furthermore, compared with model 2, three baseline covariates (age of the oldest sex partner, treatment group, and sex partner having other partner) were significant predictors in both models. However, there were fewer cytokine covariates than in model 2, with IL-5, IL-16, CTACK, and B-NGF being significant cytokine covariates in both models. When comparing the AIC of the models, model 3 had a lower AIC than model 1 but a slightly higher AIC than model 2. Model 3 captured individual-level changes in the cytokines and their association with HIV risk, thereby accounting for time. The major drawback of the model was that some individuals had a single measurement, so no change could be observed. Additionally, the model ignores the intermediate changes between the first and the last observed cytokine measurement, which implies a loss of information within individual cytokine measurements.
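
The two derived cytokine summaries discussed for models 2 and 3 can be illustrated with a short sketch. The measurement values are invented for illustration, the function names are assumptions, and the difference is taken as last minus first, which is one reading of the description above. The sketch also shows the two drawbacks just noted: a single measurement yields no difference, and two different trajectories can share the same difference.

```python
from statistics import mean
from typing import Optional, Sequence


def mean_level(values: Sequence[float]) -> float:
    """Individual-level average of the longitudinal measurements (model 2 style)."""
    return mean(values)


def first_last_difference(values: Sequence[float]) -> Optional[float]:
    """Last observed value minus first value (model 3 style).

    Returns None when fewer than two measurements exist, since no change is observable.
    """
    if len(values) < 2:
        return None
    return values[-1] - values[0]


# Hypothetical cytokine trajectories, ordered by visit.
steady = [12.0, 15.5, 9.8]
spiking = [12.0, 30.0, 9.8]
single = [4.0]

for label, series in [("steady", steady), ("spiking", spiking), ("single", single)]:
    print(label, round(mean_level(series), 2), first_last_difference(series))
```

Here the steady and spiking trajectories have identical first-to-last differences but different means, which is the information the difference summary discards.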

The last improved Cox PH model was model 4 (Equation 11), which used baseline covariates plus time-dependent cytokine covariates. The final best-fit model consisted of 14 variables out of 72. Compared with model 1, there were five additional baseline covariates (age of oldest sex partner, age at debut, other income source, sex in the last 30 days, and years lived in Durban) and four time-dependent cytokines (SCF, IL-5, SCGF-B, and GM-CSF). Moreover, age of oldest sex partner and IL-5 were significant predictors in all the improved models, while SCF and SCGF-B were predictors in both models 2 and 4. Likewise, CTACK, IL-5, IL-16, and B-NGF were significant predictors in both models 2 and 3. Table 7 indicates which cytokines overlap among models 2–4 and which are no longer significant in the subsequent models.

Overall, model 2 produced the greatest number of significant cytokine predictor variables, giving a researcher a wider perspective on which cytokine biomarkers are associated with HIV hazard. However, time information is lost in this model for the derived cytokine variables. Model 4, on the other hand, had the lowest AIC of all the models, making it the best model. This emphasizes that time-dependent covariates are a powerful tool for exploring predictive relationships. Nevertheless, their use and interpretation are much more complicated in practice than those of fixed (baseline) covariates. Furthermore, the potential for erroneous inference and modeling is increased (46).

Our findings reveal that incorporating cytokine biomarkers into the PH regression model not only enhances the model's predictive performance but also provides more insightful information about significant predictors linked to HIV incidence. These results are consistent with a recent study by Ignacio et al. (21), which found that changes in cytokine levels over time are highly predictive of HIV acquisition and that cytokines influence the effects of sociobehavioral risk factors on HIV acquisition. Although Ignacio et al. (21) used a different model (LASSO machine learning algorithms), a different dataset (Sabes study), and selected fewer biomarkers (10 cytokines), their study also highlighted the importance of immune activation markers in predicting HIV beyond traditional demographic and behavioral factors, aligning with our objective. Our analysis identified several baseline predictors, such as the age of the oldest sex partner, participant's age at enrollment, earning a salary or not, years with a stable partner, income source, whether the sex partner has other partners, and frequency of sex in the last 30 days, as significantly associated with HIV incidence. These findings align with those of other research studies (47–55).

In the previous analysis by Masson et al. (18), which investigated whether genital inflammation influenced HIV acquisition in women, 12 of the 48 available cytokine measurements were used. This selection was disadvantageous as it excluded other potentially relevant cytokine covariates. They utilized conditional logistic regression, which has limitations because the risk sets and time-dependent covariates are predefined, unlike in Cox regression, where these are calculated at the time of each case failure (56). Moreover, the Cox models employed in our study offer more statistical power than logistic regression models because they account for the time until the event occurs (57). Naranbhai et al. (19), using the same dataset, employed logistic regression and PCA to investigate the role of immune activation in HIV acquisition. PCA's assumption of linearity limits its effectiveness in interpreting the components, as they are linear combinations of the original variables (58). Ngcobo et al. (20), in their study examining whether pre-infection plasma cytokine concentrations predicted the rate of HIV disease progression in the same study cohort, used linear regression to assess the impact of each cytokine on viral load (VL) and the CD4:CD8 ratio in both bivariate and multivariable models. The major drawback of linear regression is its lack of consideration for time continuity (56). Notably, none of the previous studies exploring predictors of HIV progression (20, 22, 23) using the CAPRISA 004 trial considered cytokine biomarkers as time-varying covariates. This study underscores the importance of incorporating longitudinal risk factor information in predicting HIV incidence.

Our study results confirmed that the cytokines interleukin (IL-2, IL-3, IL-5, IL-10, IL-16, IL-12P70, and IL-17 alpha), stem cell factor (SCF), beta nerve growth factor (B-NGF), tumor necrosis factor alpha (TNF-A), interferon (IFN) alpha-2, serum stem cell growth factor (SCG)-beta, platelet-derived growth factor (PDGF)-BB, granulocyte macrophage colony-stimulating factor (GM-CSF), tumor necrosis factor-related apoptosis-inducing ligand (TRAIL), and cutaneous T-cell-attracting chemokine (CTACK) are directly associated with HIV infection, and identified new cytokine biomarkers to further enrich the field's literature. Figure 6 shows the direction of change for the cytokines mentioned. A better understanding of the role of cytokines before, during, and after HIV infection could therefore enable the development of new therapeutic approaches based on the use of either recombinant cytokines or particular antagonists, with the goal of limiting both HIV spread and the clinical manifestations of this infection (59).

Different cytokines play significant roles in HIV prevention and management with PrEP. Interleukins (ILs) such as IL-2 enhance T-cell proliferation and activation, aiding the immune response against HIV, and its levels can help assess immune activation efficacy in PrEP users. IL-3 and IL-5 regulate hematopoiesis and immune responses, with elevated levels indicating an ongoing immune response relevant for those exposed to HIV (60). IL-10, an anti-inflammatory cytokine, prevents excessive inflammation, with high levels suggesting reduced inflammation in PrEP patients. IL-12P70 and IL-17 alpha aid in differentiating and activating T-helper cells, promoting cell-mediated immunity, and protecting mucosal barriers, respectively (60). Monitoring these cytokines helps understand immune modulation in PrEP users. IL-16 attracts T-cells and other immune cells to infection or inflammation sites, a marker for immune activation in PrEP.

SCF and SCG-beta are essential for hematopoietic stem cell proliferation and differentiation, indicating bone marrow activity and the ability to replenish immune cells in PrEP users (61). B-NGF supports neuron survival and maintenance and has immunomodulatory effects. In PrEP users, B-NGF might influence neuroimmune interactions, affecting the nervous system's response to HIV exposure. TNF-A and TRAIL are significant in immune regulation and inflammation (10). TNF-A, a pro-inflammatory cytokine, indicates inflammation and immune activation, which is crucial for those at risk of HIV. TRAIL induces apoptosis in cancer and infected cells, helping eliminate HIV-infected cells in PrEP users. IFN Alpha-2 has antiviral properties, inhibiting HIV replication and modulating the immune response, providing additional protection in PrEP users. PDGF-BB aids in wound healing and tissue repair, helping maintain mucosal integrity and prevent HIV entry through mucosal surfaces in PrEP users. GM-CSF stimulates granulocyte and macrophage production, providing insights into immune readiness (62). CTACK directs T-cells to the skin, indicating immune surveillance at mucosal and skin surfaces to prevent initial HIV infection.

In clinical practice, these cytokines are useful biomarkers for monitoring individuals' immune status and response while using PrEP. Regularly measuring cytokines like IL-2, IL-10, TNF-A, IFN Alpha-2, IL-12P70, IL-17 alpha, and TRAIL can help assess immune activation, regulation, and the body's response to HIV exposure (60). This monitoring allows clinicians to evaluate the balance between immune activation and regulation, ensuring optimal immune response without excessive inflammation (11). Personalized PrEP strategies can be developed based on individual cytokine profiles, optimizing dosages and combinations of PrEP medications to enhance protection. Additionally, certain cytokines can indicate adverse immune reactions or inflammation, enabling timely interventions to manage side effects. Integrating cytokine monitoring into PrEP care enhances HIV prevention strategies, tailors interventions to individual needs, and improves clinical outcomes.

5 Conclusion

In this article, we investigated the effect of individual cytokine biomarkers, which are time-varying covariates, and provided ways of handling them in stepwise Cox PH modeling: as variables derived from the longitudinal measurements (mean and difference) and as time-dependent covariates (models 2–4). Including a cytokine effect improved the predictive performance of the model; hence, the improved models were more informative about predictors associated with HIV hazard. Moreover, tenofovir treatment exposure significantly lowered the hazard of HIV compared to the placebo group. Furthermore, the Kaplan–Meier estimator indicated that patients who received the tenofovir antiretroviral microbicide had a significantly lower risk of HIV infection than the placebo group, making it an effective treatment for reducing the risk of HIV in women aged 18–40 years.

Further investigation of the cytokine biomarker could involve utilizing the standard deviation of longitudinal measurements or lagged observations. Additionally, with internal time-varying covariates, one might explore employing joint modeling of longitudinal and survival data. The aim is to apply a model to a continually changing covariate that is measured longitudinally, potentially with error. This longitudinal model is linked to survival times by modeling the joint distribution of longitudinal and survival data.

Data availability statement

The data analyzed in this study is subject to the following licenses/restrictions. Researchers wanting to access data from the completed CAPRISA studies are requested to complete a data request form. Requests to access these datasets should be directed to https://www.caprisa.org/Pages/CAPRISAStudies.

Author contributions

SO: Writing – review & editing, Writing – original draft, Visualization, Validation, Software, Resources, Project administration, Methodology, Investigation, Formal analysis, Data curation, Conceptualization. MM: Writing – review & editing, Visualization, Validation, Supervision, Software, Formal analysis. HM: Writing – review & editing, Validation, Supervision, Methodology, Conceptualization.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This research was funded in whole or in part by Science for Africa Foundation to the Sub-Saharan Africa Consortium for Advanced Biostatistics (SSACAB II) programme [Grant Number DEL-22-009] with support from Wellcome Trust and the UK Foreign, Commonwealth & Development Office and is part of the EDCPT2 programme supported by the European Union. For purposes of open access, the author has applied a CC BY public copyright license to any Author Accepted Manuscript version arising from this submission.

Acknowledgments

The authors extend their sincere appreciation to CAPRISA for graciously granting permission to access and utilize the dataset in our research.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpubh.2024.1393627/full#supplementary-material

References

1. People of South Africa Statistics South Africa. (2022). Available online at: https://www.gov.za/about-sa/people-south-africa-0 (accessed February 28, 2024).

2. Simbayi L, Zuma K, Zungu N, Moyo S, Marinda E, Jooste S, et al. South African national HIV Prevalence, Incidence, Behaviour and Communication Survey, 2017: Towards Achieving the UNAIDS 90-90-90 Targets. Cape Town: HSRC Press (2019).

3. Stover J, Glaubius R, Kassanjee R, Dugdale CM. Updates to the spectrum/AIM model for the UNAIDS 2020 HIV estimates. J Int AIDS Soc. (2021) 24:e25778. doi: 10.1002/jia2.25778

4. Abdool Karim Q, Abdool Karim SS, Frohlich JA, Grobler AC, Baxter C, Mansoor LE, et al. Effectiveness and safety of tenofovir gel, an antiretroviral microbicide, for the prevention of HIV infection in women. Science. (2010) 329:1168–74. doi: 10.1126/science.1193748

5. Brenchley JM, Schacker TW, Ruff LE, Price DA, Taylor JH, Beilman GJ, et al. CD4+ T cell depletion during all stages of HIV disease occurs predominantly in the gastrointestinal tract. J Exp Med. (2004) 200:749–59. doi: 10.1084/jem.20040874

6. Mehandru S, Poles MA, Tenner-Racz K, Horowitz A, Hurley A, Hogan C, et al. Primary HIV-1 infection is associated with preferential depletion of CD4+ T lymphocytes from effector sites in the gastrointestinal tract. J Exp Med. (2004) 200:761–70. doi: 10.1084/jem.20041196

7. Roberts L, Passmore JAS, Mlisana K, Williamson C, Little F, Bebell LM, et al. Genital tract inflammation during early HIV-1 infection predicts higher plasma viral load set point in women. J Infect Dis. (2012) 205:194–203. doi: 10.1093/infdis/jir715

8. Fichorova RN, Tucker LD, Anderson DJ. The molecular basis of nonoxynol-9-induced vaginal inflammation and its possible relevance to human immunodeficiency virus type 1 transmission. J Infect Dis. (2001) 184:418–28. doi: 10.1086/322047

9. Mlisana K, Naicker N, Werner L, Roberts L, Van Loggerenberg F, Baxter C, et al. Symptomatic vaginal discharge is a poor predictor of sexually transmitted infections and genital tract inflammation in high-risk women in South Africa. J Infect Dis. (2012) 206:6–14. doi: 10.1093/infdis/jis298

10. Reuter MA, Pombo C, Betts MR. Cytokine production and dysregulation in HIV pathogenesis: lessons for development of therapeutics and vaccines. Cytokine Growth Factor Rev. (2012) 23:181–91. doi: 10.1016/j.cytogfr.2012.05.005

11. Kedzierska K, Crowe SM. Cytokines and HIV-1: interactions and clinical implications. Antivir Chem Chemother. (2001) 12:133–50. doi: 10.1177/095632020101200301

12. Breen EC. Pro-and anti-inflammatory cytokines in human immunodeficiency virus infection and acquired immunodeficiency syndrome. Pharmacol Ther. (2002) 95:295–304. doi: 10.1016/S0163-7258(02)00263-2

13. Stacey AR, Norris PJ, Qin L, Haygreen EA, Taylor E, Heitman J, et al. Induction of a striking systemic cytokine cascade prior to peak viremia in acute human immunodeficiency virus type 1 infection, in contrast to more modest and delayed responses in acute hepatitis B and C virus infections. J Virol. (2009) 83:3719–33. doi: 10.1128/JVI.01844-08

14. Seder RA, Grabstein KH, Berzofsky JA, McDyer JF. Cytokine interactions in human immunodeficiency virus-infected individuals: roles of interleukin (IL)-2, IL-12, and IL-15. J Exp Med. (1995) 182:1067–77. doi: 10.1084/jem.182.4.1067

15. Zhang Z, Reinikainen J, Adeleke KA, Pieterse ME, Groothuis-Oudshoorn CG. Time-varying covariates and coefficients in Cox regression models. Ann Transl Med. (2018) 6:121. doi: 10.21037/atm.2018.02.12

16. Lin DY, Wei LJ. The robust inference for the Cox proportional hazards model. J Am Stat Assoc. (1989) 84:1074–8. doi: 10.1080/01621459.1989.10478874

17. Mansoor LE, Abdool Karim Q, Yende-Zuma N, MacQueen KM, Baxter C, Madlala BT, et al. Adherence in the CAPRISA 004 tenofovir gel microbicide trial. AIDS Behav. (2014) 18:811–9. doi: 10.1007/s10461-014-0751-x

18. Masson L, Passmore JAS, Liebenberg LJ, Werner L, Baxter C, Arnold KB, et al. Genital inflammation and the risk of HIV acquisition in women. Clin Infect Dis. (2015) 61:260–9. doi: 10.1093/cid/civ298

19. Naranbhai V, Abdool Karim SS, Altfeld M, Samsunder N, Durgiah R, Sibeko S, et al. Innate immune activation enhances HIV acquisition in women, diminishing the effectiveness of tenofovir microbicide gel. J Infect Dis. (2012) 206:993–1001. doi: 10.1093/infdis/jis465

20. Ngcobo S, Molatlhegi RP, Osman F, Ngcapu S, Samsunder N, Garrett NJ, et al. Pre-infection plasma cytokines and chemokines as predictors of HIV disease progression. Sci Rep. (2022) 12:2437. doi: 10.1038/s41598-022-06532-w

21. Ignacio RAB, Dasgupta S, Valdez R, Pandey U, Pasalar S, Alfaro R, et al. Dynamic immune markers predict HIV acquisition and augment associations with sociobehavioral factors for HIV exposure. iScience. (2022) 25:105632. doi: 10.1016/j.isci.2022.105632

22. Redd AD, Mullis CE, Wendel SK, Sheward D, Martens C, Bruno D, et al. Limited HIV-1 superinfection in seroconverters from the CAPRISA 004 microbicide trial. J Clin Microbiol. (2014) 52:844–8. doi: 10.1128/JCM.03143-13

23. Garrett NJ, Werner L, Naicker N, Naranbhai V, Sibeko S, Samsunder N, et al. HIV disease progression in seroconvertors from the CAPRISA 004 tenofovir gel pre-exposure prophylaxis trial. J Acquir Immune Defic Syndr. (2015) 68:55–61. doi: 10.1097/QAI.0000000000000367

24. Simonoff JS. Analyzing Categorical Data, Vol 496. New York, NY: Springer (2003). doi: 10.1007/978-0-387-21727-7

25. Agresti A. Categorical Data Analysis, Vol. 792. Hoboken, NJ: John Wiley & Sons (2012).

26. Bland JM, Altman DG. Survival probabilities (the Kaplan-Meier method). BMJ. (1998) 317:1572–80. doi: 10.1136/bmj.317.7172.1572

27. Etikan I, Abubakar S, Alkassim R. The Kaplan-Meier estimate in survival analysis. Biom Biostatistics Int J. (2017) 5:00128. doi: 10.15406/bbij.2017.05.00128

28. Altman DG. Practical Statistics for Medical Research. Boca Raton, FL: CRC Press (1990). doi: 10.1201/9780429258589

29. Goel MK, Khanna P, Kishore J. Understanding survival analysis: Kaplan-Meier estimate. Int J Ayurveda Res. (2010) 1:274. doi: 10.4103/0974-7788.76794

30. Sawyer S. The greenwood and exponential greenwood confidence intervals in survival analysis. Applied survival analysis: regression modeling of time to event data. Department of Mathematics, Washington University in St. Louis (2003), p. 1–14. Available online at: https://www.math.wustl.edu/~sawyer/handouts/greenwood.pdf

31. Ruengvirayudh P, Brooks GP. Comparing stepwise regression models to the best-subsets models, or, the art of stepwise. Gen Linear Model J. (2016) 42:1–14.

32. Hogg RS, Heath KV, Yip B, Craib KJ, O'Shaughnessy MV, Schechter MT, et al. Improved survival among HIV-infected individuals following initiation of antiretroviral therapy. JAMA. (1998) 279:450–4. doi: 10.1001/jama.279.6.450

33. Bradburn MJ, Clark TG, Love SB, Altman DG. Survival analysis part II: multivariate data analysis-an introduction to concepts and methods. Br J Cancer. (2003) 89:431–6. doi: 10.1038/sj.bjc.6601119

34. Asano J, Hirakawa A, Hamada C. A stepwise variable selection for a Cox proportional hazards cure model with application to breast cancer data. Jpn J Biom. (2013) 34:21–34. doi: 10.5691/jjb.34.21

35. Shi P, Tsai CL. Regression model selection–a residual likelihood approach. J R Stat Soc B: Stat Methodol. (2002) 64:237–52. doi: 10.1111/1467-9868.00335

36. Ekman A. Variable selection for the Cox proportional hazards model?: A simulation study comparing the stepwise, lasso and bootstrap approach [Internet] (Dissertation) (2017). Available online at: https://urn.kb.se/resolve?urn=urn:nbn:se:umu:diva-130521

37. Hu F. Stepwise variable selection procedures for regression analysis. CRAN. R package version 01 0. (2018). doi: 10.32614/CRAN.package.My.stepwise

38. Johnson LL, Shih JH. An introduction to survival analysis. In: Gallin JI, Ognibene FP, editors. Principles and Practice of Clinical Research. Amsterdam: Elsevier (2007), p. 273–82. doi: 10.1016/B978-012369440-9/50024-4

39. Reinikainen J, Laatikainen T, Karvanen J, Tolonen H. Lifetime cumulative risk factors predict cardiovascular disease mortality in a 50-year follow-up study in Finland. Int J Epidemiol. (2015) 44:108–16. doi: 10.1093/ije/dyu235

40. Cox DR, Oakes D. Analysis of Survival Data, Vol. 21. Boca Raton, FL: CRC Press (1984).

41. Therneau T, Crowson C, Atkinson E. Using time dependent covariates and time dependent coefficients in the Cox model. Surv Vignettes. (2017) 2:1–25.

42. Ngwa JS, Cabral HJ, Cheng DM, Gagnon DR, LaValley MP, Cupples LA. Generating survival times with time-varying covariates using the Lambert W function. Commun Stat-Simula Comput. (2022) 51:135–53. doi: 10.1080/03610918.2019.1648822

43. Hertz-Picciotto I, Rockhill B. Validity and efficiency of approximation methods for tied survival times in Cox regression. Biometrics. (1997) 53:1151–6. doi: 10.2307/2533573

44. Abeysekera W, Sooriyarachchi M. Use of Schoenfeld's global test to test the proportional hazards assumption in the Cox proportional hazards model: an application to a clinical study. J Natl Sci Found Sri Lanka. (2009) 37:41–5. doi: 10.4038/jnsfsr.v37i1.456

45. Eisinger RW, Fauci AS. Ending the HIV/AIDS pandemic. Emerg Infect Dis. (2018) 24:413. doi: 10.3201/eid2403.171797

46. Fisher LD, Lin DY. Time-dependent covariates in the Cox proportional-hazards regression model. Annu Rev Public Health. (1999) 20:145–57. doi: 10.1146/annurev.publhealth.20.1.145

47. Kassanjee R, Welte A, Otwombe K, Jaffer M, Milovanovic M, Hlongwane K, et al. HIV incidence estimation among female sex workers in South Africa: a multiple methods analysis of cross-sectional survey data. Lancet HIV. (2022) 9:e781–90. doi: 10.1016/S2352-3018(22)00201-6

48. Anderegg N, Slabbert M, Buthelezi K, Johnson LF. Increasing age and duration of sex work among female sex workers in South Africa and implications for HIV incidence estimation: Bayesian evidence synthesis and simulation exercise. Infect Dis Modell. (2024) 9:263–77. doi: 10.1016/j.idm.2024.01.006

49. Wang H, Reilly KH, Brown K, Jin X, Xu J, Ding G, et al. HIV incidence and associated risk factors among female sex workers in a high HIV-prevalence area of China. Sex Transm Dis. (2012) 39:835–41. doi: 10.1097/OLQ.0b013e318266b241

50. Mavedzenge SN, Weiss HA, Montgomery ET, Blanchard K, de Bruyn G, Ramjee G, et al. Determinants of differential HIV incidence among women in three southern African locations. J Acquir Immune Defic Syndr. (2011) 58:89–99. doi: 10.1097/QAI.0b013e3182254038

51. Bazzi AR, Rangel G, Martinez G, Ulibarri MD, Syvertsen JL, Bazzi SA, et al. Incidence and predictors of HIV and sexually transmitted infections among female sex workers and their intimate male partners in northern Mexico: a longitudinal, multilevel study. Am J Epidemiol. (2015) 181:723–31. doi: 10.1093/aje/kwu340

52. Dunkle KL, Jewkes RK, Brown HC, Gray GE, McIntyre JA, Harlow SD. Transactional sex among women in Soweto, South Africa: prevalence, risk factors and association with HIV infection. Soc Sci Med. (2004) 59:1581–92. doi: 10.1016/j.socscimed.2004.02.003

53. Wand H, Ramjee G. The relationship between age of coital debut and HIV seroprevalence among women in Durban, South Africa: a cohort study. BMJ Open. (2012) 2:e000285. doi: 10.1136/bmjopen-2011-000285

54. Nel A, Louw C, Hellstrom E, Braunstein SL, Treadwell I, Marais M, et al. HIV prevalence and incidence among sexually active females in two districts of South Africa to determine microbicide trial feasibility. PLoS ONE. (2011) 6:e21528. doi: 10.1371/journal.pone.0021528

55. Kiyingi J, Nabunya P, Bahar OS, Mayo-Wilson LJ, Tozan Y, Nabayinda J, et al. Prevalence and predictors of HIV and sexually transmitted infections among vulnerable women engaged in sex work: findings from the Kyaterekera Project in Southern Uganda. PLoS ONE. (2022) 17:e0273238. doi: 10.1371/journal.pone.0273238

56. Essebag V, Platt RW, Abrahamowicz M, Pilote L. Comparison of nested case-control and survival analysis methodologies for analysis of time-dependent exposure. BMC Med Res Methodol. (2005) 5:1–6. doi: 10.1186/1471-2288-5-5

57. Van Der Net JB, Janssens ACJ, Eijkemans MJ, Kastelein JJ, Sijbrands EJ, Steyerberg EW. Cox proportional hazards models have more statistical power than logistic regression models in cross-sectional genetic association studies. Eur J Hum Genet. (2008) 16:1111–6. doi: 10.1038/ejhg.2008.59

58. Karamizadeh S, Abdullah SM, Manaf AA, Zamani M, Hooman A. An overview of principal component analysis. J Signal Inform Process. (2013) 4:173–5. doi: 10.4236/jsip.2013.43B031

59. Catalfamo M, Le Saout C, Lane HC. The role of cytokines in the pathogenesis and treatment of HIV infection. Cytokine Growth Factor Rev. (2012) 23:207–14. doi: 10.1016/j.cytogfr.2012.05.007

60. Freeman ML, Shive CL, Nguyen TP, Younes SA, Panigrahi S, Lederman MM. Cytokines and T-cell homeostasis in HIV infection. J Infect Dis. (2016) 214(suppl_2):S51–7. doi: 10.1093/infdis/jiw287

61. Cardoso HJ, Figueira MI, Socorro S. The stem cell factor (SCF)/c-KIT signalling in testis and prostate cancer. J Cell Commun Signal. (2017) 11:297–307. doi: 10.1007/s12079-017-0399-1

62. Frumkin LR. Role of granulocyte colony-stimulating factor and granulocyte-macrophage colony-stimulating factor in the treatment of patients with HIV infection. Curr Opin Hematol. (1997) 4:200–6. doi: 10.1097/00062752-199704030-00008

Keywords: cytokine biomarkers, HIV incidence, pre-exposure prophylaxis, stepwise Cox PH, Kaplan–Meier

Citation: Ogutu S, Mohammed M and Mwambi H (2024) Investigating the effects of cytokine biomarkers on HIV incidence: a case study for individuals randomized to pre-exposure prophylaxis vs. control. Front. Public Health 12:1393627. doi: 10.3389/fpubh.2024.1393627

Received: 29 February 2024; Accepted: 07 June 2024;
Published: 25 June 2024.

Edited by:

John Shearer Lambert, University College Dublin, Ireland

Reviewed by:

Wei Li Adeline Koay, Medical University of South Carolina, United States
Sayuri Seki, National Institute of Infectious Diseases (NIID), Japan

Copyright © 2024 Ogutu, Mohammed and Mwambi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Sarah Ogutu, ogutusarah@gmail.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.