Predicting Verbal Learning and Memory Assessments of Older Adults Using Bayesian Hierarchical Models

Ebrahim, Endris Assen; Cengiz, Mehmet Ali

doi:10.3389/fpsyg.2022.855379

ORIGINAL RESEARCH article

Front. Psychol., 14 April 2022

Sec. Quantitative Psychology and Measurement

Volume 13 - 2022 | https://doi.org/10.3389/fpsyg.2022.855379

Predicting Verbal Learning and Memory Assessments of Older Adults Using Bayesian Hierarchical Models

Endris Assen Ebrahim^1,2*

Mehmet Ali Cengiz¹

¹Department of Statistics, Faculty of Science and Literature, Institute of Graduate Studies, Ondokuz Mayis University, Samsun, Turkey
²Department of Statistics, College of Natural and Computational Sciences, Debre Tabor University, Gondar, Ethiopia

Verbal learning and memory summaries of older adults have usually been used to describe neuropsychiatric complaints. Bayesian hierarchical models are modern and appropriate approaches for predicting repeated measures data where information exchangeability is considered and a violation of the independence assumption in classical statistics. Such models are complex models for clustered data that account for distributions of hyper-parameters for fixed-term parameters in Bayesian computations. Repeated measures are inherently clustered and typically occur in clinical trials, education, cognitive psychology, and treatment follow-up. The Hopkins Verbal Learning Test (HVLT) is a general verbal knowledge and memory assessment administered repeatedly as part of a neurophysiological experiment to examine an individual’s performance outcomes at different time points. Multiple trial-based scores of verbal learning and memory tests were considered as an outcome measurement. In this article, we attempted to evaluate the predicting effect of individual characteristics in considering within and between-group variations by fitting various Bayesian hierarchical models via the hybrid Hamiltonian Monte Carlo (HMC) under the Bayesian Regression Models using ‘Stan’ (BRMS) package of R. Comparisons of the fitted models were done using leave-one-out information criteria (LOO-CV), Widely applicable information criterion (WAIC), and K-fold cross-validation methods. The full hierarchical model with varying intercepts and slopes had the best predictive performance for verbal learning tests [from the Advanced Cognitive Training for Independent and Vital Elderly (ACTIVE) study dataset] using the hybrid Hamiltonian-Markov Chain Monte Carlo approach.

1. Introduction

Verbal learning and memory tests are highly varied among older-aged adults due to various influences. Early cognitive intervention in older adults is a critical program to reduce the future risk of dementia (Thomas et al., 2019). The efficacy of the Chinese form Hopkins Verbal Learning Test (HVLT) for screening dementia and mild cognitive impairment in a Chinese population showed that HVLT scores were affected by age, education, and sex (Shi et al., 2012). The dataset of Advanced Cognitive Training for Independent and Vital Elderly (ACTIVE) study consists of two hierarchies in which four different repeated measures are nested within each participant (Luo and Wang, 2014). The outcome measures of the cognitive training interventions were the total HVLT from three learning trials and the baseline measure (Gross, 2011).

Bayesian logistic and hierarchical probit models of accuracy data that allow two levels of mixed-effects in repeated-measures designs have been implemented. The Bayes factor through the Bayesian information criterion estimate and the Widely applicable information criterion (WAIC) model selection techniques were used (Song et al., 2017). Duff (2016) used stepwise regression model to scrutinize the effect of age, education, and gender on HVLT scores in 290 cognitively intact older adults. The study revealed that age was negatively correlated with the HVLT score, while education status was positively correlated. Moreover, there were fewer gender differences among four repeatedly measured verbal learning tests (Lekeu et al., 2009).

Another study showed that besides capabilities through training, personal characteristics like age, unmarried status, and lower occupational cognitive requirements increased the likelihood of cognitive risk (Silva et al., 2012). Higher educational levels and active engagement in exercise may contribute to cognitive reserve and have a protective effect on cognitive decline in late life (Shen et al., 2021).

Gender effects on neuropsychological performance were negligible when the age and educational status of elderly people were controlled (Welsh-Bohmer et al., 2009). Recently, the Markov chain Monte Carlo (MCMC) methods have been widely used to generate samples from complicated and high-dimensional distributions (Hadfield, 2017). Among all Bayesian computational methods, the Hamiltonian Monte Carlo (HMC) (Almond, 2014) approach is the most efficient for approximating complex data structure models and converges faster than the traditional Metropolis-Hastings and Gibbs methods (Kruschke and Vanpaemel, 2015). The common MCMC approaches show poor performance and tremendously slow convergence in complex parameter structures (Yao and Stephan, 2021).

The HVLT is the ultimate in situations calling for multiple neuropsychological assessments (Benedict et al., 1998). Classical statistical inferences and single-level models have limitations for predicting naturally nest data. Bayesian hierarchical models (Congdon, 2020) were able to predict verbal learning test and memory scores from baseline personal characteristics, such as age, gender, cognitive status [mini-mental state exam (MMSE) score], years of education, and participants’ booster training and reasoning ability measured by training progress (Kuslansky et al., 2004).

In Bayesian inference, the WAIC, the leave one out information criterion (LOO-IC), and K-fold cross-validation (K-fold-CV) are recently developed measures of complexity penalized fitting models (Almond, 2014; Sivula et al., 2020). In this article, model comparisons and model selections were performed using these three methods under the Bayesian Regression Models using ‘Stan’ (BRMS) package of R (Bürkner, 2018). In most cases, WAIC and LOO-IC showed a slight preference for the random slope model over other models (Bürkner, 2018). However, the general model selection principle shows to choose the null model when diffuse priors are used in the parameters to be included or rejected by the algorithms (Liu, 2000). Therefore, in this article, we used the HMC approach to fit the three different Bayesian hierarchical models and select the best predictive model.

2. Materials and Methods

2.1 Data and Variables

The ACTIVE study was a randomized controlled trial conducted in 1999–2001 at six diverse research centers in the United States and organized by the New England Research Institutes (NERI). A total of 1,575 purposively selected older adults were included in this study (Willis et al., 2015), in which 26% of the participants were African American. The ACTIVE dataset accessed from the study of Willis et al. (2015) has 13 variables. However, this modeling paper used six explanatory variables, and the dependent variable HVLT is used as repeated measures of learning tests and memory ability. In this dataset, HVLT has four different repeated measurement scores doi: 10.3886/ICPSR04248.v3.

2.2 Bayesian Hierarchical Model for Repeated Measures Data

Suppose X is the matrix of explanatory variables, and Y is the outcome variable that is the Total Hopkins Verbal Learning Test Score (THVLTS). Besides the classical statistics, a more flexible Bayesian model is required that can accommodate the varying correlation between covariates and independent variables that occur in repeated measures-type longitudinal data. The general form of the Bayesian hierarchical model for repeated measures data can be expressed as:

Y_{N \times 1} = \underset{f i x e d e f f e c t s}{\underset{⏟}{X_{N \times p} β_{p \times 1}}} + \underset{r a n d o m e f f e c t s}{\underset{⏟}{Z_{N \times m q} U_{m q \times 1}}} + \underset{r e s i d u a l s (e r r o r t e r m)}{\underset{⏟}{ε_{N \times 1}}} (1)

Where Y denotes the vector ${(y_{1}^{'}, y_{2}^{'}, \dots y_{m}^{'})}^{'}$ of outcome variable; β denotes a vector of fixed effects parameters; U denotes a vector ${(u_{1}^{'}, u_{2}^{'}, \dots u_{m}^{'})}^{'}$ of associated random effects (specifictoeachsubject); X is a matrix of covariates (explanatory variables); Z denotes a block diagonal matrix of covariates for the random effects as a complement of X embraced of m blocks that each block has n_i × q dimension matrix and ε denote a column vector of residuals. We assumed that the random effects U∼N(0_d, Ω) and the residuals $ε \sim N (0_{n_{i}}, R = σ_{e}^{2})$ . Where U and ε are independently distributed. Based on the unknown vector of φ_Ω and φ_R, the unknown random effects in Ω and R can be written as Σ = (φΩ, φ_R) (Laird and Ware, 1982).

Y_{i} = X_{i}^{(F)} β^{(F)} + X_{i}^{(R)} β^{(R)} + ε_{i}

(\begin{matrix} Y_{i 1} \\ Y_{i 2} \\ ⋮ \\ Y_{i n_{i}} \end{matrix}) = (\begin{matrix} 1 t_{i 1} \\ 1 t_{i 2} \\ ⋮ ⋮ \\ 1 t_{i n_{i}} \end{matrix}) (\begin{matrix} a_{11} \\ a_{21} \end{matrix}) + (\begin{matrix} 1 t_{i 1} \\ 1 t_{i 2} \\ ⋮ ⋮ \\ 1 t_{i n_{i}} \end{matrix}) (\begin{matrix} u_{1 i} \\ u_{2 i} \end{matrix}) + (\begin{matrix} ε_{i 1} \\ ε_{i 2} \\ ε_{i 3} \\ ⋮ \\ ε_{i n_{i}} \end{matrix})

Where X is divided into two columns corresponding to fixed effects and a corresponding random effects design matrix denoted as $X_{i}^{(F)}$ and $X_{i}^{(R)}$ , respectively. And the parameters are divided into fixed effects β^(F) and random effects β^(R) = U. Cov(u_i,u_i) = Var(u_i) = Ω and

[\begin{matrix} u \\ β \end{matrix}] \sim M V N ((\begin{matrix} μ_{u} \\ μ_{β} \end{matrix}), (\begin{matrix} σ_{u}^{2} & {ρσ}_{u} σ_{β} \\ {ρσ}_{u} σ_{β} & σ_{β}^{2} \end{matrix}))

It can be assumed that the hyperparameters of both the intercept and the coefficient/slope model have uniform hyper-prior distributions with appropriate assumptions for the parameters μ_u, μ_β, σ_u, σ_β ve ρ. Then, the mathematical form of the three possible Bayesian hierarchical models (Nalborczyk and Vilain, 2019) for predicting the verbal learning and memory test with two (group/subject and time) random effects (Hilbe, 2009) can be written as follows:

Model 1: Null Model

Here, the model is fitted by varying the intercept without including any predictor variable. Thus, this model shows the overall within and between-subject variations of the outcome variable (Goldstein et al., 2009).

T H V L T S_{i} \sim N o r m a l (μ_{i}, σ_{e}), i = 1, 2, 3, \dots n

μ_{i} = α + α_{s u b j e c t [i]} + α_{t i m e [i]}

α_{s u b j e c t} \sim N o r m a l (0, σ_{s u b j e c t})

α_{t i m e} \sim N o r m a l (0, σ_{t i m e})

α \sim N o r m a l (0, 10)

σ_{s u b j e c t} \sim H a l f C a u c h y (0, 1)

σ_{t i m e} \sim H a l f C a u c h y (0, 1)

σ_{e} \sim H a l f C a u c h y (0, 1)

Model 2: Varying Intercept Model

Here, the BRMS command is fitted in R with varying intercepts for both clusters (i.e., participating subjects) and repeated measures (i.e., measurement time point) by including all predictor variables in the model. Thus, this model can be called a random intercept and fixed slope model (McGlothlin and Viele, 2018).

T H V L T S_{i} \sim N o r m a l (μ_{i}, σ_{e})

μ_{i} = α + α_{s u b j e c t [i]} + α_{t i m e [i]} + β X_{i}

α_{s u b j e c t} \sim N o r m a l (0, σ_{g r o u p})

α_{t i m e} \sim N o r m a l (0, σ_{t i m e})

β \sim N o r m a l (0, 10)

α \sim N o r m a l (0, 10)

σ_{s u b j e c t} \sim H a l f C a u c h y (0, 1)

σ_{t i m e} \sim H a l f C a u c h y (0, 1)

σ_{e} \sim H a l f C a u c h y (0, 1)

Model 3: Varying Slopes

Here, we can focus on examining the dependence between the random intercepts and the random coefficients (Bafumi and Gelman, 2011). In this case, we are interested in whether the effects of age and reasoning skill have correlations with variations in verbal and memory test skills measured by trail scores.

T H V L T S_{i} \sim N o r m a l (μ_{i}, σ_{e})

μ_{i} = α + α_{s u b j e c t [i]} + α_{t i m e [i]} + β X_{i} + (β + β_{s u b j e c t}) X_{i})

[\begin{matrix} α_{s u b j e c t} \\ β_{s u b j e c t} \end{matrix}] \sim M V N ([\begin{matrix} α \\ β \end{matrix}, S]

S = [(\begin{matrix} σ_{α} & 0 \\ 0 & σ_{β} \end{matrix}) R (\begin{matrix} σ_{α} & 0 \\ 0 & σ_{β} \end{matrix})] = [\begin{matrix} σ_{α, s u b j e c t}^{2} & σ_{α} σ_{β} ρ \\ σ_{α} σ_{β} ρ & σ_{β, s u b j e c t}^{2} \end{matrix}]

α_{s u b j e c t} \sim N o r m a l (0, σ_{s u b j e c t})

α_{t i m e} \sim N o r m a l (0, σ_{t i m e})

β \sim N o r m a l (0, 10)

α \sim N o r m a l (0, 10)

σ_{α, s u b j e c t} \sim H a l f C a u c h y (0, 1)

σ_{t i m e} \sim H a l f C a u c h y (0, 1)

σ_{e} \sim H a l f C a u c h y (0, 1)

σ_{β s u b j e c t} \sim H a l f C a u c h y (0, 1)

R \sim L K J_{c o r r} (2)

Where S is the covariance matrix, $R = (\begin{matrix} 1 & ρ \\ ρ & 1 \end{matrix})$ is the corresponding correlation matrix, and ρ is the association between intercepts and coefficients used in the calculation of S. The prior matrix R is the LKJ-correlation (Lewandowski et al., 2009) with a parameter ζ(zeta) which regulates the strength of the association.

As shown in Figure 1 above, each component of the mixed effect model appears in the graph as a node. The dotted arrows represent deterministic (fixed) dependencies between the parameters (e.g., from β to μ_ij), whereas the solid arrows represent probabilistic (random) dependencies (e.g., from $σ_{e}^{2}$ to Y_ij) (Bürkner, 2018). The hyper-parameters of the varying both intercept and slope model (μ_α, μ_β, σ_α, σ_β, and ρ) can be assumed to have hyper-prior distributions with appropriate assumptions for the parameters (Liu, 2016; Congdon, 2020).

FIGURE 1

Figure 1. A varying intercept and slope model (Bayesian Framework).

2.3 Bayesian Information Criterion for Model Comparison and Selection

Watanabe’s Widely Applicable (WAIC)

WAIC (Watanabe, 2010) could be achieved as an improvement over the divergence-based information criterion (DIC) for Bayesian models. The deviation term used in the calculation of the WAIC is Log-Point Based -Requires Predictive-Density (LNTTY). LNTTY is calculated as:

LNTTY = \sum_{i = 1}^{N} \log \int p (y_{i} | θ) \times p_{p o s t} (θ) d θ

The whole p_post(θ) is the posterior distribution used in the calculation of LNTTY. Similar to LNTTY, WAIC’s penalty term is purely Bayesian and is computed as:

p_{W A I C} = \sum_{i = 1}^{N} V a r_{p o s t} (\log p (y_{i} | θ))

Where p_WAIC is the penalty term which is the variance of the log-predictive-density terms aggregated over N data points. Thus, the WAIC can be calculated as:

W A I C = - 2 L P P D + 2 p_{W A I C}

Leave-One-Out Information Criteria (LOO-CV)

Bayesian leave-one-out cross-validation (LOO-CV) is different from the WAIC. Because there is no penalty term in its calculation. LOO-CV can be computed as:

L O O I C = - 2 L P P D_{l o o} = - 2 * \sum_{i = 1}^{N} \log

\int p (y_{i} | θ) \times p_{p o s t (- i)} (θ) d θ

Where p_post(−i)(θ) is the posterior distribution based on a sub-set of the data at point i from the dataset. LNTTY used i^th data points to calculate both the posterior distribution and the parameter estimation. Here, in contrast, the log-pointwise predictive density (LPPD_loo) is used the same for prediction only. Therefore, there is no need for a penalty term to correct potential bias by using the data twice (Vehtari et al., 2017).

K-Fold Cross-Validation

Sometimes, multiple Pareto Corrected Significance Sampling (PSIS-LOO) fails, and it takes too long to remodel in the iteration. Therefore, we can estimate LOO-CV using K-fold-CV by separating the data into completely random multiples, which leads to looking at each cross-validation estimate distinctly (Vehtari et al., 2018).

The Bayesian K-fold-CV partitions the dataset into k subsets y_k(k = 1, 2, …, K). The Bayesian hierarchical model (BHM) generates each training dataset y_{k_e} separately, which returns a p_post(e)(θ) = p(θ|y_{(k_e)}) posterior distribution (Vehtari and Gelman, 2014). To preserve reliability with WAIC and LOO-IC, defining the predictive accuracy of every point in the dataset is essential. Therefore, the log-predictive distribution function is

\log p_{p o s t (k . e)} (y_{ı}) - \log \int p_{p r e d} (y_{ı} | θ) p_{p o s t (k . e)} (θ) d θ, i ϵ k .

Using “S” simulations corresponding to a subset of k (usually K = 10) containing the i^th data point and the posterior distribution P(θ|y_{(k_e)}). The overall estimate of the expected log point predictive density for a new dataset is determined as follows:

\hat{e l p d_{v a l}} = \sum_{i = 1}^{n} {\hat{l p d}}_{i} = \sum_{i = 1}^{n} \log (\frac{1}{S} \sum_{s = 1}^{S} p (y_{i} | θ^{k, s}))

Therefore, a point estimate of the k-fold value is the sum of the iterative folds from the data points.

2.4 The Hamiltonian Monte Carlo Algorithm in Bayesian Regression Models Using ‘Stan’ Package of R

Similar to Gibbs sampling, HMC practices a proposal distribution that changes subject to the recent location in the parameter space (Liu, 2000). However, unlike the Gibbs algorithm, HMC does not rely on computing the conditional posterior distribution of parameters and sampling from it (Mai and Zhang, 2018). HMC has two advantages over other MCMC methods: little or no autocorrelation of the samples and fast mixing, i.e., the chain converges to the distribution immediately (Nalborczyk and Vilain, 2019). Therefore, it is the best approach for continuous distributions with low (auto) correlation and low rejection of samples.

When the model parameters are continuous rather than discrete, HMC, also known as Hybrid Monte Carlo, can overpower such random walk behavior using a clever scheme of supplementary variables that converts the tricky of sampling from the targeted function into the simulating Hamiltonian dynamics (Britten et al., 2021). HMC is an MCMC algorithm that avoids the random walk behavior and sensitivity to correlated parameters that outbreak other MCMC approaches by performing a series of steps informed by first-order gradient information (Hilbe, 2009).

The HMC algorithm is based on the Hamiltonian (total energy) calculating the trajectory for a time t = 0, …, T and then taking the final position X(T) = X_n+1.

The steps of the algorithm are as follows:

HMC algorithm

1. Choose a starting point and a velocity distribution θ₀ = X₀q(v)

2. for n = 0, …

3. Set the initial position as X(t = 0) = X_n

4. Draw a random initial velocity, v(t = 0)∼q(v);

5. Integrate the orbit numerically with the total energy for some time (use the Leapfrog method):

H (X, v) = U (X) + K

= - \log p (X) - \log q (v) T

6. Calculate the probability of acceptance:

α (X_{n + 1}, X_{n}) = m i n {1, \frac{e x p [- H (X_{n + 1}, v_{n + 1})]}{e x p [- H (X_{n}, v_{n})]}}

7. Set X_n+1 = X(t = T)

8. Increment

3. Results

In practice, the three basic Bayesian hierarchical models have been fitted in BRMS default settings, and population-level (fixed) effects and subject-level (random) effects were obtained (Luo et al., 2021). All three models (Models 1, 2, and 3) had both fixed and random (mixed) parts but with different estimated parameter types. In the result, the estimate shows the posterior mean and Est. Error is the SD for each parameter. Model convergence was achieved well enough both the bulk effective sample size (Bulk_ESS) and the tail effective sample size (Tail_ESS) for the 95% CIs were adequate (Vehtari et al., 2017). In general, every parameter is summarized using the posterior distribution’s mean (“Estimate”) and SD (“Est. Error”), as well as two-sided 95% credible intervals as lower and upper bounds based on quintiles.

Table 1 of the fixed effects shows that the posterior mean verbal testing score was estimated to be 26.33 with an SD of 0.73. The 95% credible interval shows that the posterior distribution mean (intercept) was significant. On the other hand, the random effect showed significant verbal score test variation between groups (participant subjects) and within-subjects (between different measurements of different time points). Thus, according to the null model, the HVLT score showed more between-group/subject variation than within-group (between repeated measurements) variation.

TABLE 1

Table 1. Results from the fitted null model: Model 1.

Table 2 showed that the coefficient of booster training was positive with a zero overlapping 95% CI. This indicates that, on average, there is little evidence that taking booster training increases elderly adults’ verbal learning and memory test scores by 0.1865, but the evidence-based on the data and random intercept model. On the other hand, adults’ years of education (edu) estimate was negative with a zero overlapping 95% CI. This negative estimate indicates that, on average, in the random intercept model, there is little evidence that increasing the years of education decreases elderly adults’ verbal learning and memory test scores by 0.0034 units.

TABLE 2

Table 2. Results from the fitted varying intercept model: Model 2.

According to the predictive effects of each explanatory variable shown in Figure 2 and Table 3, taking booster training, age, and gender were the most influential factors affecting participants’ cognitive verbal test and memory ability. Table 3 reveals that there is also an adverse association between the intercepts and coefficients for reasoning ability, which implies reasoning ability has a large average score value showing additional variability by poor reasoning ability than by good reasoning ability. Nevertheless, it can be seen that the slope estimate of such a model is even further unreliable than that of the preceding models, as it can be clearly understood from the associated standard error and the size of the 95% CIs. Table 3 also showed that booster training had a significant positive predictive effect on elderly adults’ verbal learning and memory test scores. In contracts, adults’ years of education had a significant negative impact on elderly adults’ verbal learning and memory test scores.

FIGURE 2

Figure 2. Bayesian hierarchical varying slope convergence diagnosis.

TABLE 3

Table 3. Results from the fitted varying slope mode: Model 3.

We also noticed in Figure 2 and Figure 3 below that adding any term to the early model showed predictive performance improvements on the fitted models are ordered from Models 1 to 3 (full model). However, such a result may not be interpreted as a universal rule, subsequent adding extra terms to a unique model may also result in overfitting, which corresponds to a condition in which the fitted model is over-specified about the data, making the model good at clarifying the sample dataset but poor at predicting no observed data. The model convergence diagnosis plots are hairy caterpillars which showed the model converged. On the other hand, the models have well converged based on the estimated statistical values. This means that the R-hat $(\hat{R})$ statistics were close to 1 and the (bulk and tail) ESSs values were sufficiently high when ESS > 100 was chosen as the cutoff (Vehtari et al., 2021). The majority of parameters still showed sufficiently high ESS values when more conservative cutoffs were chosen (i.e., ESS > 400 or even 1,000, see Zitzmann and Hecht, 2019).

FIGURE 3

Figure 3. Bayesian hierarchical varying slope convergence diagnosis (Continuous).

Based on the fitted varying slope model, which accounted for six predictors from the data, fixed effects showed that age, gender, reasoning ability, and booster training were significant predictors of verbal learning and memory test scores, whereas random-effect showed that much of the variation in test scores occurred within-subjects (between measurement time points) than between subjects.

After we have built the three different models, it is necessary to identify relatively the best model that can be used to predict the outcome variable and make inferences. However, choosing the model that has the best predictive and a better fit on the actual data is complicated with diverse information criteria since all selected models on the actual data might not essentially achieve as fit on a different dataset. In its place, it is necessary to decide on a model that fits best in terms of predicting new data which had not been practiced.

In case of the non-existence of extra information, cross-validation methods such as WAIC and LOO-CV can be used. According to Table 4, the varying slope model has the lowest WAIC, LOO-IC, and 10-fold estimates. However, the difference is relatively small when we compare the difference in estimates of criteria for each model and the corresponding standard errors (in the column SE).

TABLE 4

Table 4. Model comparisons based on predictive performance.

Among the fitted models above, it looks like the final model (Model 3) in the HMC algorithm is the best model. Therefore, as a function of the six explanatory variables and the random coefficient for age and reasoning ability, Model 3 has the best predictive performance for the cognitive HVLT.

According to Figure 4, the varying slope and intercept model fit well and produced nearly identical posterior observed density and posterior predictive distribution plots of the outcome variable of THVLTS from the ACTIVE study.

FIGURE 4

Figure 4. Bayesian hierarchical varying slope fitted model on the observed and predicted outcomes.

Furthermore, the marginal effect of each predictor variable revealed (Figure 5) that age and reasoning skills are the most significant explanatory variables that predict the THVLTS of the ACTIVE study.

FIGURE 5

Figure 5. Bayesian hierarchical varying slope model marginal prediction effects.

4. Discussion

Based on the selected sample participants in the ACTIVE study dataset (Willis et al., 2015), the Bayesian hierarchical linear models of three types were fitted by considering only six explanatory variables as predictors of the cognitive verbal learning test. The null model without any predictor effect but with only the intercept term was fitted, and it shows a mass of cognitive verbal learning ability variability across subjects. The varying intercept model with the addition of all predictor variables was fitted; and getting booster training, age, and reasoning ability were significant predictor of verbal test scores (Duff, 2016). The varying coefficient/slope model (i.e., Model 3) is the best-fitted model than the other fitted models since it had the lowest WAIC, LOO-IC, and 10-fold estimates (Bafumi and Gelman, 2011). A bulk of participants’ cognitive verbal test scores variations were observed between subjects (Ryoo, 2011). The full hierarchical model with varying intercepts and slopes has the best performance for predicting verbal learning tests (from ACTIVE study dataset) using the hybrid Hamiltonian Markov Chain Monte Carlo approach.

Socio-demographic and training-related characteristics influence elderly verbal learning tests that can be measured in multiple occupations (Welsh-Bohmer et al., 2009).

5. Conclusion

Total Hopkins Verbal Learning Test Score from the ACTIVE study can be used as a measure of elderly adults’ cognitive verbal learning ability. Four demographic characteristics of adults, such as age, gender, educational status, and cognitive status (MMSE score), were measured at the baseline, and characteristics measured after cognitive training such as reasoning ability and booster training were considered. THVLTS from the ACTIVE study can be used as a measure of elderly adults’ cognitive verbal learning ability. According to the findings, the varying intercept and slope model fit best, and age, gender, booster, and reasoning ability are the main significant predictors for THVLTS, which measures cognitive verbal learning. Taking booster training had a positive significant predictive effect, while years of education (edu) had a negative significant predictive effect on THVLTS.

Data Availability Statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/supplementary material.

Ethics Statement

Ethical review and approval was not required for the study on human participants in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements. Written informed consent was not obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article. This is because this quantitative analysis and modeling paper used open-access secondary data on repeated measurements.

Author Contributions

EE participated in all aspects of the study: designing the study, performing data management, conducting the data analysis, writing the first draft of the manuscript, and discussing with MC to improve the manuscript, as it is a part of the first author’s Ph.D. dissertation. MC participated in revising the manuscript, commenting, and proofreading. Both authors listed have made a substantial, direct, and intellectual contribution to the manuscript and approved it for publication.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

Our great appreciation and thanks are forwarded to NACDA members for the helpful repeated measured data collected and accessibility.

References

Almond, R. G. (2014). A comparison of two MCMC algorithms for hierarchical mixture models. CEUR Workshop Proc. 1218, 1–19. doi: 10.1016/j.neuroimage.2008.02.017

PubMed Abstract | CrossRef Full Text | Google Scholar

Bafumi, J., and Gelman, A. (2011). Fitting multilevel models when predictors and group effects correlate. SSRN Electron. J. 2, 8–14. doi: 10.2139/ssrn.1010095

Predicting Verbal Learning and Memory Assessments of Older Adults Using Bayesian Hierarchical Models

1. Introduction

2. Materials and Methods

2.1 Data and Variables

2.2 Bayesian Hierarchical Model for Repeated Measures Data

Model 1: Null Model

Model 2: Varying Intercept Model

Model 3: Varying Slopes

2.3 Bayesian Information Criterion for Model Comparison and Selection

Watanabe’s Widely Applicable (WAIC)

Leave-One-Out Information Criteria (LOO-CV)

K-Fold Cross-Validation

2.4 The Hamiltonian Monte Carlo Algorithm in Bayesian Regression Models Using ‘Stan’ Package of R

3. Results

4. Discussion

5. Conclusion

Data Availability Statement

Ethics Statement

Author Contributions

Conflict of Interest

Publisher’s Note

Acknowledgments

References

Appendix

The Priors in Bayesian Hierarchical Models and Sensitivity Analysis

94% of researchers rate our articles as excellent or good