A stacked ensemble machine learning model for the prediction of pentavalent 3 vaccination dropout in East Africa

Alemayehu, Meron Asmamaw; Kebede, Shimels Derso; Walle, Agmasie Damtew; Mamo, Daniel Niguse; Enyew, Ermias Bekele; Adem, Jibril Bashir

doi:10.3389/fdata.2025.1522578

ORIGINAL RESEARCH article

Front. Big Data , 07 April 2025

Sec. Medicine and Public Health

Volume 8 - 2025 | https://doi.org/10.3389/fdata.2025.1522578

A stacked ensemble machine learning model for the prediction of pentavalent 3 vaccination dropout in East Africa

$\r\nMeron Asmamaw Alemayehu$ Meron Asmamaw Alemayehu¹^*

Shimels Derso Kebede²

Agmasie Damtew Walle³

Daniel Niguse Mamo⁴

Ermias Bekele Enyew²

Jibril Bashir Adem⁵

¹Department of Epidemiology and Biostatistics, College of Medicine and Health Sciences, Institute of Public Health, University of Gondar, Gondar, Ethiopia
²Department of Health Informatics, School of Public Health, College of Medicine and Health Science, Wollo University, Dessie, Ethiopia
³Department of Health Informatics, School of Public Health, Asrat Woldeyes Health Science Campus, Debre Berhan University, Debre Birhan, Ethiopia
⁴Department of Health Informatics, College of Medicine and Health Sciences, Arba Minch University, Arba Minch, Ethiopia
⁵Department of Public Health, College of Medicine and Health Science, Arsi University, Asella, Ethiopia

Introduction: Vaccination is critical for reducing childhood mortality, yet completion rates for the third dose of the pentavalent vaccine (Penta 3) in East Africa remain inadequate. This study aims to predict Penta 3 vaccination dropout using a stacking ensemble machine learning model with Demographic and Health Survey (DHS) data. The objective is to identify predictors of dropout and enhance intervention strategies.

Methods: The study utilized seven base machine learning algorithms to create a stacked ensemble model with three meta-learners: Random Forest (RF), Generalized Linear Model (GLM), and Extreme Gradient Boosting (XGBoost). The H2O package facilitated the development of base learners and the stacking of super learners. Feature selection (FS) and comparisons were performed using the LASSO and Boruta algorithms. The selected features were one-hot encoded, and ordinal encoding was applied where appropriate. Hyperparameter optimization (HPO) and comparisons were conducted using grid search and random search. Model performance was assessed using five key metrics, including accuracy and the area under the curve (AUC). SHAP (Shapley Additive Explanations) values were used to interpret the model outputs and identify influential predictors. The experimental design was employed to present the results.

Results: Four experiments were conducted to evaluate feature selection and HPO methods. All stacked ensemble models outperformed individual learners, with the XGBoost meta-learner optimized with grid search and LASSO FS achieving the highest performance: 93.9% accuracy and 99.4% AUC. While RF and GLM meta-learners were also evaluated, they were outperformed by the XGBoost meta-learner. SHAP analysis revealed key features influencing Penta 3 dropout, including the place of delivery, decision-making autonomy, the mother's level of earning, and healthcare access. Home delivery increased the risk of dropout, while postnatal care by midwives and health insurance coverage lowered dropout likelihood.

Conclusion and recommendation: This study provides insights into the factors influencing Penta 3 vaccination dropout in East Africa. To reduce dropout rates, interventions should focus on enhancing maternal livelihood opportunities, improving healthcare access in rural areas, and promoting institutional deliveries.

Introduction

Childhood vaccination dropout is an ongoing public health challenge, particularly in sub-Saharan Africa, where achieving high vaccination coverage remains a priority. Among the vaccines included in the Expanded Program on Immunization (EPI), the pentavalent vaccine is essential for protecting children from five life-threatening diseases: diphtheria, tetanus, whooping cough, hepatitis B, and Haemophilus influenzae type B. While significant progress has been made in expanding vaccine coverage across many regions, the third dose of the pentavalent vaccine (Penta3) continues to see relatively low completion rates, especially in low- and middle-income countries, including those in East Africa. The failure to complete the full pentavalent series, particularly the third dose, poses serious public health risks, as incomplete vaccination leaves children vulnerable to preventable diseases and undermines efforts to achieve herd immunity. Addressing this dropout issue is crucial for improving vaccination coverage and preventing further strain on healthcare systems in the region (World Health Organization, 2023).

Various factors contribute to the high rates of pentavalent vaccine dropout. Logistical challenges, including limited access to healthcare facilities, transportation barriers, and inadequate infrastructure in rural areas, make it difficult for parents to return for subsequent doses. Additionally, misinformation and vaccine hesitancy, often fueled by myths and misinformation circulating through the community and social media, lead many caregivers to question the safety and necessity of vaccines. These issues are exacerbated by weak health systems that struggle with maintaining consistent vaccine supplies and outreach programs. Consequently, children in the region remain at heightened risk for vaccine-preventable diseases, and healthcare systems are further burdened by the failure to complete vaccination schedules (Tsegaw et al., 2024; World Health Organization, 2023).

This study aims to develop and evaluate a stacked ensemble machine learning model that can accurately predict pentavalent 3 vaccination dropout in East Africa and address these challenges by developing a robust prediction model. The model's goal is to identify and analyze the key predictors contributing to vaccination dropout, with the intention of providing actionable insights for targeted public health interventions. By employing a data-driven approach, this study fills a significant gap in the existing literature on vaccine dropout, offering a more objective and precise understanding of the issue. Additionally, it showcases the utility of machine learning models in public health research, demonstrating how such tools can be applied to complex, real-world problems that are difficult to address using traditional methods.

This research makes several important contributions. First, it introduces the use of a stacked ensemble machine learning model to predict Penta3 dropout, a novel methodology in the context of vaccine dropout prediction. This model integrates multiple machine learning algorithms to improve prediction accuracy and robustness, offering a significant advancement over stand-alone models, which have been more commonly used in similar studies. Second, this study has practical implications for policymakers and public health organizations, including the World Health Organization (WHO), that are working to improve vaccination completion rates. Despite the WHO's goal of achieving 90% coverage for the third dose of the pentavalent vaccine, many East African countries are still far from this target (World Health Organization, 2020, 2024). By identifying the socioeconomic, geographic, and healthcare system factors that influence vaccine dropout, this study provides evidence-based recommendations for interventions that are tailored to the specific needs and challenges of the region.

From a theoretical perspective, this research contributes to the growing field of machine learning applications in public health. By using a stacked ensemble framework, the study demonstrates how this technique can capture complex, non-linear relationships between various factors influencing vaccination behaviors. The approach used in this study is not only applicable to Penta3 dropout but also has the potential to inform similar studies in other areas of public health where predicting health-related behaviors can enhance intervention strategies. Additionally, this work offers new insights into the theoretical understanding of vaccination behavior by analyzing how different factors, ranging from socio-economic to health system-related, interact to affect vaccination completion rates.

In conclusion, this study presents a novel machine learning-based framework for predicting Penta3 dropout in East Africa, addressing a gap in existing research while providing a detailed analysis of the factors contributing to vaccination dropout. The insights gained from this study can inform future research and public health policies aimed at improving vaccination coverage and reducing dropout rates. Ultimately, the findings have the potential to guide targeted interventions that can contribute to better child health outcomes in East Africa and other similar regions around the world.

Related works

The prediction of vaccination dropout using machine learning (ML) techniques has gained increasing attention in recent years. Various studies have explored different methods for predicting vaccination behavior and outcomes. However, most of these studies focus on overall vaccine dropout (not specific vaccines), use statistical methods, focus on a single country, do not use model interpretability analysis like SHAP, and/or rely on single machine learning algorithms, often limiting their predictive power by not leveraging the strengths of multiple algorithms through ensemble learning.

One early work by Chandir et al. (2018) applied decision tree-based models to predict childhood vaccination dropout in low-resource settings. The model showed moderate success, particularly in identifying key predictors, such as maternal education and distance to health facilities, but struggled with generalization across different datasets. This study demonstrated the potential of ML models but highlighted the need for more sophisticated approaches to handle complex socio-demographic patterns. Similarly, a study by Kayembe-Ntumba et al. (2022) used logistic regression models to predict overall vaccination dropouts, focusing on a few socio-economic factors such as rural residence, unavailability of seats, and lack of a reminder system. While the study succeeded in identifying high-risk groups for dropout, the model lacked flexibility and failed to account for non-linear relationships between variables, reducing its overall predictive performance.

More recent studies have turned to ensemble methods to improve model performance. For instance, Demsash et al. (2023) employed Random Forests (RF) and Gradient Boosting Machines (GBM) to predict childhood vaccination in Ethiopia, demonstrating an increase in accuracy compared to single models. However, their study was limited since only one country was involved (i.e., Ethiopia), it was not focused on a specific childhood vaccine, no model interpretability method was employed, and only individual models were evaluated. Similarly, Nwachukwu (2024) applied machine learning techniques to predict immunization completion in Ogun State, Nigeria. The study employed Logistic Regression, Support Vector Machine (SVM), and K-Nearest Neighbors (KNN) models to analyze immunization patterns using retrospective data from 8,808 immunization records. Logistic Regression was favored with an accuracy of 99.77%, outperforming SVM and KNN. While the study provided valuable insights into immunization in a specific region of Nigeria, it lacked other strong ML models such as XGBoost and RF and also lacked model interpretability tools such as SHAP analysis and did not focus on vaccination dropout explicitly, but rather on overall completion rates. Furthermore, the data was limited to a single locality within Nigeria, limiting its generalizability to broader regions like East Africa.

In a separate study conducted in The Gambia, Ntenda et al. (2022) utilized Generalized Estimating Equation (GEE) models to examine the determinants of pentavalent and measles vaccination dropouts using data from the 2019–20 Gambia Demographic and Health Survey. This study identified key factors influencing dropout rates, such as antenatal care attendance, possession of a health card, and urban residency. While the study was important for highlighting vaccination challenges in The Gambia, it focused primarily on statistical methods and did not incorporate advanced machine learning techniques. Additionally, the scope was limited to The Gambia, without extending to other East African countries.

In contrast to these approaches, our study takes a more advanced approach by employing a stacked ensemble model, integrating predictions from multiple base learners such as Naive Bayes, Random Forests, Gradient Boosting Machines, eXtreme Gradient Boosting, and Deep Learning, among others. The novelty of our approach lies primarily in the stacking process, where a meta-learner is used to combine the predictions of base models, leveraging the strengths of each learner while mitigating their weaknesses. This model architecture allows for enhanced predictive accuracy and robustness, particularly in handling the complex, non-linear relationships often present in socio-demographic data. Previous studies, such as those by Hu et al. (2021), have shown that stacking models outperform traditional ML algorithms in healthcare settings, but their focus was limited to predicting hospital admissions rather than vaccination dropout. By extending this methodology to the prediction of Pentavalent 3 vaccination dropout, we address a critical gap in the literature.

Moreover, by leveraging comprehensive, high-quality, and representative data from the DHS, we enhanced model interpretability and transparency using SHAP analysis, which is not applied in the related works mentioned. Our study's unique focus on Pentavalent 3 dropout across multiple East African countries makes it a significant contribution compared to single-country or general vaccination studies. These combinations not only improve predictive performance but also provide insights into the key drivers of Penta 3 vaccination dropout in East Africa, making it highly relevant for policymakers and public health practitioners working toward vaccine coverage in the region.

Methods

Data source

The study utilized data from the Demographic and Health Surveys (DHS) conducted across multiple East African countries. DHS surveys are nationally representative and provide comprehensive information on health indicators, including vaccination coverage and demographic characteristics. Specifically, we accessed the latest available DHS datasets for each country in the region, ensuring consistency and relevance across the study period. These datasets are publicly accessible and rigorously collected using standardized methodologies, which enhances the reliability and comparability of our findings. By leveraging DHS data, we aimed to capture the diverse socio-demographic profiles and health service utilization patterns that influence vaccination behaviors in East Africa. Ethical considerations regarding data use and participant anonymity were adhered to, respecting the confidentiality and rights of survey respondents (The DHS Program, 2024).

Modeling software and packages

The machine learning analysis for this project was conducted using the R programming language, primarily in a Google Collaboratory environment. The h2o package was utilized to implement a stacked ensemble model consisting of seven base learners.

The integration of these tools provided a robust framework for model development and optimization. The choice of Collaboratory facilitated efficient computation without the need for local hardware resources, ensuring scalability and accessibility in the research workflow.

Study variables

Target

The primary outcome variable examined in this study was Penta 3 vaccination dropout, defined as the failure to complete the third dose of the pentavalent vaccine within the recommended timeframe (1 = dropped out and 0 = received/completed).

Features

Predictive variables encompassed a wide array of socio-demographic, economic, and geographic factors known to influence vaccination behaviors. These included maternal age, educational attainment, household income, urban or rural residence, distance to the nearest health facility, and accessibility to healthcare services. Each variable was carefully selected based on its theoretical relevance and empirical evidence linking it to vaccination uptake and completion rates in low-resource settings. The complete list of features and their label is presented in the Supplementary Table 1.

Data preprocessing

To prepare DHS data for a stacked ensemble machine learning model, comprehensive preprocessing is required. Data cleaning, addresses inconsistencies, duplicates, and incorrect values, ensuring that only high-quality information informs model training. Target and feature engineering refines both target variables and predictors to optimize predictive accuracy. This includes techniques like lumping categories in sparse variables, feature selection with Boruta or Lasso regularization for relevance, feature encoding (one-hot, ordinal, label), and removing highly correlated features to reduce multicollinearity. Handling missing values involves imputing or, if appropriate, removing missing data points to preserve dataset integrity, minimizing biases from incomplete entries (Luengo et al., 2020).

Moreover, splitting the data into training and test sets provides a framework for evaluating model performance, ensuring that the ensemble generalizes well across unseen data. Here, a carefully designed resampling strategy enhances model robustness by creating training subsets that reduce overfitting, which is particularly important in ensemble models that combine predictions from diverse learners. Finally, managing imbalanced data ensures that classes in the target variable are proportionally represented, typically through techniques like the Synthetic Minority Over-sampling Technique (SMOTE) or under-sampling, which prevents dominant classes from skewing model predictions. Together, these preprocessing steps create a balanced, well-prepared dataset, critical for building an effective stacked ensemble model that delivers robust predictions from large datasets like DHS data (Werner de Vargas et al., 2023).

Base learners and stacked ensemble model

In this study used a stacked ensemble machine learning model that integrates the predictions of seven distinct base learners: Naive Bayes (NB), Generalized Linear Model (GLM), Decision Tree (DT), Deep Learning (DL), Random Forest (RF), Gradient Boosting Machine (GBM), and Extreme Gradient Boosting (XGB). This ensemble approach is designed to enhance predictive performance by leveraging the unique strengths of each individual model while mitigating their weaknesses.

NB is a probabilistic classifier based on Bayes' theorem, which assumes independence among features. This model is particularly effective for text classification tasks and performs well with smaller datasets. The GLM is a flexible extension of traditional linear regression that accommodates various types of response variables through the use of different link functions. This model facilitates the interpretability of coefficients, enabling insights into the impact of predictors. A DT model structures data through a tree-like diagram, making decisions based on the most significant attribute at each node. While easy to interpret and visualize, decision trees can be prone to overfitting if not managed appropriately. DL employs neural networks with multiple layers to identify complex patterns in high-dimensional data. While deep learning excels in tasks involving images and text, it typically requires substantial datasets and computational resources. RF is an ensemble model that constructs multiple decision trees using random subsets of the training data and features. This method enhances predictive accuracy by reducing overfitting through the aggregation of predictions from individual trees. GBM is an ensemble learning method that builds models sequentially, primarily used for regression and classification tasks. It combines multiple weak learners, typically decision trees, to create a strong predictive model by optimizing the model's weights based on the errors of previous iterations. This approach enhances accuracy and reduces prediction errors over time. XGoost is a powerful boosting algorithm that builds models sequentially, with each new model addressing the errors made by its predecessor. Renowned for its efficiency and accuracy, XGBoost has demonstrated exceptional performance in various machine learning model comparisons (Shetty and Whitfield, 2023).

Stacking process

The process of stacking involves several key steps. First, each of the seven base learners are trained on the same training dataset. This initial training phase allows each model to learn from the data, capturing different patterns and relationships. The predictions generated by these models are stored for subsequent use, forming a diverse set of outputs. The next step involves creating a new feature set based on the predictions from the base learners. While some R packages (like h2o) provide a built-in mechanisms to automatically create meta-features as part of their stacking implementations, others (like caret and mlr3) do not. In these cases, predictions must be manually extracted and assembled from base learners into a new feature set for the meta-learner. In this study, the “h2o.stackedEnsemble” function of the “h2o” package automatically generates these predictions from the trained base learners, creating what is often referred to as meta-features. When cross-validation is used during the training of base learners, h2o ensures that the meta-features are based on out-of-fold predictions to prevent data leakage, thereby maintaining the integrity of the validation process.

Next, a meta-learner algorithm, which can be another model such as logistic regression, random forest, or eXtream gradient boosting, is then selected and trained on the meta-features. During this phase, the meta-learner learns how to optimally combine the outputs of the base learners to produce a final prediction. The choice of meta-learner can influence the performance of the ensemble, as it determines how to weigh the contributions from each base model. Lastly, the trained meta-learner produces the final predictions for unseen data. When new input data is provided, the base learners generate their respective predictions, which are then fed into the meta-learner. The meta-learner synthesizes these inputs, producing a robust final prediction that combines the insights gained from all base learners (Verma et al., 2024).

Stacked ensemble model offers numerous advantages, including improved performance through the integration of diverse model strengths, which enhances predictive accuracy and robustness. By leveraging multiple algorithms, the ensemble approach reduces the risk of overfitting, as it combines the strengths of various models while minimizing their individual weaknesses. Additionally, the flexibility of the stacking framework allows for the inclusion of various types of base learners, enabling researchers to tailor their ensemble to the specific characteristics of their dataset and the nature of their predictive task. Overall, stacked ensembles represent a powerful and versatile strategy for achieving superior performance in machine learning applications (Dey and Mathur, 2023). An architecture representing the model building process is presented in Figure 1.

Figure 1

Figure 1. Model-building architecture for a stacked ensemble machine learning model for the prediction of pentavalent 3 vaccination dropout in East Africa.

Hyperparameter optimization (HPO) and performance metrics

The hyperparameter optimization (HPO) process for a stacked model with seven base learners involves multiple stages, each aimed at optimizing the performance of the model ensemble. First, individual base learners are trained and tuned separately. Each learner has its own set of hyperparameters, such as learning rate, regularization strength, number of estimators, and depth of trees in case of tree-based algorithms.

This study experimented with grid search HPO and systematic random search HPO, both combined with cross-validation, to compare and identify the optimal hyperparameters for each base learner. Cross-validation helps prevent overfitting, ensuring that the selected hyperparameters generalize well to unseen data.

After tuning the base learners, their predictions on the training data set are used as input features for the second layer of the stacked model, typically referred to as the meta-learner. The meta-learner is another machine learning algorithm tasked with combining the predictions from the base learners in a way that minimizes overall prediction error. The hyperparameters of this meta-learner are crucial as well, and they require tuning similar to the base learners. This can be done independently or as part of a larger optimization process that considers both base and meta-learner hyperparameters simultaneously.

During the entire process, cross-validation is employed at different stages to prevent information leakage from the base learners to the meta-learner. This is often done using techniques like k-fold cross-validation or nested cross-validation, where the base learners are trained and tuned in the inner loop, while the meta-learner is trained and tuned in the outer loop. Once the final hyperparameters for both the base learners and the meta-learner are identified, the model is retrained on the full training dataset using the optimized settings. The final stacked model is then evaluated on a held-out test set to ensure robust performance. Hyperparameter tuning for stacked models, while complex, aims to exploit the strengths of each base learner and combine them in a way that maximizes predictive accuracy while minimizing overfitting (Zhang et al., 2023).

To evaluate the performance of the stacked ensemble model, several metrics were employed, including accuracy, precision, recall, area under the receiver operating characteristic curve (AUC-ROC), and F1 score. Accuracy measures the proportion of true positive and true negative predictions among the total number of instances examined, providing a general indication of model effectiveness. However, it may not adequately reflect performance in imbalanced datasets, prompting the use of additional metrics.

Precision, also known as positive predictive value, quantifies the number of true positive predictions divided by the sum of true positives and false positives. This metric highlights the model's ability to correctly identify positive instances, which is particularly important in contexts where the cost of false positives is high. Recall, or sensitivity, measures the proportion of true positive predictions relative to the total number of actual positives, thus providing insight into the model's effectiveness in capturing all relevant cases. The AUC-ROC metric assesses the model's ability to distinguish between positive and negative classes across various threshold settings, summarizing its performance in terms of both sensitivity and specificity. Finally, the F1 score is the harmonic mean of precision and recall, offering a balance between the two metrics and providing a single measure to evaluate model performance, particularly in scenarios where false negatives and false positives carry different implications. Together, these metrics provide a comprehensive evaluation of the model's predictive performance, facilitating comparisons with other modeling approaches in the study (Rainio et al., 2024).

An experimental design process was used, involving a structured approach to ensure high internal validity. Experimental study designs follow a set of established rules and techniques specifically created for conducting scientific research.

Model interpretability

The interpretability process for a stacked ensemble model with multiple base learners focuses on understanding the contributions of individual models and how their predictions are integrated by the meta-learner. A crucial component of this interpretative framework is the application of SHAP (Shapley Additive Explanations) values, particularly for the best-performing model within the ensemble. By utilizing SHAP values, researchers can gain insights into the specific contributions of each feature to the predictions made by this top-performing model. This method quantifies the importance of features while accounting for interactions, thus allowing for the identification of influential variables that drive the model's decisions. This analysis provides a clear understanding of the decision-making processes within the best-performing learner, laying a solid groundwork for interpreting the ensemble's overall effectiveness.

SHAP values can be employed at this level to elucidate how much the top-performing model's predictions contribute to the overall ensemble output, providing insights into the dynamics of the stacked model. While SHAP serves as the primary interpretability tool, additional methods such as integrated gradients and permutation feature importance can complement this analysis by offering broader perspectives on feature influence across the ensemble. Local interpretability methods, like LIME (Local Interpretable Model-agnostic Explanations), can further clarify individual predictions by approximating the model's behavior around specific instances. By emphasizing SHAP values for the best-performing model within this interpretative framework, the study enhances the transparency and trustworthiness of the stacked ensemble model's predictive capabilities, facilitating a comprehensive understanding of the underlying mechanisms driving its decisions (Baptista et al., 2022).

Ethical approval

This study utilized secondary data obtained from the Demographic and Health Surveys (DHS), which are publicly available datasets. As this research involved only anonymized data with no direct interaction with participants, further ethical review was not required. All analyses were conducted in compliance with ethical standards for research involving secondary data.

Results

Baseline characteristic

After the DHS data from the 10 East African countries was cleaned and merged, there were a total of 61,714 instances. Of those, 9,206 (14.9%) failed to complete (had dropped out of) the Penta3 vaccination. Among the 52,508 children who completed their vaccination, 75.2% of those living in rural areas completed the vaccination compared to 24.8% in urban areas. Children whose mothers had no education had a lower completion rate (18.1%) and a higher dropout rate (25.3%) compared to those with primary education (49.5% completed, 47.8% dropped out). Wealthier families had a higher vaccination completion rate (37.2%) compared to poorer households, where 43.7% of children dropped out. Mothers whose babies were delivered at government health facilities had a much higher vaccination completion rate (78.3%) compared to home deliveries (17.0%) (Supplementary Table 2).

Furthermore, children whose mothers were aged 20–29 had the highest Penta3 vaccination completion rates, with 26.2% for those aged 20–24 and 25.6% for those aged 25–29. The dropout rates were also higher in these age groups, at 28.2% and 24.1%, respectively. Among countries, Kenya had the highest completion rate at 16.3%, while Uganda showed the highest dropout rate at 18.9%. Moreover, mothers with higher education levels (secondary and above) had notably better completion rates (26.7% and 5.6%, respectively), reflecting a clear association between maternal education and vaccination completion (Figure 2 and Supplementary Figure 1).

Figure 2

Figure 2. Distribution of samples from each East-African Country: Penta3 vaccination dropout prediction.

Data preprocessing results

The dataset used in this study comprised a range of socio-economic, demographic, and health-related variables, which were initially subjected to extensive preprocessing steps to ensure suitability for machine learning modeling. Given the varied nature of the data, several stages of preprocessing were carried out to handle missing values, and categorical features. This was crucial to improve the quality of the data and ensure that it could effectively be used to build the stacked ensemble model.

First, the dataset was extracted from the DHS (Demographic and Health Survey) database, encompassing data from multiple East African countries. Two features (i.e., husband's educational status and husband's occupational status) had low levels of missing data, with 3.7% and 2.9%, respectively, while husband's age had a moderate level of missingness at 9.7%. Consequently, mode imputation was applied to the first two features. The most frequent category (mode) for each feature was identified and used to replace the missing values. For the feature “husband's age,” the pattern of missingness was analyzed and found to be Missing at Random (MAR). Given this missingness pattern, the recommended model-based imputation method (i.e., KNN) was performed using the KNN function of the VIM package. This ensured that the dataset retained as much information as possible without introducing distortions caused by missing data.

Once the missing values were addressed, feature engineering was performed. Derived features were created to capture complex interactions between variables. The media exposure variable was constructed by integrating the frequency of radio listening, newspaper reading, television viewing, and internet usage. The healthcare access feature was created based on whether the mother faces challenges in obtaining permission from her husband to visit a hospital, securing the funds required for treatment, overcoming the distance to the healthcare facility, or if attending the facility alone poses a difficulty.

Experimental results and comparisons

We performed different experiments and comparisons for both Boruta and LASSO feature selection methods with performance metrics obtained from Random Search HPO and Grid Search HPO.

Experiment I: feature selection

For experimentation purposes, both the LASSO (Least Absolute Shrinkage and Selection Operator) and the Boruta algorithm were used to identify and select the most relevant features from the dataset. For LASSO, since it requires numeric features, a matrix of encoded features and the target variable was constructed from the data, with the intercept column removed. Cross-validation was used to train the LASSO model and determine the optimal lambda (regularization parameter). The final LASSO model, fitted with the optimal lambda, identified important features with non-zero coefficients, while 26 features were eliminated. Features with both positive (direct relationship) and negative (inverse relationship) LASSO coefficients were considered important, as displayed in Figure 3A.

Figure 3

Figure 3. (A, B) LASSO and Boruta feature selections for a stacked ensemble machine learning model for the prediction of pentavalent 3 vaccination dropout in East Africa.

Unlike LASSO, which operates strictly on numerical features, Boruta does not require one-hot encoding for categorical variables. Therefore, it was performed before encoding to keep the feature space manageable and preserve the original relationships. As a result, the Boruta algorithm deemed eight features irrelevant and rejected them from further analysis. This process reduced the total number of features from 24 to 16. The results of the feature selection process by Boruta, including the importance of each variable and the rejection of the eight aforementioned features, are presented in Figure 3B.

Before model building, all categorical features deemed important by Boruta were transformed using one-hot encoding, while ordinal encoding was applied to “Educational Status of the Mother and Husband” to facilitate integration into machine learning models, which typically require numerical input. This transformation increased the number of features in the dataset from 16 to 63. Subsequently, highly correlated features were identified and removed using the “findCorrelation” function, which further reduced the number of features to 51. These preprocessing steps ensured that the categorical data were effectively utilized, enhancing model performance without compromising generalizability. The correlation matrix is illustrated on Supplementary Figure 2.

Experiment II: model building and optimal hyperparameters

Model building

To build the models, the dataset was divided into a training set comprising 70% of the data (43,199 instances) and a test set comprising 30% (18,515 instances). Class imbalance was a significant concern, as the dropout rate from the Penta3 vaccine fell well below the threshold for a balanced dataset, where the minority class constitutes < 30%−40% of the total. The imbalance ratio in the dataset was 5.71:1, indicating that for every instance of a child who dropped out (class 1), there were ~5.71 instances of children who completed their vaccination (class 0). This imbalance posed a risk of biased predictions favoring the majority class. To mitigate this issue, the Synthetic Minority Over-sampling Technique (SMOTE) was exclusively applied to the training set to generate synthetic samples for the minority class (children who dropped out of the Penta3 vaccination), thereby ensuring that no data leakage occurred during model evaluation. The SMOTE parameters, including the number of nearest neighbors (K = 5) and the duplication size (dup_size = 0), were set to their default values in the SMOTE function.

This technique effectively balanced the dataset, ensuring that the predictive models will not be biased toward the majority class while preserving the characteristics of the minority class and minimizing the risk of overfitting. The distribution of the target variable before and after the application of SMOTE is presented in Supplementary Figure 3.

Seven base learners were trained: Generalized Linear Model (GLM), Naive Bayes (NB), Decision Tree (DT), XGBoost (XGB), Random Forest (RF), Gradient Boosting Machine (GBM), and Deep Learning (DL). Each model was individually optimized using 10-fold cross-validation to prevent overfitting and ensure generalizability. The cross-validation was performed using stratified folds for all models except the GLM, which used modulo fold assignment. This approach preserved class balance across folds and ensured that the models generalized well to unseen data.

After training the base learners, their predictions were combined using a meta-learner in a stacked ensemble approach. Three different meta-learners were tested: XGBoost, Random Forest, and GLM. For each stacked ensemble, out-of-fold predictions from the base learners were used as features for the meta-learner, preventing data leakage and ensuring robust performance.

As the stacked ensemble model inherits the nfolds from base learners, a stacked cross-validation was, by default, employed for the meta-learner training. Each base learner was trained on 90% of the training data and generated out-of-fold predictions for the remaining 10%. These out-of-fold predictions were then passed to the meta-learner for training. Since the meta-learner was trained only on predictions generated from data that were held out from the base learners' training process, this prevents any form of information leakage. The key fact here is that the meta-learner did not have access to the raw features or base learner predictions from the same folds during training. This cross-validation process was repeated for all folds, ensuring that for each instance in the dataset, the corresponding meta-learner prediction was based solely on out-of-fold base learner predictions, which preserves the integrity of the model evaluation and avoids overfitting. Furthermore, early stopping was implemented in models like XGBoost, Gradient Boosting Machine (GBM), Random Forest, and Deep Learning to halt training when the performance did not improve for 50 rounds. AUC was used as a stopping metric.

Random search hyperparameters

The same systematic random search hyperparameter values were used for both experiments with Boruta and LASSO. The Generalized Linear Model (GLM) was trained with L2 regularization (alpha = 0.1) and the “binomial” family for binary classification. The glm function of h2o is a general framework for Generalized Linear Models (GLMs), and configuring it with the “binomial” family specifies logistic regression for binary outcomes. A single decision tree was trained with ntrees = 1, a maximum depth of 30, and min_rows = 1. XGBoost was configured with 1,000 trees, a learning rate of 0.05, and a maximum depth of 3, with early stopping after 50 rounds. The Random Forest model used 500 trees, a maximum depth of 20, and a sample rate of 0.8, while the GBM was set to 500 trees, a learning rate of 0.01, a maximum depth of 7, and early stopping after 50 rounds based on AUC. Lastly, the deep learning model had two hidden layers of 100 nodes each and was trained for 10 epochs, with early stopping after 50 rounds based on AUC. This model with two hidden layers and complex configuration fits the general definition of deep learning, which involves models with more than one layer of transformation (Table 1).

Table 1

Table 1. Random search HPO values for Penta-3 vaccination dropout prediction.

Experiment III: performance metrics with Boruta FS for both HPO methods

Table 2 presents the performance metrics for seven base learners and three Stacked Ensemble Models (SEMs) using the Boruta Feature Selection method under two hyperparameter optimization (HPO) approaches, Grid Search and Random Search.

Table 2

Table 2. Performance metrics of Boruta FS with both random search and grid search HPOs: Penta 3 dropout prediction.

Among the Stacked Ensemble Models (SEMs), the Random Forest-based ensemble using Grid Search delivers an outstanding AUC of 0.872 with a high Accuracy of 0.825, making it the top performer overall. The XGBoost-based ensemble also performs well, achieving a solid F1-score of 0.804 with Random Search, showcasing its balanced precision and recall. Additionally, the GLM-based ensemble using Grid Search shows competitive results, with a respectable AUC of 0.832 and an F1-score of 0.764, demonstrating its reliability as a meta-learner. These results highlight the effectiveness of different meta-learners in stacked ensemble models, with Random Forest-based ensembles standing out as the most robust (Table 2).

Experiment IV: performance metrics with LASSO FS for both HPO methods

Performance metrics of LASSO FS with random search HPO

For the models using LASSO feature selection with Random Search hyperparameter optimization, the XGBoost-based ensemble achieves the highest overall performance, with an AUC of 0.961 and an impressive Accuracy of 0.901, highlighting its strong predictive capability. Similarly, the Random Forest-based ensemble also performs exceptionally well, with an AUC of 0.953 and an Accuracy of 0.899. Meanwhile, the GLM-based ensemble shows consistent results, with an AUC of 0.957 and an Accuracy of 0.902, proving to be a competitive option. These stacked ensembles consistently outperform the individual base learners across most metrics, particularly in terms of AUC and Accuracy (Figure 4 and Table 3).

Figure 4

Figure 4. ROC-AUCs of LASSO FS with and random search HPO: Penta 3 dropout prediction in East-Africa.

Table 3

Table 3. Performance metrics of LASSO FS with random search HPO: Penta 3 dropout prediction.

Performance metrics of LASSO FS with grid search HPO

The grid search for XGBoost evaluated 108 models, with the best model achieved using a learning rate of 0.1, max depth of 9, 200 trees, 3 minimum rows, and a sample rate of 0.8. The GBM grid search evaluated 110 models, and the best combination was obtained with a learning rate of 0.1, max depth of 10, 26 trees, 5 minimum rows, and a sample rate of 0.8. The RF grid search evaluated 181 models, with the top-performing models using a max depth of 30, 13 trees, 1 minimum row, and a sample rate of 0.8. The NB grid search, with only 3 models evaluated, showed similar performance with Laplace smoothing values of 0, 1, and 2. The GLM grid search evaluated 16 models, and the best combination was alpha = 0.0 and lambda = 0.001, with minimal variations in performance across other similar values. The deep learning grid search evaluated multiple configurations, and the best model was achieved with hidden layers of (100, 100), 10 epochs, and a learning rate of 0.01. Finally, the Decision Tree grid search identified the optimal model with a max depth of 10, 5 minimum rows, and a sample rate of 0.8 (Table 4).

Table 4

Table 4. Hyperparameter tuning results: grid search HPO parameter ranges and optimal values for Penta-3 vaccination dropout prediction.

In this final experiment using LASSO feature selection with Grid Search HPO, the XGBoost-based ensemble stands out once again with the highest AUC of 0.994 and a strong Accuracy of 0.939, demonstrating its consistent performance across experiments. The GLM-based ensemble matches its AUC of 0.990, along with a comparable Accuracy 0.936, proving its reliability and competitiveness. Meanwhile, the Random Forest-based ensemble follows closely with an AUC of 0.983 and an Accuracy of 0.935 (Figures 5, 6).

Figure 5

Figure 5. Performance metrics of LASSO FS with and grid search HPO: Penta 3 dropout prediction in East-Africa.

Figure 6

Figure 6. ROC-AUCs of LASSO FS with and grid search HPO: Penta 3 dropout prediction in East-Africa.

When compared to the previous experiments, both feature selection methods, Boruta and LASSO, yielded strong results, particularly for the stacked ensembles, which consistently outperformed the individual base learners. The Grid Search HPO method, especially in this LASSO feature selection experiment, delivered slightly better overall performance than Random Search, with noticeably higher AUC scores across all SEMs. Notably, the XGBoost and GLM-based ensembles achieved perfect alignment in key metrics here, while in previous experiments, XGBoost ensembles generally held a slight edge (Figures 5, 6).

These final results confirm that stacked ensemble models, especially those using Random Forest and XGBoost as meta-learners, deliver robust and accurate predictions, with LASSO feature selection combined with Grid Search yielding the strongest overall performance across all experiments.

Due to its overall superior performance, the XGBoost Meta-Learner Stacked Ensemble, built with LASSO-selected features and optimized through Grid Search, was selected for further analysis using advanced interpretability tools like SHAP (SHapley Additive exPlanations) bar and beeswarm plots, providing deeper insights into feature contributions and model behavior. Overall, these results demonstrate that the stacked ensembles effectively leveraged the strengths of the base learners, leading to a significant improvement in predictive performance.

Model interpretability

The output of the SHAP (SHapley Additive exPlanations) values reveals intricate insights into the attributes influencing predictions made by the best-performing Stacked Ensemble XGB-Meta learner Model. This SHAP matrix highlights the contributions of each feature to the model's output.

In analyzing the SHAP BeeSwarm plot for model interpretability, several nuances must be considered to avoid misinterpretation. Features with a mix of colors where both high (golden) and low (purple) feature values have SHAP values that vary between positive and negative impacts, suggest the presence of non-linear relationships or interactions between features. In such cases, high values of the feature can contribute positively to the model's predictions for some instances, while contributing negatively for others, indicating that the feature's influence is context-dependent. From a practical decision-making perspective, this means that interventions or policies relying on these features must be carefully tailored to specific subgroups or conditions. For example, a feature might be highly beneficial for one group but could have adverse effects or little relevance in another, suggesting the need for personalized or context-aware strategies.

Additionally, features for which data points (instances) predominantly cluster around the SHAP value of 0, with minimal horizontal spread, indicate that these features have little to no impact on most predictions. These features may be irrelevant or redundant for the model's decision-making process, though they might occasionally have a significant effect in a small subset of instances. Careful attention to such patterns ensures that only meaningful features are prioritized for interpretation or model refinement, avoiding overemphasis on features with low or highly conditional importance (Mane et al., 2024; Lundberg et al., 2020).

In terms of research direction, these findings point to areas where further investigation is needed to understand the contexts or subpopulations where these features hold more weight. This could guide future studies to explore the non-linear relationships or interactions identified through the SHAP analysis, ultimately informing both model refinement and the development of more targeted, effective interventions or policies.

Considering these facts, seven features (place of delivery, health insurance, healthcare access, type of health professional who performed the post-natal check-up, wealth index, and who decides on the healthcare need of the family) visualized in the SHAP BeeSwarm and bar plots were identified as having significant and interpretable contributions to the model's predictions. These features exhibited considerable SHAP value variability and relatively clear patterns of influence, highlighting their importance in driving the model's decisions. In contrast, the remaining features, characterized by either tight clustering around the zero SHAP value or a lack of clear directional impact, were deemed less influential. Their mixed and limited spread suggests that these features contribute minimally to the model's predictive power or have contextually dependent effects. This understanding directs the focus toward the most relevant features, thereby enhancing the interpretability and reliability of the model.

Accordingly, the best-performing stacked ensemble model predicted that children born at home have a substantial positive impact, with a SHAP value of 0.0348, indicating that home births are associated with a higher probability of Penta 3 vaccination dropout. Similarly, children of women who earns more money than the husband/father and who earns about the same amount of money exhibited notable negative impacts (SHAP value: −0.0222 and −0.0218, respectively) on Penta 3 vaccination dropout. In contrast, children of mothers with health insurance showed a negative impact, with a SHAP value of −0.0223, suggesting a lower likelihood of dropout. The SHAP values for decision on the healthcare need of the family indicate that when the child's mother decides, the likelihood of Penta 3 vaccination dropout slightly decreases (SHAP value: 0.0321). Post-natal check-ups conducted by midwifery professionals had a negative impact on dropout, suggesting that mothers who received post-natal care from these professionals have a reduced likelihood of dropout (SHAP value: −0.0327). Middle wealth index families also contributed negatively to the prediction, indicating a lower probability of Penta 3 vaccination dropout (SHAP value: −0.0223). Overall, this detailed analysis of SHAP values not only clarifies which features are driving the model's predictions but also underscores the complex interplay between socioeconomic factors, healthcare access, and demographic characteristics in determining health-related outcomes (Figures 7, 8).

Figure 7

Figure 7. SHAP Beeswarm plot of the best XGB-meta stacked ensemble model for Penta 3 vaccination dropout prediction.

Figure 8

Figure 8. SHAP bar plot of the best XGB-meta stacked ensemble model for Penta 3 vaccination dropout prediction.

Discussion

This study aimed to develop and evaluate a stacked ensemble machine learning models that can accurately predict pentavalent 3 vaccination dropout in East Africa using the Demographic and Health Surveys (DHS) dataset. The DHS provides rich, nationally representative data on health, population, and nutrition, making it an ideal source for analyzing health outcomes across diverse populations.

We evaluated various machine learning models and stacked ensemble methods using Boruta and LASSO feature selection techniques combined with Random Search and Grid Search hyperparameter optimization. The results demonstrated that Stacked Ensemble Models (SEMs) consistently outperformed individual base learners across key metrics. In Boruta feature selection experiments, the Random Forest-based SEM achieved strong performance, showcasing the significant improvement that stacked ensembles bring by combining the strengths of multiple base models.

In the LASSO feature selection experiments, the XGBoost Meta-Learner Stacked Ensemble optimized with Grid Search emerged as the top performer, consistently demonstrating high AUC, Accuracy, and F1-score. This model outperformed other base learners and SEMs, underscoring the effectiveness of XGBoost as a meta-learner in stacking. The Random Forest-based SEM and GLM-based SEM also delivered strong results, further confirming the power of stacking diverse models for improved predictive performance.

The final XGBoost Meta-Learner Stacked Ensemble, built with LASSO-selected features and optimized through Grid Search, was selected for further analysis using advanced interpretability tools like SHAP bar and beeswarm plots. These tools provided deeper insights into feature contributions and model behavior. Overall, the experiments highlight the superiority of stacked ensembles, particularly when using Grid Search for optimization and LASSO for feature selection, making the XGBoost-based SEM the optimal choice for complex machine learning tasks.

By interpreting the SHAP values of the best stacked ensemble model, we were able to assess the contribution of individual features to the model's predictions, offering transparent and interpretable insights into the drivers of vaccination dropout. Seven key features were identified as critical in explaining dropout rates: place of delivery, health insurance, healthcare access, type of health professional who performed the post-natal check-up, wealth index, and who decides on the healthcare need of the family. These findings provide valuable insights into the complex interplay of socioeconomic and healthcare-related factors influencing vaccination outcomes in East Africa, where access to healthcare remains a significant challenge.

The place of delivery emerged as a key determinant, with the model revealing a substantial positive SHAP value of 0.0348 for home births, indicating a strong association with higher Penta 3 vaccination dropout in East Africa. This finding reflects the significant challenges that arise when children are born at home, particularly in rural or underserved areas where access to healthcare services is limited. In many East African countries, home births are common due to cultural practices, geographic isolation, and lack of healthcare facilities, leading to missed opportunities for postnatal care and immunization follow-up. The absence of skilled birth attendants in home settings likely contributes to the higher dropout rates observed, as mothers may not receive timely reminders or guidance on vaccination schedules (Ntenda et al., 2022; Odiit and Amuge, 2003).

Alternative explanations for this finding could include the broader socioeconomic conditions of households opting for home births. These families may face other barriers, such as limited transportation, financial constraints, or inadequate health literacy, which further exacerbate the risk of vaccination dropout. While the SHAP value indicates a strong association between home births and dropout, it is essential to recognize that causality cannot be directly inferred. Furthermore, cultural preferences for home births, especially in rural and remote communities, may limit the effectiveness of interventions focused solely on healthcare access. Future policies should consider integrating traditional birth attendants with formal healthcare systems to ensure that newborns receive timely vaccinations, even in home birth scenarios (Mmanga et al., 2022).

The study found that the presence of health insurance was associated with a negative SHAP value of −0.0223, suggesting that children of mothers with health insurance were less likely to experience Penta 3 vaccination dropout. In East Africa, health insurance coverage remains low, but where available, it often facilitates better access to healthcare services, reducing the financial burden of medical visits and vaccinations. Mothers with insurance are more likely to visit healthcare facilities regularly, leading to better adherence to vaccination schedules. This protective effect of health insurance aligns with findings from other low- and middle-income countries, where health insurance has been shown to improve healthcare utilization and child health outcomes (Fenta et al., 2023; Smith et al., 2006).

However, the effect of health insurance might differ across countries in the region, given the varying structures and coverage of health insurance schemes. In some cases, public health insurance programs may be limited in scope, covering only certain services or medications, which could reduce their overall impact on vaccination adherence. Additionally, the availability of health insurance may be correlated with other favorable factors, such as higher income, urban residence, or education, which may also contribute to lower dropout rates. While the SHAP value underscores the importance of insurance, future studies should explore the specific mechanisms through which insurance coverage enhances vaccination rates in East Africa and identify strategies to expand coverage to underserved populations (Escobar et al., 2011; Kalies et al., 2008).

Mother's/woman's earning was another important features, indicating that mothers who earn more money or the same amount of money as their husband were less likely to have children who dropped out of the Penta 3 vaccination schedule. This finding is consistent with extensive research showing that maternal earning is a critical determinant of child health outcomes in sub-Saharan Africa. Financially secured mothers are generally more empowered to make healthcare decisions, have greater access to healthcare services, and are more likely to prioritize preventive measures like vaccinations for their children. Hence, their financial autonomy not only enhances their ability to cover direct and indirect healthcare costs but also strengthens their role in advocating for their children's wellbeing, reducing the likelihood of missed or delayed vaccinations (Yeboah et al., 2025; Bain et al., 2022).

Maternal financial independence often shifts household dynamics, allowing women greater influence over resource allocation, including healthcare spending. This leverage improves household nutrition, access to health information, and consistent healthcare-seeking behaviors. Women with independent earnings are also more knowledgeable about healthcare options and confident in engaging with providers, ensuring vaccination adherence. Beyond financial power, maternal earning reflects broader societal shifts in gender roles and equity. Policies supporting maternal employment and income generation are essential for enhancing empowerment and improving child health outcomes, particularly in vaccination programs (Zahidi et al., 2024; Fawole and Adeoye, 2015).

The SHAP value for decision-making on the healthcare needs of the family was 0.0321, indicating that when the mother is the primary decision-maker, the likelihood of Penta 3 vaccination dropout decreases. In many sub-Saharan African contexts, maternal involvement in healthcare decisions is a crucial factor influencing child health outcomes. Mothers who have the autonomy to make decisions regarding their children's health are more likely to prioritize preventive measures such as vaccinations. This aligns with existing research suggesting that maternal empowerment is key to improving child health indicators (Ozawa et al., 2016; Danchin et al., 2018).

However, decision-making power within the household is often complex. While maternal autonomy is important, other factors such as family dynamics, cultural norms, and paternal involvement can also influence vaccination adherence. In some cases, mothers may still face constraints despite being the primary decision-makers, including limited access to financial resources or healthcare services. Therefore, efforts to reduce vaccination dropout should not only empower mothers but also address broader socioeconomic and cultural barriers that may impact their ability to act on healthcare decisions. Policies that promote gender equity in healthcare decision-making, coupled with initiatives to increase maternal financial independence, can contribute to improving vaccination coverage in low-resource settings (Damnjanović et al., 2018).

Post-natal care provided by midwifery professionals had a SHAP value of −0.008, indicating a protective effect against Penta 3 vaccination dropout. In many East African countries, midwives play a critical role in maternal and child health, particularly in rural areas where doctors may be scarce. Midwives often provide essential care during and after childbirth, including counseling mothers on the importance of vaccinations. This finding aligns with evidence from other low-income regions, where midwifery-led care has been associated with improved maternal and child health outcomes (Charbit and Omrane, 2023; Schoeps et al., 2013).

However, the relatively modest SHAP value suggests that while midwifery care contributes to reducing dropout rates, it may need to be complemented by broader health system interventions. In some areas, midwives may lack sufficient training, resources, or support to deliver comprehensive postnatal care, which could limit their effectiveness in promoting vaccination adherence. Additionally, the integration of midwifery services with formal healthcare systems may vary across countries, affecting the consistency of care provided. Expanding midwifery training programs, enhancing community health outreach, and integrating vaccination services into postnatal care visits could further strengthen the role of midwives in reducing vaccination dropout (Frawley et al., 2020; Kaufman et al., 2019).

The wealth index, particularly for middle-income families, was associated with a negative SHAP value of −0.006, indicating that children from middle-income households had a lower likelihood of vaccination dropout. Wealthier families typically have better access to healthcare services, transportation, and information, all of which contribute to higher vaccination adherence. In East Africa, where wealth disparities are significant, middle- and upper-income families may have more consistent access to healthcare, enabling them to adhere to vaccination schedules more effectively (Shiferie et al., 2023; Khan and Saggurti, 2020).

However, it is important to note that wealth alone may not fully explain vaccination behavior. Cultural practices, health literacy, and social norms may also influence whether families follow through with vaccination appointments. In addition, wealthier families in rural areas may still face access challenges despite their financial resources, particularly if healthcare facilities are far away or poorly staffed. While the SHAP value highlights the protective role of wealth, it also suggests the need for targeted interventions aimed at lower-income families who are most at risk of vaccination dropout. Strengthening healthcare systems to provide equitable access regardless of socioeconomic status is essential to achieving higher vaccination coverage across all income groups (Adebowale et al., 2019; Hajizadeh, 2018).

In conclusion, the SHAP-based analysis of the stacked ensemble model revealed critical insights into the factors driving Penta 3 vaccination dropout in East Africa. These findings highlight the importance of addressing both socioeconomic and healthcare access barriers in improving vaccination adherence in the region. While the machine learning model provided valuable predictive insights, further research and targeted interventions are necessary to address the multifaceted challenges faced by vulnerable populations in East Africa.

Limitations and strengths

Despite the valuable insights gained from this study, a few limitations must be acknowledged. First, the reliance on secondary data from the Demographic and Health Surveys (DHS) may restrict the depth of analysis regarding the causal relationships between the identified features and Penta 3 vaccination dropout. While DHS datasets are comprehensive and nationally representative, they are inherently cross-sectional, capturing a snapshot of health behaviors at a single point in time. This limitation constrains the ability to assess temporal dynamics or changes in vaccination behavior, which could vary significantly due to interventions, policy changes, or evolving socioeconomic conditions. Additionally, self-reported measures within the dataset may introduce biases related to recall or social desirability, particularly in sensitive areas such as healthcare access and maternal education. Such biases could obscure the true extent of the relationships between variables, potentially impacting the reliability of the findings.

On the other hand, this study benefits from several notable strengths that enhance its contribution to understanding vaccination dropout in East Africa. The use of a stacked ensemble machine learning model, particularly with the XGBoost meta-learner, allowed for improved predictive performance and interpretability compared to traditional modeling approaches. By integrating multiple base learners, the study effectively leveraged the strengths of different algorithms, mitigating the limitations inherent to individual models. Moreover, the application of SHAP values provided a robust framework for understanding feature importance, offering nuanced insights into how various socioeconomic and healthcare factors influence vaccination outcomes. The focus on a relevant and rich dataset, coupled with the application of advanced analytical techniques and comparisons, make this study as a significant addition to the literature on child health and immunization, informing policymakers and practitioners about critical intervention points to reduce vaccination dropout rates in the region.

Conclusion and recommendation

The stacked ensemble model has elucidated key attributes contributing to Penta 3 vaccination dropout among children in East Africa. Notably, features such as place of delivery, health insurance, maternal earning, healthcare need decision maker, post-natal care by midwifery professionals, and wealth index emerged as significant predictors of vaccination adherence. These findings underscore the complex interplay between socioeconomic determinants and healthcare accessibility, revealing that children born at home and those from families with limited education and media exposure are at a higher risk of vaccination dropout. While the results provide valuable insights into the challenges of maintaining vaccination coverage, they also emphasize the necessity for targeted strategies that address these multifaceted issues to improve health outcomes in vulnerable populations.

Based on the findings of this study, it is essential to develop and implement comprehensive public health strategies aimed at increasing Penta 3 vaccination coverage in East Africa. Interventions should focus on enhancing healthcare access, particularly for families residing in rural areas where home births are prevalent. Additionally, public health campaigns should prioritize improving maternal education and media exposure, which have been shown to significantly influence vaccination rates. Strengthening the role of midwifery professionals in post-natal care could further mitigate dropout risks by providing mothers with essential information and support regarding vaccination. Finally, policymakers should consider integrating socioeconomic factors into health planning, ensuring that programs are tailored to address the specific needs of lower-income families to promote equity in healthcare access and utilization.

Data availability statement

Publicly available datasets were analyzed in this study. This data can be found here: https://www.dhsprogram.com/data.

Ethics statement

Ethical approval was not required for the studies involving humans because this study was done based on secondary data analysis, and permission was obtained from the MEASURE DHS program to download and use the data for our study purposes. Hence, ethical approval and participants' consent do not apply to this particular study. The dataset is publicly available in the official database of the MEASURE DHS program with no information for the identification of households. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

MA: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. SK: Data curation, Validation, Visualization, Writing – original draft, Writing – review & editing. AW: Data curation, Validation, Visualization, Writing – original draft, Writing – review & editing. DM: Data curation, Validation, Visualization, Writing – original draft, Writing – review & editing. EE: Data curation, Validation, Visualization, Writing – original draft, Writing – review & editing. JA: Data curation, Validation, Visualization, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research and/or publication of this article.

Acknowledgments

We would like to thank the MEASURE DHS program for providing us with the data for further analysis.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Gen AI was used in the creation of this manuscript.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fdata.2025.1522578/full#supplementary-material

Abbreviations

AUC, Area Under the Curve; AUC-ROC, Area Under the Receiver Operating Characteristic; Boruta, Boruta Algorithm; DHS, Demographic and Health Survey; DPT-HepB-Hib, Diphtheria-Hepatitis B and Haemophilus Influenza; EDHS, Ethiopian Demographic and Health Survey; EPI, Expanded Program of Immunization; F1 Score, F1 Score (harmonic mean of precision and recall); GLM, Generalized Linear Model; ML, Machine Learning; PCA, Principal Component Analysis; Penta 3, Pentavalent 3 Vaccination; RF, Random Forest; ROC, Receiver Operating Characteristic; SEL, Stacked Ensemble Learner; SHAP, Shapley Additive Explanations; SMOTE, Synthetic Minority Over-sampling Technique; WHO, World Health Organization; XGBoost, eXtreme Gradient Boosting.

References

Adebowale, A., Obembe, T., and Bamgboye, E. (2019). Relationship between household wealth and childhood immunization in core-North Nigeria. Afr. Health Sci. 19, 1582–1593. doi: 10.4314/ahs.v19i1.33

PubMed Abstract | Crossref Full Text | Google Scholar

Bain, L. E., Aboagye, R. G., Dowou, R. K., Kongnyuy, E. J., Memiah, P., Amu, H., et al. (2022). Prevalence and determinants of maternal healthcare utilisation among young women in sub-Saharan Africa: cross-sectional analyses of demographic and health survey data. BMC Public Health 22:647. doi: 10.1186/s12889-022-13037-8

PubMed Abstract | Crossref Full Text | Google Scholar

Baptista, M. L., Goebel, K., and Henriques, E. M. (2022). Relation between prognostics predictor evaluation metrics and local interpretability SHAP values. Artif. Intell. 306:103667. doi: 10.1016/j.artint.2022.103667

Crossref Full Text | Google Scholar

Chandir, S., Siddiqi, D., Hussain, O., Niazi, T., Shah, M., Dharma, V., et al. (2018). Using predictive analytics to identify children at high risk of defaulting from a routine immunization program: feasibility study. JMIR Public Health Surveill. 4:e63. doi: 10.2196/publichealth.9681

PubMed Abstract | Crossref Full Text | Google Scholar

Charbit, Y., and Omrane, M. (2023). “Postnatal care and immunization of children,” in Gender Inequalities and Vulnerability of Sub-Saharan Adolescents: The Case of Benin, eds. Y. Charbit and M. Omrane (Cham: Springer Nature Switzerland), 171–179. doi: 10.1007/978-3-031-38096-9_13

Crossref Full Text | Google Scholar

Damnjanović, K., Graeber, J., Ilić, S., Lam, W. Y., Lep, Ž., Morales, S., et al. (2018). Parental decision-making on childhood vaccination. Front. Psychol. 9:735. doi: 10.3389/fpsyg.2018.00735

PubMed Abstract | Crossref Full Text | Google Scholar

Danchin, M. H., Costa-Pinto, J., Attwell, K., Willaby, H., Wiley, K., Hoq, M., et al. (2018). Vaccine decision-making begins in pregnancy: correlation between vaccine concerns, intentions and maternal vaccination with subsequent childhood vaccine uptake. Vaccine 36, 6473–6479. doi: 10.1016/j.vaccine.2017.08.003

PubMed Abstract | Crossref Full Text | Google Scholar

Demsash, A. W., Chereka, A. A., Walle, A. D., Kassie, S. Y., Bekele, F., Bekana, T., et al. (2023). Machine learning algorithms' application to predict childhood vaccination among children aged 12–23 months in Ethiopia: evidence 2016 Ethiopian Demographic and Health Survey dataset. PLoS ONE 18:e0288867. doi: 10.1371/journal.pone.0288867

PubMed Abstract | Crossref Full Text | Google Scholar

Dey, R., and Mathur, R. (2023). “Ensemble learning method using stacking with base learner, a comparison,” in Proceedings of International Conference on Data Analytics and Insights, ICDAI 2023. Lecture Notes in Networks and Systems, eds. N. Chaki, N. D. Roy, P. Debnath, and K. Saeed (Singapore: Springer), 159–169.

Google Scholar

Escobar, M. L., Griffin, C. C., and Shaw, R. P., (eds.). (2011). The Impact of Health Insurance in Low-and Middle-Income Countries. Lanham, MD: Rowman and Littlefield.

Google Scholar

Fawole, O. I., and Adeoye, I. A. (2015). Women's status within the household as a determinant of maternal health care use in Nigeria. Afr. Health Sci. 15, 217–225. doi: 10.4314/ahs.v15i1.28

PubMed Abstract | Crossref Full Text | Google Scholar

Fenta, S. M., Biresaw, H. B., Fekadu, G. A., Teshome, D. F., and Tadesse, A. W. (2023). Factors associated with pentavalent vaccine coverage among 12-23 months old children in Ethiopia: a multilevel analysis. PLoS ONE 18:e0289744. doi: 10.1371/journal.pone.0289744

PubMed Abstract | Crossref Full Text | Google Scholar

Frawley, J. E., McKenzie, K., Cummins, A., Sinclair, L., Wardle, J., Hall, H., et al. (2020). Midwives' role in the provision of maternal and childhood immunisation information. Women Birth 33, 145–152. doi: 10.1016/j.wombi.2019.02.006

PubMed Abstract | Crossref Full Text | Google Scholar

Hajizadeh, M. (2018). Socioeconomic inequalities in child vaccination in low/middle-income countries: what accounts for the differences?. J. Epidemiol. Commun. Health 72, 719–725. doi: 10.1136/jech-2017-210296

PubMed Abstract | Crossref Full Text | Google Scholar

Hu, Z., Qiu, H., Su, Z., Shen, M., and Chen, Z. (2020). A stacking ensemble model to predict daily number of hospital admissions for cardiovascular diseases. IEEE Access. 8, 138719–138729. doi: 10.1109/ACCESS.2020.3012143

Crossref Full Text | Google Scholar

Kalies, H., Redel, R., Varga, R., Tauscher, M., and von Kries, R. (2008). Vaccination coverage in children can be estimated from health insurance data. BMC Public Health 8, 1–6. doi: 10.1186/1471-2458-8-82

PubMed Abstract | Crossref Full Text | Google Scholar

Kaufman, J., Attwell, K., Hauck, Y., Omer, S. B., and Danchin, M. (2019). Vaccine discussions in pregnancy: interviews with midwives to inform design of an intervention to promote uptake of maternal and childhood vaccines. Hum. Vaccines Immunother. 15, 2534–2543. doi: 10.1080/21645515.2019.1607131

PubMed Abstract | Crossref Full Text | Google Scholar

Kayembe-Ntumba, H. C., Vangola, F., Ansobi, P., Kapour, G., Bokabo, E., Mandja, B. A., et al. (2022). Vaccination dropout rates among children aged 12-23 months in Democratic Republic of the Congo: a cross-sectional study. Arch. Public Health 80:18. doi: 10.1186/s13690-021-00782-2

PubMed Abstract | Crossref Full Text | Google Scholar

Khan, N., and Saggurti, N. (2020). Socioeconomic inequality trends in childhood vaccination coverage in India: findings from multiple rounds of National Family Health Survey. Vaccine 38, 4088–4103. doi: 10.1016/j.vaccine.2020.04.023

PubMed Abstract | Crossref Full Text | Google Scholar

Luengo, J., García-Gil, D., Ramírez-Gallego, S., García, S., and Herrera, F. (2020). Big Data Preprocessing. Cham: Springer. doi: 10.1007/978-3-030-39105-8

Crossref Full Text | Google Scholar

Lundberg, S. M., Erion, G., Chen, H., DeGrave, A., Prutkin, J. M., Nair, B., et al. (2020). From local explanations to global understanding with explainable AI for trees. Nat. Mach. Intell. 2, 56–67. doi: 10.1038/s42256-019-0138-9

PubMed Abstract | Crossref Full Text | Google Scholar

Mane, D., Magar, A., Khode, O., Koli, S., Bhat, K., Korade, P., et al. (2024). Unlocking machine learning model decisions: a comparative analysis of LIME and SHAP for enhanced interpretability. J. Electr. Syst. 20, 1252–1267. doi: 10.52783/jes.1768

Crossref Full Text | Google Scholar

Mmanga, K., Mwenyenkulu, T. E., Nkoka, O., and Ntenda, P. A. M. (2022). Tracking immunization coverage, dropout and equity gaps among children ages 12–23 months in Malawi – bottleneck analysis of the Malawi Demographic and Health Survey. Int. Health 14, 250–259. doi: 10.1093/inthealth/ihab038

PubMed Abstract | Crossref Full Text | Google Scholar

Ntenda, P. A. M., Sixpence, A., Mwenyenkulu, T. E., Mmanga, K., Chirambo, A. C., Bauleni, A., et al. (2022). Determinants of pentavalent and measles vaccination dropouts among children aged 12–23 months in the Gambia. BMC Public Health 22:520. doi: 10.1186/s12889-022-12914-6

PubMed Abstract | Crossref Full Text | Google Scholar

Nwachukwu, R. C. (2024). Machine learning applications in rural healthcare: predictive modeling for immunization completion rates in Ogun State, Nigeria. Open J. Phys. Sci. 5, 32–44. doi: 10.52417/ojps.v5i2.722

Crossref Full Text | Google Scholar

Odiit, A., and Amuge, B. (2003). Comparison of vaccination status of children born in health units and those born at home. East Afr. Med. J. 80, 3–6. doi: 10.4314/eamj.v80i1.8658

PubMed Abstract | Crossref Full Text | Google Scholar

Ozawa, S., Clark, S., Portnoy, A., Grewal, S., Brenzel, L., Walker, D. G., et al. (2016). Return on investment from childhood immunization in low- and middle-income countries, 2011–20. Health Aff. 35, 199–207. doi: 10.1377/hlthaff.2015.1086

PubMed Abstract | Crossref Full Text | Google Scholar

Rainio, O., Teuho, J., and Klén, R. (2024). Evaluation metrics and statistical tests for machine learning. Sci. Rep. 14:6086. doi: 10.1038/s41598-024-56706-x

PubMed Abstract | Crossref Full Text | Google Scholar

Schoeps, A., Ouédraogo, N., Kagone, M., Sie, A., Müller, O., Becher, H., et al. (2013). Socio-demographic determinants of timely adherence to BCG, Penta3, measles, and complete vaccination schedule in Burkina Faso. Vaccine 32, 96–102. doi: 10.1016/j.vaccine.2013.10.063

PubMed Abstract | Crossref Full Text | Google Scholar

Shetty, B., and Whitfield, B. (2023). 5 Classification algorithms for machine learning. Built In. Available online at: https://builtin.com/data-science/supervised-machine-learning-classification (accessed November 07, 2024).

Google Scholar

Shiferie, F., Gebremedhin, S., Andargie, G., Tsegaye, D. A., Alemayehu, W. A., Mekuria, L. A., et al. (2023). Vaccination dropout and wealth related inequality among children aged 12–35 months in remote and underserved settings of Ethiopia: a cross-sectional evaluation survey. Front. Pediatr. 11:1280746. doi: 10.3389/fped.2023.1280746

PubMed Abstract | Crossref Full Text | Google Scholar

Smith, P. J., Stevenson, J., and Chu, S. Y. (2006). Associations between childhood vaccination coverage, insurance type, and breaks in health insurance coverage. Pediatrics 117, 1972–1978. doi: 10.1542/peds.2005-2414

PubMed Abstract | Crossref Full Text | Google Scholar

The DHS Program (2024). Demographic and Health Surveys (DHS). Available online at: https://dhsprogram.com/ (accessed November 13, 2024).

Google Scholar

Tsegaw, T. K., Alemaw, H. B., Wale, Y. B., Nigatu, S. G., Birhan, T. Y., and Taddese, A. A. (2024). Incomplete immunization uptake and associated factors among children aged 12–23 months in Sub-Saharan African countries; multilevel analysis evidenced from latest demography and health survey data, 2023. Ital. J. Pediatr. 50:96. doi: 10.1186/s13052-024-01642-9

PubMed Abstract | Crossref Full Text | Google Scholar

Verma, S., Dhir, R., Kumar, M., and Gupta, M. (2024). “Digital healthcare system using stacked ensemble machine learning model,” in Data Science and Artificial Intelligence for Digital Healthcare (Cham: Springer International Publishing), 109. doi: 10.1007/978-3-031-56818-3_7

Crossref Full Text | Google Scholar

Werner de Vargas, V., Schneider Aranda, J. A., dos Santos Costa, R., da Silva Pereira, P. R., and Victória Barbosa, J. L. (2023). Imbalanced data preprocessing techniques for machine learning: a systematic mapping study. Knowl. Inf. Syst. 65, 31–57. doi: 10.1007/s10115-022-01772-8

PubMed Abstract | Crossref Full Text | Google Scholar

World Health Organization (2020). Immunization Agenda 2030: A Global Strategy to Leave No One Behind. Available online at: https://www.who.int/teams/immunization-vaccines-and-biologicals/strategies/ia2030 (accessed November 27, 2024).

Google Scholar

World Health Organization (2023). Status of Immunization Coverage in Africa as of the End of 2022. WHO African Region. Available online at: https://www.afro.who.int/sites/default/files/2023-10/Status%20of%20immunization%20coverage_final-compressed_compressed.pdf (accessed November 27, 2024).

Google Scholar

World Health Organization (2024). Immunization Coverage. Available online at: https://www.who.int/news-room/fact-sheets/detail/immunization-coverage (accessed November 27, 2024).

Google Scholar

Yeboah, H., Omonaiye, O., and Yaya, S. (2025). The impact of economic growth and recessions on maternal and child health outcomes in Sub-Saharan African countries: a systematic literature review. Reprod. Health 22:35. doi: 10.1186/s12978-025-01973-8

PubMed Abstract | Crossref Full Text | Google Scholar

Zahidi, K., Obtel, M., and Naji, S. (2024). “Healthcare accessibility: metrics, assessment, policies, and barriers,” in Economics of Healthcare, Studies and Cases, ed. A. I. Tavares (IntechOpen). doi: 10.5772/intechopen.1006821

Crossref Full Text | Google Scholar

Zhang, Y., Zhang, Z., and Wang, S. (2023). Hyperparameter optimization in stacking ensemble models for predicting financial distress. Expert Syst. Appl. 206:117857.

Google Scholar

Keywords: Penta 3 vaccination, stacked ensemble model, SHAP values, East Africa, machine learning

Citation: Alemayehu MA, Kebede SD, Walle AD, Mamo DN, Enyew EB and Adem JB (2025) A stacked ensemble machine learning model for the prediction of pentavalent 3 vaccination dropout in East Africa. Front. Big Data 8:1522578. doi: 10.3389/fdata.2025.1522578

Received: 11 December 2024; Accepted: 24 March 2025;
Published: 07 April 2025.

Edited by:

Alessandro Bria, University of Cassino, Italy

Reviewed by:

Vibhuti Gupta, Meharry Medical College, United States
Jayakumar Kaliappan, Vellore Institute of Technology (VIT), India

Copyright © 2025 Alemayehu, Kebede, Walle, Mamo, Enyew and Adem. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Meron Asmamaw Alemayehu, bWVycnlhbGVtMTAxQGdtYWlsLmNvbQ==; TWVyb24uQXNtYW1hd0B1b2cuZWR1LmV0

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

A stacked ensemble machine learning model for the prediction of pentavalent 3 vaccination dropout in East Africa

Introduction

Related works

Methods

Data source

Modeling software and packages

Study variables

Target

Features

Data preprocessing

Base learners and stacked ensemble model

Stacking process

Hyperparameter optimization (HPO) and performance metrics

Model interpretability

Ethical approval

Results

Baseline characteristic

Data preprocessing results

Experimental results and comparisons

Experiment I: feature selection

Experiment II: model building and optimal hyperparameters

Model building

Random search hyperparameters

Experiment III: performance metrics with Boruta FS for both HPO methods

Experiment IV: performance metrics with LASSO FS for both HPO methods

Performance metrics of LASSO FS with random search HPO

Performance metrics of LASSO FS with grid search HPO

Model interpretability

Discussion

Limitations and strengths

Conclusion and recommendation

Data availability statement

Ethics statement

Author contributions

Funding

Acknowledgments

Conflict of interest

Generative AI statement

Publisher's note

Supplementary material

Abbreviations

References

95% of researchers rate our articles as excellent or good