Skip to main content

ORIGINAL RESEARCH article

Front. Nutr., 10 May 2022
Sec. Nutrition and Sustainable Diets
This article is part of the Research Topic Innovation and Trends in the Global Food Systems, Dietary Patterns and Healthy Sustainable Lifestyle in the Digital Age View all 16 articles

Cluster Analysis and Classification Model of Nutritional Anemia Associated Risk Factors Among Palestinian Schoolchildren, 2014

\nRadwan Qasrawi,
Radwan Qasrawi1,2*Diala Abu Al-Halawa
Diala Abu Al-Halawa3*
  • 1Department of Computer Science, Al-Quds University, Jerusalem, Palestine
  • 2Department of Computer Engineering, Istinye University, Istanbul, Turkey
  • 3Medical Faculty, Al-Quds University, Jerusalem, Palestine

Nutritional inadequacy has been a major health problem worldwide. One of the many health problems that result from it is anemia. Anemia is considered a health concern among all ages, particularly children, as it has been associated with cognitive and developmental delays. Researchers have investigated the association between nutritional deficiencies and anemia through various methods. As novel analytical methods are needed to ascertain the association and reveal indirect ones, we aimed to classify nutritional anemia using the cluster analysis approach. In this study, we included 4,762 students aged between 10 and 17 years attending public and UNRWA schools in the West Bank. Students' 24-h food recall and blood sample data were collected for nutrient intake and hemoglobin analysis. The K-means cluster analysis was used to cluster the hemoglobin levels into two groups. Vitamin B12, folate, and iron intakes were used as the indicators of nutrient intake associated with anemia and were classified as per the Recommended Dietary Allowance (RDA) values. We applied the Classification and Regression Tree (CRT) model for studying the association between hemoglobin clusters and vitamin B12, folate, and iron intakes, sociodemographic variables, and health-related risk factors, accounting for grade and age. Results indicated that 46.4% of the students were classified into the low hemoglobin cluster, and 60.7, 72.5, and 30.3% of vitamin B12, folate, and iron intakes, respectively, were below RDA. The CRT analysis indicated that vitamin B12, iron, and folate intakes are important factors related to anemia in girls associated with age, locality, food consumption patterns, and physical activity levels, while iron and folate intakes were significant factors related to anemia in boys associated with the place of residence and the educational level of their mothers. The deployment of clustering and classification techniques for identifying the association between anemia and nutritional factors might facilitate the development of nutritional anemia prevention and intervention programs that will improve the health and wellbeing of schoolchildren.

Introduction

Anemia is a worldwide public health concern, with 24.8% of the global population suffering from anemia (1). It is associated with many diseases that affect all the age groups (13). The effects of the lack of oxygen are particularly troublesome in children, as that may affect their physical growth and cognitive development (48). There are many causes of this disease, with nutritional inadequacies being a primary factor contributing to anemia (911). Various studies have associated certain nutritional insufficiencies with this disease. Iron, vitamin B12, folic acid, vitamin A, and vitamin C are some nutrients that have been linked to its development (6, 1116). The association between nutrient intake deficiencies and other health and cognitive development problems among schoolchildren has been investigated by several studies, indicating a strong correlation between nutrition and the health of children. Researchers have investigated the prevalence of nutrient deficiencies among medically documented patients with anemia, and it has become well known that iron deficiency is the most common cause of anemia (13, 1719). Ensuring adequate nutrient intake requires dietary variety in food consumption, which could be lacking in children age group (2025).

Other risk factors linked to the disease include children living conditions. Goswami et al. is one of the many researchers who found that the place of residence is a major factor associated with anemia, where rural children were more anemic than urban children (26). Other researchers have also established the linkage of parental educational level with this entity (2628). It has become clear that, as many factors interact in the development of anemia, there is a need for an in-depth study of this complex interaction, one beyond prevalence and associations.

Investigators have used many statistical methods in discerning the interaction of sociodemographic and health-related risk factors resulting in anemia. Furthermore, data mining and machine learning techniques for clustering and classifications have been improved as effective tools in the clustering and classification of risk factors associated with the health of children (29, 30). The K-means, Decision Tree, Chi-square automatic interaction detection (CHAID), the K-Nearest Neighbors' algorithm (K-NN), and Classification and Regression Trees (CRT) clustering and classification models have been used by many research studies for predicting and identifying nutrition, lifestyle, and health diseases, such as obesity, diabetes, and anemia (3134). In medical research, several ML clustering and classification, such as support vector machine, artificial neural network, and random forest techniques, are used for improving early detection and diagnosis of diseases (29, 30, 3538). Particularly in anemia, M. Visser et al. have employed factor analysis for the determination of the nutrient pattern that is most associated with anemia, accounting for differences between provinces (14). On the other hand, Sow et al. employed machine learning algorithms in the identification of associated sociodemographic risk factors for anemia development (38). A study in China employed the CHAID decision tree analysis for the identification of infant anemia-related risk factors (31).

To the best of our knowledge, few studies have employed cluster analysis for the classification of hemoglobin levels among a certain population. The present study is a part of the national nutritional formative research among schoolchildren aimed at assessing children's health, nutrition, and mental health. The main aim of this study was to derive dietary patterns related to anemia in schoolchildren and to investigate the association between hemoglobin, vitamin B12, iron, and folate intakes with other sociodemographic variables, and health-related risk factors among Palestinian schoolchildren. We used the K-means and CRT machine learning techniques in identifying schoolchildren clusters based on hemoglobin data and finding the association patterns through the classification of the associated risk factors.

Materials and Methods

Participants and Data Collection

This study utilizes data from the Health Behaviour in School-aged Children (HBSC) survey conducted by Al-Quds University and the Ministry of Health in the year 2013–2014, which aims to assess the nutritional, physical, and psychological health of Palestinian schoolchildren (34, 39, 40). The study sample included children aged 10–17 years who were enrolled in public and the United Nations Relief and Works Agency (UNRWA) for Palestinian refugee schools. A representative and clustered random sample of 5,000 students was selected from 100 schools in the West Bank and weighted for grade and age. Only the relevant variables and available data were used for this study; therefore, a final sample of 4,762 students was used for data analysis. Data collection included blood sampling, sociodemographic data, health-related practices, and 24-h dietary recalls. For hemoglobin acquisition, data were collected using the Hemoglobin (Hb) test using the HemoCue Hb 201+ analyzer. On-site 50 μl of non-fasting venous whole blood was collected for all subjects for complete blood count (CBC) analysis via nurses and doctors registered in the Ministry of Health on schools' grounds. The total Hb concentration was calculated to the nearest 0.1 g/dl. The anthropometric data were collected by the digital floor scale for measuring students' weights in kg and the portable stadiometer for measuring the height in cm. Prior to data collection, all parents or caregivers signed an informed consent form. The study received ethical approval from the Ministry of Education and Al-Quds University Institutional Review Board (IRB approval number: 05-Aug-2013-12/10).

Hematological Parameter

In this study, we acquired hemoglobin levels as an indicator of anemia. The use of hemoglobin as a single anemia parameter has been applied in various research studies (10, 38, 41). In this study, the K-means clustering method was used for identifying the hemoglobin groups (42). Using the K-means clustering method, two different hemoglobin clusters were identified. The first cluster includes the participants with average hemoglobin of <12 g/dl, the second cluster includes those ≥12 g/dl.

Health-Related Risk Factors

To measure nutrient intake, we used the 24-h food recall method. The tool gives a detailed insight into food consumption over a 24-h period (4345). The food frequency scale was developed using the eight food items scale, which was categorized based on similarity in the nutrient profile (46). These categories were as follows: (1) vegetables; (2) fruits; (3) milk and other dairy products; (4) sweets and chocolate; (5) soft drinks; (7) beverages (juices and sugar); and (8) energy drinks. Response categories were (1) never, (2) 1–2 times a week, (3) 3–4 times a week, and (4) 5–7 times a week (almost daily). The food consumption quantities were identified using the recipes from the consumption weight book developed by ANAHRI (47). The food intake was entered and analyzed using Nutribase Professional V.9, the United States Department of Agriculture (USDA), and the Palestinian Food Recipes databases (47, 48). As studies have shown that there is an association between certain nutrients with anemia, we have chosen vitamin B12, iron, and folate as key nutrients for the analysis of and association with hemoglobin clusters. The nutrient values were analyzed and compared with the USDA Recommended Dietary Allowance (RDA) values according to the ages of participants (49).

As for health-related nutritional practices, we have used the food frequency questionnaire to classify the practices of students into (healthy and unhealthy) food consumption patterns. The indicated frequency of consumption was obtained over a week's duration (1) never, (2) 1–2 times a week, (3) 3–4 times a week, and (4) 5–7 times a week (almost daily).

The healthy group included participants who were in the top 2 quartiles (+50 percentile) of intake of fruits and vegetables (indicated as the indicators of a healthy diet by the WHO) (50) and milk and had indicated that they did not eat any unhealthy item. The unhealthy group included participants who had indicated that they did not eat any healthy nutritional items and were in the top 2 quartiles of the frequency of eating sweets, soft drinks, sugary juices, and energy drinks, respectively. The score for each variable was classified into yes and no values according to quartile classification.

Taking into consideration the numerous non-dietary risk factors that may be associated with anemia, we have included data regarding the general health status of the participants. Anthropometric measurements, such as height, weight, and neck circumference, were measured by school nurses, and the body mass index [BMI = weight/(height2)] of each participant was calculated. The BMI was classified into 4 categories based on the WHO cutoff values for interpretation of BMI in 5–19-year-old children (51). Furthermore, the students were asked about their physical activity during the school days and excluding weekends. The following questions comprised the frequency of physical activities over a week duration: (1) In the last week, how many days were you physically active for more than 60 min? (2) In the last week, how many hours were you playing sports outside school? (3) In the last week, how many hours do you exercise per week? These responses were summed, and the final score was classified according to its quartiles into low (1st and 2nd quartiles), moderate (3rd quartile), and high physical activity (4th quartile), respectively. In addition, as leisure time activities were accounted for in the questionnaire, the respondents were asked about their screen behavior over school days in the past week. The questions included: (1) how many hours do you spend watching TV? (2) how many hours do you play video games? (3) how many hours do you spend using the internet? Again, the sum of these questions was used to generate a leisure time activity scale, and the quartiles were used to classify the outcome into low (1st and 2nd quartiles), moderate (3rd quartile), and high (4th quartile) leisure time activities. Moreover, students were asked about their smoking practices, and the responses were yes or no to smoking cigarettes and/or nargileh, respectively. Furthermore, as anemia could result from a multitude of chronic disease processes, we asked the students and their parents about any known chronic illnesses, in addition to checking the available school records on the chronic illnesses of students, and we ultimately excluded those students from this study.

Statistical Analysis

We employed descriptive statistics, the K-means clustering method, and the CRT technique (52). The statistical analysis was conducted using IBM Statistical Package for Social Science V21. The K-means clustering method is a type of statistical analysis that classifies the sample population into homogenous groups with different characteristics using a specific variable as the comparison criterion (42). The K-means clustering method defines the segments of a dataset and assigns each observation into a specific cluster, the algorithm identifies the smallest variation within each cluster. A non-hierarchical K-means clustering method was used to produce two hemoglobin clusters, with the random seed and 10 iterations to refine and optimize the classifications. The final clusters were selected based on interpretability and the percentage of participants in each cluster. The sociodemographic and nutrition variables were analyzed for these clusters. Chi-squared and univariate analysis tests were used to assess the differences between categorical data. Continuous data were assessed for normality and, if required, normalized with natural log transformation. Of the available decision tree models, we chose the CRT model for the identification of potential risk factors for childhood anemia, relating to each relevant nutrient of choice (vitamin B12, iron, and folate). Cases in each subgroup are further classified by the second most significant predictor. The analysis continues until the last significant risk factor is identified.

Results

Demographics and Health-Related Risk Factors

The results of the demographic characteristics and health-related risk factors are depicted in Table 1. The study sample consists of 66.9% girls and 33.1% boys, 72.6% aged 10–13 years, and 27.4% aged 14–17 years. The students were mostly from urban habitats (42.8%), but 34.6% were from rural habitats and the rest were from refugee camps. When asked about their family income, almost 45% of the participants reported they had low family income compared with the minimum wages in the West Bank, and only 19.2% had high income, respectively. Additionally, students were asked about the level of their parents' education. Over two-thirds reported that their parents have an education level of secondary school and above, 60.4% for fathers and 61.4% for mothers, respectively. The results of nutritional status assessment indicated that almost 80% were found to have normal BMI levels, while 5.1% exhibited obesity as per the BMI scale. Of the study participants, 50% reported healthy food consumption and 68% reported unhealthy food consumption. Most students reported medium or high levels of physical activity (74.3%), as well as leisure time activities (73.9%), respectively. Furthermore, about 16.5% of students were smoking cigarettes or nargileh.

TABLE 1
www.frontiersin.org

Table 1. Demographic variables of the study sample (n = 4,762).

Table 1 depicts the mean and standard deviation (SD) of hemoglobin values for each demographic variable category. There were statistically significant differences in hemoglobin values per age, gender, and locality (p 0.000, 0.000, and 0.000, respectively). The mean hemoglobin values were higher in boys than girls (13.1 and 12.9, respectively), and in the older age group (13.07). Moreover, the mean hemoglobin was higher in urban than rural and camp residents (13.1, 12.8, and 12.9), respectively.

Overall Nutritional Profile

The macronutrient and micronutrients intake analysis of 24-h food recall data is reported in Table 2. All intakes differed significantly by age and gender (p < 0.05). The results indicated that the mean energy intake was higher in boys than girls across both age groups (2494.2, 2570.6) and (2101, 1936.7), respectively. In general, boys had a higher nutrient intake of carbohydrates, proteins, fat, cholesterol, vitamin B12, iron, and folate than girls.

TABLE 2
www.frontiersin.org

Table 2. Mean (± SD) nutrient intake per 24-h by age and gender.

Table 3 reports the percentage distribution of students' vitamin B12, iron, and folate intake according to the RDA as nutritional anemia indicators. All intakes differed significantly by age and gender (p < 0.05). Both boys and girls had vitamin B12 intake lower than the RDA across both age groups, with a higher percentage in girls (52.2 and 56.8% for boys, and 60.8 and 74% for girls, respectively). Iron intake was above the RDA for all boys (84.9 and 67.7%), and younger (10–13 years) girls (71.9%), respectively. However, 65.7% of older girls (14–17 years) had iron intake below RDA. Folate intake was below RDA for boys and girls across all age groups, with a lower intake for boys (61.1 and 77.6% for boys; 71.9 and 86.4% for girls), respectively.

TABLE 3
www.frontiersin.org

Table 3. Nutrient intake levels according to the Recommended Dietary Allowance (RDA) per age and gender.

K-Means Cluster Analysis of Hemoglobin

Table 4 shows the results of the K-means clustering of the hemoglobin values of the study participants. The clustering technique resulted in two distinct hemoglobin clusters, with statistically different average hemoglobin values of 11.85 and 13.9 g/dl (p < 0.05). The clusters were identified as low and high (46.4 and 53.6%) in reference to the WHO criteria for anemia definition, for both genders (1). There were significant differences in the distribution of children across hemoglobin clusters by age, gender, and locality. The high hemoglobin cluster included 915 boys and 1,636 girls. A higher percentage of students from both age groups were classified into the high hemoglobin cluster (52.2 and 57.2%). Furthermore, more students from urban and camps residences were classified into the high hemoglobin cluster (57.5 and 51.8%), while 50.8% of rural residents were in the low hemoglobin cluster.

TABLE 4
www.frontiersin.org

Table 4. Distribution of study sample by hemoglobin clusters.

Nutrition Profile per Hemoglobin Clusters

Table 5 shows the mean energy and main nutrients among the hemoglobin clusters. The mean energy, carbohydrates, vitamin B12, and fat are roughly equal among the low and high hemoglobin clusters (2,207.7, 2,208; 311.4, 311.5; 2.1, 2.2; and 76.3, 76.1). The high hemoglobin cluster had higher mean protein, iron, and folate intake (76.7, 17.8, and 263.6). These differences, however, were not found to be statistically significant across hemoglobin groups.

TABLE 5
www.frontiersin.org

Table 5. Mean (±SD) distribution of nutrient intake by hemoglobin clusters.

CRT Analysis of Anemia Related Risk Factors

The CRT classification technique has been used for classifying the hemoglobin clusters by schoolchildren's nutritional intake (vitamin B12, iron, and folate). Table 6 shows the variables included in the CRT classification analysis. Figures 13 show the CRT analysis. Each node contains three statistical values (category, %, n) in addition to node number, category stands for vitamin B12, iron, and folate intake, n stands for low and high hemoglobin clusters in this category, and % is the percentage of students in each cluster.

TABLE 6
www.frontiersin.org

Table 6. Classification tree variables' description.

FIGURE 1
www.frontiersin.org

Figure 1. Classification and regression decision tree (CRT) analysis of anemia-associated risk factors as per vitamin B12 intake.

FIGURE 2
www.frontiersin.org

Figure 2. Classification and regression decision tree (CRT) analysis of anemia-associated risk factors as per iron intake.

FIGURE 3
www.frontiersin.org

Figure 3. Classification and regression decision tree (CRT) analysis of anemia-associated risk factors as per folate intake.

The CRT classification of hemoglobin in reference to vitamin B12 is shown in Figure 1, the classification tree includes a total of 39 nodes with 20 terminal nodes. In each category, the population of hemoglobin clusters differs significantly (all p < 0.05). The estimated error of risk in the model is 0.439, and the standard error (SE) is 0.007. The vitamin B12 intake was classified into below and above RDA values, the below RDA group was classified by locality, iron, gender, unhealthy food consumption, age, BMI, and physical activity, while the above RDA group was classified by gender, folate, age, iron, and locality. The below RDA node by locality was classified into two groups, urban group and camp and rural group, and the urban group was classified by iron intake (below and above RDA); the below group was classified by gender and unhealthy food consumption. Girls reported a higher percentage of low hemoglobin levels that were significant with unhealthy food consumption. The camp and rural group was further classified by gender. Boys were classified by age group, whereas girls were classified by BMI and physical activity. Other variables did not reach a significance of 0.05 and were not included in the model, such as father's and mother's education, healthy food consumption, and leisure time activity.

Figure 2 shows the hemoglobin clusters' classification per iron intake. The model includes a total of 39 nodes with 20 terminal nodes. In each category, the population of hemoglobin clusters differs significantly (all p < 0.05). The estimated error of risk in the model is 0.441, and the SE is 0.007.

The iron intake was classified into below and above RDA groups; the below group was classified by gender, locality, age group, unhealthy food consumption, BMI, physical activity, and family income. The above group was classified by locality, gender, folate intake, age group, physical activity, BMI, and mother education. The below RDA group was further classified by gender; the boys' group were associated with age, while the girls' group were further classified by locality, in which the urban residents group was associated with unhealthy food consumption, whereas those who had high unhealthy food consumption were classified by their family income. However, camp and rural residents were classified by their BMI level, and those who are overweight or obese were further classified by their physical activity. Those who have normal BMI or are underweight were classified by family income.

On the other hand, the above RDA group was classified first by locality (camp and rural residents and urban residents). Among the camp and rural residents group, the hemoglobin level was affected by folate intake. In the left branch, those who had low folate intake were classified by gender and then by age. In students who had high folate intake, physical activity was a significant factor, split into low, medium, and high levels, which were classified by the BMI level. In the latter group, those in the urban residence group were classified by gender, and boys were classified by age, and younger students (10–13-year-old) were further classified by the education level of their mothers, respectively.

Figure 3 demonstrates the hemoglobin clusters' classification per folate intake. The model includes a total of 35 nodes with 18 terminal nodes. In each category, the population of hemoglobin clusters differs significantly (all p < 0.05). The estimated error of risk in the model is 0.441, and the SE is 0.007.

The folate intake was classified into below and above RDA values, the below RDA group was classified by locality, age group, physical activity, family income, and mother's education. The above RDA group was classified by locality, gender, age group, vitamin B12 intake, iron intake, and mother's education. Both folate intakes above and below RDA values were first classified by locality. On the left branch of the tree, the camp and rural residents were classified per age. Older students (14–17 years) were further classified by gender; boys were classified by their residence in camp and rural areas. Younger students (10–13 years) were classified by physical activity; those who had either low or high levels of physical activity were further classified by their vitamin B12 intake. Residents of the urban were first categorized by gender. Boys were classified by their age, where the education of a mother was a significant factor in younger boys. On the other hand, girls were first classified per their vitamin B12 intake; those who had vitamin B12 intake below RDA were further classified according to their iron intake. On the right branch of the tree, those who had higher folate intake and lived in camps or rural areas were classified based on their physical activity levels. Those who had either low or medium levels were affected by their family income. Furthermore, those with low family income were classified by the education level of their mothers. Those who lived in urban residences were only classified by their age.

Discussion

Anemia remains a major health concern among schoolchildren (6). Almost half of the sample were found to have mean hemoglobin levels below the WHO cutoff value for anemia (1). The results of the present study indicate that there is an interlinked association between nutritional intake and sociodemographic factors in the development of the disease, which is consistent with other studies (11, 18, 41). The sole deficiency of certain nutrients is rarely found as a single nutritional anemia associated factor. The CRT analysis partitioned the sample's hemoglobin clustered groups into homogeneous subgroups, taking vitamin B12, iron, and folate intakes as key splitting variables. The results indicate that two-thirds of hemoglobin clusters were found to have a vitamin B12 intake below the RDA level and were significantly associated with the place of residence factor. This result is consistent with studies that associate living place and anemia among children (26, 41, 53). Interestingly, our results indicate that weight and physical activity are significant anemia-related factors in girls living in camps and rural areas. Whereas, in urban areas, iron intake and unhealthy food consumption were found to be significant anemia-related factors, which could be explained by the differences in lifestyle and eating habits between these residences. Furthermore, girls in camps and rural areas spend more indoor time than those in urban areas, which might lead to limited access to physical activities and higher obesity rates furthermore, this finding is consistent with other studies which indicated that obesity is associated with vitamin B12 deficiency and iron deficiency, therefore possible nutritional anemia (5457). The coexistence of these nutrients' deficiencies has been observed in other studies. Ahmed et al. have demonstrated the prevalence of coexisting vitamin B12 and iron deficiencies among adolescent girls, albeit in rural rather than urban areas as found in our study (58). In boys, however, we have observed that the anemia differs significantly across the urban and non-urban areas by age, where younger boys have the higher rates of anemia. This finding is inconsistent across studies. Research carried out in Ethiopia, for example, has found that anemia levels are lower in 10–13-year-old boys than older adolescents (59); however, other studies have demonstrated that anemia is higher among early adolescent years (10–13 years) compared with the late adolescent period (60). Another study in Nepal concluded that the anemia level in older boys and girls (15–19 years) is higher than the anemia level in younger children (61).

Even those who have an adequate vitamin B12 intake can still be affected by other variables. It seems that, in this group, iron intake is a significant factor in young boys, while the hemoglobin levels of girls might be more affected by folate intake. This emphasizes the need for a holistic approach to nutrient intake enhancement when developing policies to prevent nutritional anemia.

On the other hand, when we classified the anemic status with iron intake, it seems that lower iron intake is higher in girls. Furthermore, anemia in girls residing in urban areas is associated with food consumption and family income, whereas in camps and rural areas, it seems that obesity and lower physical activity correspond with the higher rates of anemia. In boys, age again is a major factor whatever their iron intake may be. Of note, folate intake seems to affect both boys and girls in non-urban areas with adequate iron intake; however, it is more significant in younger boys and older girls, respectively. This result coincides with the literature, which has found that average folate intake in girls differs by age (15) and between boys and girls, especially in 13–15-year-old children (62).

When we classified hemoglobin clusters by folate intake, there were differences by locality across all levels of folate intake. Younger children living in camps and rural areas who had a lower intake of folate are affected by lower physical activity and lower vitamin B12 intake. However, the hemoglobin levels of older children in the same conditions are affected by age and gender, where anemia is higher among girls. This may indicate that focusing on the folate levels of girls in camps and rural areas might significantly affect their anemic status. On examining urbanite children with folate intake lower than the RDA, vitamin B12 and iron intake are important factors, however, young boys are particularly affected by the education of their mothers. Unexpectedly, in this group,the hemoglobin level is higher in boys whose mothers have a lower education. This could be explained by the fact that mothers with lower education are most likely housewives and spend more time with their children and family, which might contribute to the better care of children's health and nutrition. This finding needs further investigation, as the educational status of mothers is found to be positively correlated with the lower odds of anemia across children in numerous studies (26, 63, 64).

The K-means and CRT clustering and classification techniques showed a different pattern of nutritional deficiencies in reference to the hemoglobin results of schoolchildren. The classification by vitamin B12 is associated with locality, iron, gender, unhealthy food consumption, age, BMI, and physical activity. The classification by iron intake is associated with gender, locality, age group, unhealthy food consumption, BMI, physical activity, and family income. The classification by folate intake is associated with locality, age, gender, physical activity, vitamin B12, locality, mother's education, and iron intake.

The findings of this study have considerable implications for researchers and policymakers to reduce the prevalence of nutritional anemia. This study supports the implementation of data mining and machine learning techniques in developing AI solutions for the development of patient-specific anemia prevention and intervention programs. As indicated, there are several sociodemographic gaps that have to be taken into consideration when designing interventional strategies to improve nutritional deficiencies. Overall, it seems that it is rather important to address the vitamin B12, iron, and folate intake in girls, while accounting for age, locality, food consumption patterns, and physical activity levels, and focus on iron and folate intake in young boys with consideration to the place of residence. However, there are some limitations to our study, namely, the use of a single 24-h recall interview, and the nutrient was calculated from food intake rather than blood biomarkers. Furthermore, the study sample only included the schoolchildren age group and did not include the younger age groups (<10 years). Hopefully, the findings of this study might aid policymakers in signifying the population groups requiring priority attention, what strategy to choose, and where to allocate public resources, to customize intervention plans based on the most significant factors involved in nutritional anemia occurrence.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics Statement

The studies involving human participants were reviewed and approved by Ministry of Education and Al-Quds University Institutional Review Board (IRB). Written informed consent to participate in this study was provided by the participants' legal guardian/next of kin.

Author Contributions

RQ and DA: conceptualization, methodology, data curation, formal analysis, validation, writing—original draft preparation, and review and editing. All authors contributed to the article and approved the submitted version.

Funding

The work included in this study is part of the wider project Determinants of Cognitive Development in Deprived Environments: evidence from the West Bank funded by the German Research Foundation (DFG) under Grant Number JU 2769/2.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

We wish to thank the Ministry of Education and Al-Quds University, Palestine for its permission to perform the study, all children and their parents for their patience, as well as all field workers, Ziad Abdeen and Sameh Al Halaq for their contributions, and we are also grateful to Mohammad Remawi and Suzan Tutah for their support in data collection.

References

1. de Benoist Bruno World Health Organization Centers for Disease Control and Prevention (U.S.). Worldwide Prevalence of Anaemia 1993-2005 of : WHO Global Database of Anaemia. Geneva: World Health Organization (2008). p. 40.

Google Scholar

2. Llanos MJ. Significance of anaemia in the different stages of life/Significado de la anemia en las diferentes etapas de la Vida. Enfermería Glob. (2016) 15:419–30. Available online at: https://scielo.isciii.es/scielo.php?pid=S1695-61412016000300015&script=sci_arttext&tlng=en

Google Scholar

3. Kassebaum NJ, Jasrasaria R, Naghavi M, Wulf SK, Johns N, Lozano R, et al. A systematic analysis of global anemia burden from 1990 to 2010. Blood. (1990) 123:615–24. doi: 10.1182/blood-2013-06-508325

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Yadav D, Chandra J. Iron deficiency: beyond anemia. Indian J Pediatr. (2011). 78:65–72. doi: 10.1007/s12098-010-0129-7

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Mattiello V, Schmugge M, Hengartner H. von der Weid N, Renella R. Diagnosis and management of iron deficiency in children with or without anemia: consensus recommendations of the SPOG pediatric hematology working group. Eur J Pediatr. (2020) 179:527–45. doi: 10.1007/s00431-020-03597-5

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Allali S, Brousse V, Sacri AS, Chalumeau M, de Montalembert M. Anemia in children: prevalence, causes, diagnostic work-up, and long-term consequences. Expert Rev Hematol. (2017) 10:1023–8. doi: 10.1080/17474086.2017.1354696

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Krämer M, Kumar S, Vollmer S. Anemia, Diet, Cognitive Development: Impact of Health Information on Diet Quality Child Nutrition in Rural India. (2021). www.iza.org (accessed January 04, 2022).

Google Scholar

8. Janus J, Moerschel SK. Evaluation of anemia in children. Am Fam Physician. (2010) 81:1462–71.

Google Scholar

9. Stiller CK, Golembiewski SKE, Golembiewski M, Mondal S, Biesalski HK, Scherbaum V. Prevalence of undernutrition and anemia among santal adivasi children, Birbhum District, West Bengal, India. Int J Environ Res Public Health. (2020) 17:342. doi: 10.3390/ijerph17010342

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Jinghuan J, Hu Y, Li M, Chen J, Mao D, Li W, et al. Prevalence of anemia in chinese children and adolescents and its associated factors. Int J Environ Res Public Health. (2019) 16:1416. doi: 10.3390/ijerph16081416

PubMed Abstract | CrossRef Full Text | Google Scholar

11. André HP, Sperandio N, Siqueira RL de, Franceschini S. do CC, Priore SE. Food and nutrition insecurity indicators associated with iron deficiency anemia in Brazilian children: A systematic review. Cien Saude Colet. (2018) 23:1159–67. doi: 10.1590/1413-81232018234.16012016

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Gupta PM, Perrine CG, Mei Z, Scanlon KS. Iron, anemia, and Iron deficiency anemia among young children in the United States. Nutrients. (2016) 8:330. doi: 10.3390/nu8060330

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Cappellini MD, Musallam KM, Taher AT. Iron deficiency anaemia revisited. J Intern Med. (2020) 287:153–70. doi: 10.1111/joim.13004

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Visser M, van Zyl T, Hanekom SM, Baumgartner J, van der Hoeven M, Taljaard-Krugell C, et al. Nutrient patterns and their relation to anemia and iron status in 5- to 12-y-old children in South Africa. Nutrition. (2019) 62:194–200. doi: 10.1016/j.nut.2019.01.016

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Bhardwaj A, Kumar D, Raina SK, Bansal P, Bhushan S, Chander V. Rapid assessment for coexistence of vitamin B12 and iron deficiency anemia among adolescent males and females in northern himalayan state of India. Anemia. (2013) 2013:959605. doi: 10.1155/2013/959605

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Sharourou ASA, Hassan MA, Teclebrhan MB. Anemia : Its Prevalence, Causes and Management. Egypt J Hosp Med. (2018) 70:1877–9. doi: 10.12816/0044770

CrossRef Full Text | Google Scholar

17. Stelle I, Kalea AZ, Pereira DIA. Symposium 1: Competition and Bioavailability of Dietary Components: Iron Deficiency Anaemia: Experiences and Challenges. Proceedings of the Nutrition Society. Cambridge: Cambridge University Press (2019). p. 19–26.

Google Scholar

18. Hussein NA, Ouda MM. Life style risk factors of Iron deficiency anemia among adolescents' girls. IJND. (2018) 8:18–28. doi: 10.15520/ijnd.v8i10.2326

CrossRef Full Text | Google Scholar

19. Stoltzfus RJ. Iron deficiency: global prevalence and consequences. Food Nutr Bull. (2003) 24:S99–S103. doi: 10.1177/15648265030244S106

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Lioret S, Touvier M, Lafay L, Volatier J-L, Maire B. Dietary and physical activity patterns in french children are related to overweight and socioeconomic status. J Nutr. (2018) 138:101–7. doi: 10.1093/jn/138.1.101

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Wirfält AKE, Jeffery RW. Using cluster analysis to examine dietary patterns: Nutrient intakes, gender, and weight status differ across food pattern clusters. J Am Diet Assoc. (1997) 97:272–9. doi: 10.1016/S0002-8223(97)00071-0

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Hosseini SH, Papanikolaou Y, Islam N, Rashmi P, Shamloo A, Vatanparast H. Consumption patterns of grain-based foods among adults in Canada: evidence from Canadian community health survey-nutrition 2015. Nutrients. (2019) 11:784. doi: 10.3390/nu11040784

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Niermann CYN, Spengler S, Gubbels JS. Physical activity, screen time, and dietary intake in families: a cluster-analysis with mother-father-child triads. Front Public Health. (2018) 6:1–12. doi: 10.3389/fpubh.2018.00276

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Gubbels JS, van Assema P, Kremers SPJ. Physical activity, sedentary behavior, and dietary patterns among children. Curr Nutr Rep. (2013) 2:105–12. doi: 10.1007/s13668-013-0042-6

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Pérez-Rodrigo C, Gil Á, González-Gross M, Ortega RM, Serra-Majem L, Varela-Moreiras G, et al. Clustering of dietary patterns, lifestyles, and overweight among Spanish children and adolescents in the ANIBES study. Nutrients. (2015) 8:1–17. doi: 10.3390/nu8010011

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Goswmai S, Das KK. Socio-economic and demographic determinants of childhood anemia. J Pediatr. (2015) 91:471–7. doi: 10.1016/j.jped.2014.09.009

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Kumar A, Goyal A, Verma N, Mahesh A. Study of anemia among adolescent school girls and young adults. Int J Adv Med. (2018) 5:877. doi: 10.18203/2349-3933.ijam20182753

CrossRef Full Text | Google Scholar

28. Asmare B, Taddele M, Berihun S, Wagnew F. Nutritional status and correlation with academic performance among primary school children, northwest Ethiopia. BMC Res Notes. (2018) 11:1–6. doi: 10.1186/s13104-018-3909-1

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Qasrawi R, VicunaPolo S, Al-Halawa DA, Hallaq S, Abdeen Z. Predicting school children academic performance using machine learning techniques. Adv Sci Technol Eng Syst J. (2021) 6:8–15. doi: 10.25046/aj060502

CrossRef Full Text | Google Scholar

30. Qasrawi R, Abdeen Z, Taweel H, Younis MA, Al-Halawa DA. Data mining techniques in identifying factors associated with schoolchildren science and arts academic achievement. Proceedings - 2020 International Conference on Promising Electronic Technologies, ICPET 2020. Piscataway, NJ: Institute of Electrical and Electronics Engineers Inc. (2020). p. 78–83.

Google Scholar

31. Ye F, Chen ZH, Chen J, Liu F, Zhang Y, Fan QY, et al. Chi-squared automatic interaction detection decision tree analysis of risk factors for infant anemia in Beijing, China. Chin Med J. (2016) 129:1193–9. doi: 10.4103/0366-6999.181955

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Meena K, Tayal DK, Gupta V, Fatima A. Using classification techniques for statistical analysis of Anemia. Artif Intell Med. (2019) 94:138–52. doi: 10.1016/j.artmed.2019.02.005

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Kilicarslan S, Celik M, Sahin S. Hybrid models based on genetic algorithm and deep learning algorithms for nutritional Anemia disease classification. Biomed Signal Process Control. (2021) 63:102231. doi: 10.1016/j.bspc.2020.102231

CrossRef Full Text | Google Scholar

34. Qasrawi R. Cluster analysis for food group consumption patterns in a national sample of Palestinian schoolchildren: evidence from HBSC Survey 2013-2014. Al-Quds J Acad Res. (2021) 01:33–52. doi: 10.47874/2021p7

CrossRef Full Text | Google Scholar

35. Khan JR, Chowdhury S, Islam H, Raheem E. Machine learning algorithms to predict the childhood anemia in Bangladesh. J Data Sci. (2021) 17:195–218. doi: 10.6339/JDS.201901_17(1).0009

CrossRef Full Text | Google Scholar

36. Gunčar G, Kukar M, Notar M, Brvar M, Cernelč P, Notar M. Application of machine learning for hematological diagnosis. Sci Rep. (2018) 8:411. doi: 10.1038/s41598-017-18564-8

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Chaves LO, Domingos ALG, Fernanades DL, Cerqueira FB, Siqueira-batista R, Bressan J. Applicability of machine learning techniques in food intake assessment: A systematic review. Crit Rev Food Sci Nutr. (2021) 1–18. doi: 10.1080/10408398.2021.1956425. [Epub ahead of print].

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Sow B, Mukhtar H, Ahmad HF, Suguri H. Assessing the relative importance of social determinants of health in malaria and anemia classification based on machine learning techniques. Inform Health Soc Care. (2020) 45:229–41. doi: 10.1080/17538157.2019.1582056

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Roberts C, Currie C, Samdal O, Currie D, Smith R, Maes L. Measuring the health and health behaviours of adolescents through cross-national survey research: recent developments in the health behaviour in school-aged children (HBSC) study. J Public Health. (2007) 15:179–86. doi: 10.1007/s10389-007-0100-x

CrossRef Full Text | Google Scholar

40. Qasrawi R. Links between nutrition, life style habits and academic acheivment in Palestinian schoolchildren: A cross-sectional study. Al-Quds J Acad Res. (2021) 01:90–102. doi: 10.47874/2021p6

CrossRef Full Text | Google Scholar

41. Mohammed SH, Habtewold TD, Muhammad F, Esmaillzadeh A. The contribution of dietary and non-dietary factors to socioeconomic inequality in childhood anemia in Ethiopia: a regression-based decomposition analysis. BMC Res Notes. (2019) 12:646. doi: 10.1186/s13104-019-4691-4

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Khanmohammadi S, Adibeig N, Shanehbandy S. An improved overlapping k-means clustering method for medical applications. Expert Syst Appl. (2017) 67:12–8. doi: 10.1016/j.eswa.2016.09.025

CrossRef Full Text | Google Scholar

43. Foster E, Bradley J. Methodological considerations and future insights for 24-hour dietary recall assessment in children. Nutrition Research. (2018) 51:1–11. doi: 10.1016/j.nutres.2017.11.001

PubMed Abstract | CrossRef Full Text | Google Scholar

44. Bailey RL. Overview of dietary assessment methods for measuring intakes of foods, beverages, and dietary supplements in research studies. Curr Opin Biotechnol. (2021) 70:91–6. doi: 10.1016/j.copbio.2021.02.007

PubMed Abstract | CrossRef Full Text | Google Scholar

45. Gibson RS, Ferguson EL. An Interactive 24-Hour Recall for Assessing the Adequacy of Iron and Zinc Intakes in Developing Countries. International Food Policy Research Institute (IFPRI); International Center for Tropical Agriculture (CIAT) (2008).

Google Scholar

46. Frank GC, Nicklas TA, Webber LS, Major C, Miller JF, Berenson GS, et al. Food frequency questionnaire for adolescents: defining eating patterns. J Am Diet Assoc. (1992) 92:313–8. doi: 10.1016/S0002-8223(21)00617-9

PubMed Abstract | CrossRef Full Text | Google Scholar

47. Abdeen Z, Qasrawi R. Palestinian Food: The First Palestinian National Nutrition Survey (FPNNS2000). Jerusalem: VDM Publishng House (2010).

Google Scholar

48. Lee RD. Nutribase nutrition manager. J Acad Nutr Diet. (1997) 97:219.

Google Scholar

49. of Medicine I. Dietary Reference Intakes for Energy, Carbohydrate, Fiber, Fat, Fatty Acids, Cholesterol, Protein, and Amino Acids. Washington, DC: The National Academies Press (2005).

Google Scholar

50. World Health Organization Regional Office for the Eastern Mediterranean. Healthy Diet. Cairo: World Health Organization (2019).

Google Scholar

51. World Health Organization. World Health Organization Reference Curves. (2015). Available online at: https://www.who.int/tools/growth-reference-data-for-5to19-years/indicators/bmi-for-age (accessed February 22, 2022).

Google Scholar

52. Morgan J,. Classification Regression Tree Analysis. (2014). Available online at: https://www.bu.edu/sph/files/2014/05/MorganCART.pdf (accessed December 17, 2021).

Google Scholar

53. Policy H, Children, FOR,. Growing Up Unequal : Gender Socioeconomic Differences in Young People's Health Well-Being. (2014). Available online at: https://apps.who.int/iris/handle/10665/326320 (accessed December 17, 2021).

Google Scholar

54. Zhao L, Zhang X, Shen Y, Fang X, Wang Y, Wang F. Obesity and iron deficiency: a quantitative meta-analysis. Obesity Reviews. (2015) 16:1081–93. doi: 10.1111/obr.12323

PubMed Abstract | CrossRef Full Text | Google Scholar

55. Ho M, Halim JH, Gow ML, El-Haddad N, Baur LA, Cowell CT, et al. Vitamin B12 in obese adolescents with clinical features of insulin resistance. Nutrients. (2014) 6:5611–8. doi: 10.3390/nu6125611

PubMed Abstract | CrossRef Full Text | Google Scholar

56. Pinhas-Hamiel O, Doron-Panush N, Reichman B. Obese children and adolescents a risk group for low vitamin B 12 concentration. Arch Pediatr Adolesc Med. (2006) 160:933–6. doi: 10.1001/archpedi.160.9.933

PubMed Abstract | CrossRef Full Text | Google Scholar

57. Alshwaiyat N, Ahmad A, Wan Hassan WMR, Al-jamal H. Association between obesity and iron deficiency (Review). Exp Ther Med. (2021) 22:1268. doi: 10.3892/etm.2021.10703

PubMed Abstract | CrossRef Full Text | Google Scholar

58. Ahmed F, Khan MR, Banu CP, Qazi MR, Akhtaruzzaman M. The coexistence of other micronutrient deficiencies in anaemic adolescent schoolgirls in rural Bangladesh. Eur J Clin Nutr. (2008) 62:365–72. doi: 10.1038/sj.ejcn.1602697

PubMed Abstract | CrossRef Full Text | Google Scholar

59. Zeleke MB, Shaka MF, Anbesse AT, Tesfaye SH. Anemia and its determinants among male and female adolescents in southern ethiopia: a comparative cross-sectional study. Anemia. (2020) 2020:3906129. doi: 10.1155/2020/3906129

PubMed Abstract | CrossRef Full Text | Google Scholar

60. Shaka MF, Wondimagegne YA. Anemia, a moderate public health concern among adolescents in South Ethiopia. PLoS ONE. (2018) 13:e0191467. doi: 10.1371/journal.pone.0191467

PubMed Abstract | CrossRef Full Text | Google Scholar

61. Chalise B, Aryal KK, Mehta RK, Dhimal M, Sapkota F, Mehata S, et al. Prevalence and correlates of anemia among adolescents in Nepal: findings from a nationally representative cross-sectional survey. PLoS ONE. (2018) 13:e0208878. doi: 10.1371/journal.pone.0208878

PubMed Abstract | CrossRef Full Text | Google Scholar

62. Jani R, Salian N, Udipi S, Lohia N, Haas J, Boy E. Folate status and intake of tribal Indian adolescents aged 10 to 17 year. Food Nutr Bull. (2015) 36:14–23. doi: 10.1177/156482651503600102

PubMed Abstract | CrossRef Full Text | Google Scholar

63. Choi HJ, Lee HJ, Jang HB, Park JY, Kang JH, Park KH, et al. Effects of maternal education on diet, anemia, and iron deficiency in Korean school-aged children. BMC Public Health. (2011) 11:870. doi: 10.1186/1471-2458-11-870

PubMed Abstract | CrossRef Full Text | Google Scholar

64. Al-Shookri A, Al-Shukaily L, Hassan F, Al-Sheraji S, Al-Tobi S. Effect of mothers nutritional knowledge and attitudes on Omani children's dietary intake. Oman Med J. (2011) 26:253–7. doi: 10.5001/omj.2011.61

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: nutritional anemia, anemia classification, cluster analysis, classification model, Classification and Regression Tree, schoolchildren

Citation: Qasrawi R and Abu Al-Halawa D (2022) Cluster Analysis and Classification Model of Nutritional Anemia Associated Risk Factors Among Palestinian Schoolchildren, 2014. Front. Nutr. 9:838937. doi: 10.3389/fnut.2022.838937

Received: 18 December 2021; Accepted: 21 March 2022;
Published: 10 May 2022.

Edited by:

Juan Mielgo-Ayuso, University of Burgos, Spain

Reviewed by:

Majid Hajifaraji, National Nutrition and Food Technology Research Institute, Iran
Lovejeet Kaur, University of Delhi, India

Copyright © 2022 Qasrawi and Abu Al-Halawa. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Radwan Qasrawi, cmFkd2FuJiN4MDAwNDA7c3RhZmYuYWxxdWRzLmVkdQ==; cmFkd2FuLnFhc3Jhd2kmI3gwMDA0MDtpc3RpbnllLmVkdS50cg==; Diala Abu Al-Halawa, aGFsYXdhZGlhbGEmI3gwMDA0MDtnbWFpbC5jb20=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.