Skip to main content

ORIGINAL RESEARCH article

Front. Public Health, 17 March 2020
Sec. Children and Health

Clustering and Tracking the Stability of Biological CVD Risk Factors in Adolescents: The Malaysian Health and Adolescents Longitudinal Research Team Study (MyHeARTs)

  • 1Department of Social and Preventive Medicine, Faculty of Medicine, Centre for Population Health (CePH), University of Malaya, Kuala Lumpur, Malaysia
  • 2Faculty of Health and Medical Sciences, School of Medicine, Taylor's University, Selangor, Malaysia
  • 3Jeffery Cheah School of Medicine and Health Sciences, Monash University, Selangor, Malaysia
  • 4Department of Paediatrics, Faculty of Medicine, University of Malaya, Kuala Lumpur, Malaysia
  • 5T.H. Chan School of Public Health, Harvard University, Boston, MA, United States
  • 6Department of Nutrition, Faculty of Public Health, Airlangga University, Surabaya, Indonesia

Background: Cardiovascular disease (CVD) risk factors tend to cluster and progress from adolescence to young adulthood. Reliable and meaningful clustering of CVD risk factors is essential to circumvent loss of information. Tracking adverse and high-risk profiles of adolescents is hoped to curb CVD progression later in life. The study aims to investigate the clustering of biological CVD risk factor among adolescents in Malaysia and the transitions between clusters over time.

Method: The Malaysian Health and Adolescents Longitudinal Research Team study (MyHeARTs) examined school students aged 13 in 2012 and re-examined them in 2014 and 2016. In a two-stage stratified cluster sampling, 1,361 students were recruited, of which, 1,320 had complete data. In the follow-up, there were 881 and 637 students in 2014 and in 2016, respectively. Pearson's correlation coefficients were used to identify and remove highly correlated CVD risk factors. All risk factors were standardized into z-scores. The hierarchical and non-hierarchical (k-means) cluster analyses were used to classify students into high, medium and low risk clusters in each screening year. The tracking and stability of cluster transitions through cross-classification were enumerated with Pearson's inter-age correlations and percentages.

Results: Three significant clusters of high, medium and low risk groups were derived from the clustering of eight biological CVD risk factors. The transitions between risk clusters from one screening year to the other were categorized as either stagnant, improved or adverse. The number of students who had adverse transitions increased from 15.5% (13–15 year) to 19.5% (15–17 year), 13.8 to 18.2% among the girls and 19.9 to 22.8% among the boys. For girls, the number of them who remained at high risk over the two transition periods were about the same (13.6 vs. 13.8%) whereas for boys, the percentage reduced from 14.6 to 12.3%.

Conclusion: Over time, more than 12% of adolescents remained in the high risk cluster. There were sizable adverse transitions over time as more adolescents appear to be shifting toward an increased risk of having CVD. Collaborative and constant measures should be taken by parents, school, health promotion boards and policy makers to curb the multiplicative effect of clustering CVD risk factors among adolescents.

Introduction

Non-communicable diseases (NCDs) contribute to 71% of deaths globally. Cardiovascular diseases (CVD) is one of the primary NCDs (1). In 2016, WHO reported that 17.9 million or 44% of NCD deaths were due to CVD. Three quarter of these deaths occurred in low and middle income countries. In Malaysia, CVD is the leading cause of death as it accounted for an estimated 35% of all deaths due to NCDs. The common risk factors for CVD, namely heart attack and stroke, are hypertension, hypercholesteremia, diabetes and overweight/obesity. The 2015 National Health and Morbidity Survey (NHMS) reported 63% of adults had at least one of these risk factors and that children and adolescents are as vulnerable to these risk factors (2). Among ASEAN countries, Malaysia is ranked second highest in terms of childhood obesity, with a 12.7% prevalence of age standardized overweight (1).

It is widely known that biological risk factors for cardiovascular diseases (CVD) such as blood pressure, body composition, body lipid, body fitness and blood glucose tend to cluster (35). The clustering of these risk factors has a multiplicative effect that induces an elevated risk of CVD (69). These risk factors start early in life during adolescence and persist into adulthood (10). In adolescence, clustering of multiple risk factors leads to initial stages of atherosclerosis. Streaks of fat, cholesterol and fibrous plaques begin to accumulate in the artery walls at a very young age of 10 and slowly accumulates over time (6, 11, 12). These atherosclerotic lesions are usually not manifested until the child becomes an adult. As such, tracking the clustering of multiple risk factors of CVD from young adolescence is of vital importance (13, 14). Tracking is often done by evaluating the transitions in risk clusters over time. Clusters are said to be stable when the transitions between risk clusters remain the same.

Although several risk clustering and tracking methods exist, most findings show high risk clusters tend to be moderately to highly stable (13, 14). A study among schoolchildren, aged 15 to 19 years, in Denmark measured total risk scores as the sum of seven risk factors; systolic and diastolic blood pressure, total cholesterol, HDL-cholesterol, triglyceride, smoking and skinfold thickness. The tracking of total risk score over 8 years found a significantly high number of subjects remaining in upper quintiles (r = 0.85) (15). Similarly, another study from the same cohort of Danish adolescents reported the probability of being in the upper quartile of two or more risk factors (TC:HDL ratio, triglyceride, systolic blood pressure, and body fat) at the first examination was six times greater than the second examination 8 years later (16). A study of young Finnish children aged 6–18 years, tracked the transitions of high-risk groups using 3 risk factors; total cholesterol, HDL-cholesterol and diastolic blood pressure. The 6-years tracking of risk factors found about 25% of children remained in high risk groups with extreme tertiles (17). The Bogalusa Heart Study examined the persistence of clustering multiple CVD risk factors consisting adverse levels of systolic blood pressure, TC:HDL ratio and plasma insulin. The 8 years follow up study of individuals aged 5–17 years old showed that 61% of the individuals in the highest quintile of the multiple index score maintained their rankings (18). The Aerobics Center Longitudinal Study measured a composite risk factor score of waist circumference, HDL-cholesterol, triglyceride, glucose and mean arterial blood pressure and found them to track moderately well (r = 0.56) from a mean age of 15.8–26.6 years (19).

The purpose of the study was to investigate the clustering of biological CVD risk factors and its' transitions over time, among adolescents in Malaysia. The first part of the study examined if the relevant CVD risk factors identified among the adolescents clustered in a reliable and meaningful way. In the second part of the study, the transitions from year 2012–2014 and 2014–2016 were examined separately in order to track the stable (especially those who remained in high risk clusters), adverse and improved changes in clusters among the adolescents. We hypothesized adolescents in high risk clusters can be clearly distinguished to indicate high risk of behavioral patterns.

Methods

The STROBE statement was adhered in reporting this study.

Data Source

The Malaysian Health and Adolescents Longitudinal Research Team study (MyHeARTs) is an inaugural initiative conducted in Malaysia. The study was designed to examine the trends of risk factors of non-communicable diseases among an adolescent cohort. The respondents were followed from the ages of 13–17. Measurements were taken at baseline (13 years), 15 years and 17 years.

The formula used to calculate sample size was n = (z2 ×p ×qr × e2)×design effect where z = standard normal deviate set at 1.96 at 5% level for two tailed test, p = estimated prevalence of adolescents aged 13–15 who smoked in school at 33%, q = 1-p, r = response rate and e = precision level. A total sample of 1,500 students were estimated. A two-stage stratified cluster sampling design was used in this study. In the first stage, 15 schools (eight urban-based and seven rural-based) were randomly selected based on the calculated sample size (20). The selection was based on a complete list of secondary public schools (sampling frame) located in the Federal Territory of Kuala Lumpur (capital city of Malaysia) and the central and northern zone of Peninsular Malaysia, specifically in the states of Selangor and Perak. In the second stage all 13-years-old students from the selected schools were invited to enroll in the study. There were 1,361 students at baseline in 2012 and subsequent follow ups were conducted in 2014 and 2016.

Data collection and examination of students were carried out during school hours with approvals from respective school administration and Ministry of Education. Self-administered questionnaires were distributed among students. Trained enumerators present during the data collection period ensured the smooth flow. Information on socio-demographics, lifestyle, health background and high risk behaviors were collected. More details about the study procedure and sampling have been reported elsewhere (20). All procedures involving human subjects were approved by the Ethics Committee of University Malaya Medical Centre (Ref. No. 896.34).

Response Rate and Follow-Up

Of those invited, 2,694 students agreed to participate with written informed consent from their parents. Out of those 2,694 students, only 1,361 of them participated in the study (51%). In the follow-ups there were 925 and 654 students in 2014 and 2016, respectively. The attrition rate was 32.0% in the first follow up and 29.3% in the second follow. These attrition rates are very common in cohort studies (21) and was mainly due to withdrawal of consent and shifting to different schools. In 2012, out of the 1,361 students, 1,320 students had complete measurements for all the risk factors considered in this study. Out of the 1,320 students, 881 and 637 students had complete measurements for the required risk factors in 2014 and 2016, respectively.

Measurements

Eleven CVD risk factors were considered in this study; systolic blood pressure, diastolic blood pressure, total cholesterol: HDL ratio, HDL cholesterol, LDL cholesterol, triglyceride, total cholesterol, body fat, waist circumference, BMI, and blood glucose. Details of each measurement are described briefly as they have been presented elsewhere (20). Risk factors that were considered for analysis is explained in variable selection.

Blood Pressure

Both systolic and diastolic blood pressure of the students were measured by medical doctors. Measurements were taken three times repeatedly with a 2-min interval between each reading. The average reading was calculated. At each measurement, the student was seated with right upper arm positioned at the level of the heart with both feet flat on the floor. The measurements were taken using a stethoscope and a mercurial sphygmomanometer (CK-101C, Spirit Medical Co., Taiwan).

Body Lipid

A total of 15 ml of fasting blood was withdrawn from each student by a phlebotomist. The students were asked to fast for at least 10 h prior to blood taking. All blood samples were sent to the hospital laboratory before storing it temporarily at 4°C in a cool box upon blood withdrawal. The blood samples were processed at the field laboratories in each state. The samples were spun and stored as serum and divided into several aliquots of 0.5 ml of serum for individual tests. In a plain test tube, 3 ml of blood was collected for the measurement of fasting lipids (Advia Chemistry, Siemens, Germany—triglyceride, total cholesterol, high density lipoprotein cholesterol and low-density lipoprotein cholesterol).

Body Composition

Waist circumference (WC) was measured using a circumference measuring tape (Seca 201, Seca, UK). The WC was measured at the midpoint between the lowest rib margin and the iliac crest and recorded to the nearest 0.1 cm. The percentage of body fat was measured using the Tanita portable Body Composition Analyzer SC-240 MA (22). The machine was placed on a flat surface and each student was asked to step on the platform, bare-footed. The percentage of body fat was recorded to the nearest decimal.

Statistical Analysis

Variable Selection

All analyses were performed using IBM SPSS Statistics (version 22; SPSS Inc., Chicago, IL, USA). The first part of the study involved clustering the selected CVD risk factors. Preliminary analyses on 11 initially selected risk factors (systolic blood pressure, diastolic blood pressure, total cholesterol: HDL ratio, HDL cholesterol, LDL cholesterol, triglyceride, total cholesterol, body fat, waist circumference, BMI, and blood glucose) was done prior to clustering. First, for each risk factor, extreme values (more than three standard deviations above or below mean) were removed as it may result in too few observations in any one cluster (23, 24). Next, correlation coefficients between the 11 traditional CVD risk factors were examined. Variable reduction was done by looking at high correlation values between variables. CVD risk factors with correlations more than 0.9 were excluded from the clustering analysis to avoid over-representation of any single factor (25, 26). Finally, the remaining risk factors were transformed into z-scores due to varying means and variances (27). Standardizing the risk factors into z-scores was essential to identify subjects with similar characteristics so that clusters of homogeneous risk factors can be segmented.

Cluster Analysis

A two stage cluster analysis combining hierarchical and non-hierarchical (k-means) clustering methods were used (23, 28, 29). At the first stage, hierarchical method based on squared Euclidean distance and Ward's minimum variance algorithm was applied to form initial cluster centers. These non-random starting points are then applied at the second stage of k-means clustering to identify homogeneous subgroups (clusters) of students at high, medium or low risk of CVD. The reliability of the cluster solution was examined by splitting the sample into two random subsamples (23, 30). The clustering procedure was repeated to check for agreement (Kappa, κ) in cluster solution between subsamples and total sample (31). Lastly, the resulting clusters were profiled based on sociodemographic and CVD risk factors using descriptive cross-tabulation (chi-square). Since the risk factors deviated from the normal distribution, results were presented using median (lower quartile, upper quartile) and were compared using the non-parametric Kruskal Wallis (post-hoc Dunnett T3).

Tracking/Stability

In the second part of the analysis, all steps above were repeated for each follow up year from 2012 to 2014 and 2016. Each year, students were clustered cross-sectionally, into low, medium and high risk clusters based on the eight finalized risk factors. A 3 × 3 matrix was constructed to show the number of students belonging to the corresponding clusters in two subsequent years. Subjects are considered to track well if the transitions between two evaluation periods maintained their ranks over time (32). Nine possible transitions between low, medium and high risk clusters were examined.

The transitions from year 2012 to 2014 and from 2014 to 2016 was examined separately to further track the transition of students over time. For this part of the analysis, the nine identified transitions were broadly categorized into five transitions of stable; moving from low to low, medium to medium, high to high, adverse (low to high, low to medium and medium to high) and improved (medium to low, high to medium and high to low). Subsequently, the percentage change of each CVD risk was calculated for the periods from 2012–2014 as (x2014x2012x2012×100) and from 2014–2016 as (x2016x2014x2014×100). The differences in the percentage change of each CVD risk factor among the 5 identified transitions were compared using the non-parametric Kruskal Wallis test.

Results

CVD risk factors with correlations more than 0.9 were excluded from the clustering analysis to avoid over-representation of any single factor (25, 26). Based on this, the total number of risk factors considered dropped from 11 to 8; systolic blood pressure, diastolic blood pressure, TC:HDL ratio, HDL cholesterol, LDL cholesterol, triglyceride, body fat and waist circumference. Cluster analysis based on eight CVD risk factors of the adolescents resulted in them being in either one of the three distinct cluster solutions of low, medium or high risk of CVD in each year, cross-sectionally. Each cluster was distinguished based on the mean of final cluster centers in z-values, as shown in Figure 1. Subjects from the high risk cluster in all 3 years have distinctively higher means in final cluster center as compared to the subjects from the medium and low risk clusters. The reliability of the cluster solutions was determined by randomly splitting the total sample into two subsamples. The degree of agreement between the new clusters obtained from random subsample and those of the total sample are calculated. In 2012 and 2014, the agreement was excellent with κ = 0.962 (ρ < 0.001) and κ = 0.959 (ρ < 0.001), respectively. In 2016, the kappa statistic was 0.64 (ρ < 0.001) indicating a fairly high degree of replication as well.

FIGURE 1
www.frontiersin.org

Figure 1. Mean of final cluster centers of all the biological risk factors in z-scores by each risk cluster (low, medium, and high) and year in 2012, 2014, and 2016.

Sociodemographic characteristics of the clusters by each year are presented in Table 1. The associations between cluster groups and gender, ethnicity, locality, origins by states, gross household income and highest education of parent were tested using chi-square tests. In 2012, only gender, ethnicity and states were found to be significantly associated with the clusters. In 2014, only gender and gross household income were significantly associated with cluster membership. In 2016, all relationships were significant except for ethnicity, gross household income and highest education of parent. The association between gender and cluster membership was statistically significant at all three time periods, with more females in each risk-cluster group, χ2(2) = 9.87, ρ < 0.01 in 2012; χ2(2) = 8.46, ρ < 0.05 in 2014; χ2(2) = 37.36, ρ < 0.001 in 2016.

TABLE 1
www.frontiersin.org

Table 1. Sociodemographic characteristics of adolescents by clusters and by year.

The median (lower(Q1) and upper(Q3) quartile) values for each CVD risk factor are presented by clusters for each year in Table 2. Since the data deviated from normal/Gaussian distribution, the median differences between clusters were compared using the non-parametric Kruskal Wallis test. Each component of the CVD risk factors (blood pressure, body composition and blood lipid) differed by clusters. In each year, statistically significant differences (ρ < 0.001) were found between at least one pair of clusters in all three components of biological CVD risk factors. The high risk clusters for each year comprised of subjects with the highest median (Q1, Q3) for each CVD risk factor measured. The systolic blood pressure among students from high risk clusters were consistently higher across the years; 118.00 (110.00, 122.00) in 2012, 112.00 (108.00, 120.00) in 2014 and 114.00 (108.50, 122.00) in 2016. The median body fat and waist circumference values in all three study periods were distinctly higher among students of the high risk clusters (BF : 37.80 (31.00, 43.90) in 2012, 38.95 (30.80, 45.40) in 2014 and 33.00 (26.20, 41.00) in 2016; WC : 83.00 (75.00, 89.00) in 2012, 86.85 (80.00, 95.50) in 2014 and 81.50 (72.25, 89.75) in 2016. Median HDL for high risk clusters in all years are reported to be 1.30 (1.10, 1.41) in 2012, 1.20 (1.10, 1.40) in 2014, and 1.30 (1.10, 1.50) in 2016. In post-hoc multiple comparisons, the high risk clusters were found to be statistically significantly different from the low and medium risk clusters (post-hoc test, p < 0.05).

TABLE 2
www.frontiersin.org

Table 2. Median differences in individual risk factors between clusters in 2012, 2014, and 2016, Kruskal Wallis (KW) and results of post-hoc multiple comparisons.

Figure 2 shows the 3 × 3 matrices tracking the transitions of the students between 2012 and 2014 (13–15 years old) and 2014 to 2016 (15–17 years old). For the purpose of tracking, only students with complete data from 2012 to 2016 were considered. A total of 606 students, 171 boys and 435 girls, had complete data. In terms of locality, 276 (45.5%) students were from rural residential areas and 330 (54.5%) of them were from urban areas.

FIGURE 2
www.frontiersin.org

Figure 2. Tracking of students between clusters from 2012–2014 to 2014–2016.

The transitions between risk clusters were categorized as either stagnant (or stable) (LL: low to low, MM: medium to medium and HH: high to high), improved (medium to low, high to low and high to medium) or adverse (low to medium, low to high, medium to high). Figure 3 shows the transitions between clusters from 2012 to 2014 and from 2014 to 2016. Overall, the number of subjects who remained at high risk was 13.9% from 2012 to 2014 and 13.4% from 2014 to 2016. The number of subjects who had adverse transitions increased from 15.5% (13–15 years) to 19.5% (15–17 years). For the girls, the number that remained at high risk over the two transition periods was about the same (13.6 vs. 13.8 %) whereas for boys, the percentage reduced from 14.6 to 12.3%. Adverse transitions among the girls increased quite a bit from 13.8% (2012–2014) to 18.2% (2014–2016). The same trend was observed among the boys (19.9%, 2012–2014 vs. 22.8%, 2014–2016). However, compared to girls, the transition among the boys to higher risk clusters were significantly greater. The improved transitions among the adolescents are also noteworthy. The number of those from the total sample who had improved transitions in the two transition periods were about the same (22.3%, 2012–2014 vs. 23.8%, 2014–2016). However, a marked decrease was observed among boys with 24.6% of them having improved transitions between the period 13–15 years old but only 19.9% between the period 15–17 years old. On the contrary, for girls, the number of those with improved transitions increased from the period between 13–15 and 15–17 years old (21.4 vs. 25.3 %).

FIGURE 3
www.frontiersin.org

Figure 3. Transition of clusters from 2012–2014 (13–15 years old) to 2014–2016 (15–17 years old).

Table 3 shows the percentage of change in cluster transitions by each biological CVD risk factor. Some large changes in percentage were observed between groups with improved and adverse transitions. The systolic and diastolic blood pressure for those in adverse transitions from 2012 to 2014 increased significantly from 10.00% (3.77,20.00) to 16.67% (3.45,16.67), respectively. On the contrary, the blood pressure levels for those in improved transitions reduced by 12.28% (−18.18, −6.57) for systolic and 16.67% (−25.36, −8.57) for diastolic. The median differences between all the transition groups were statistically significant (χ2 = 172.4, ρ < 0.001 for systolic and χ2 = 168.3, ρ < 0.001 for diastolic). In terms of body composition, only body fat showed significant differences in median values among the groups, χ2 = 46.2, ρ < 0.001 with median percentage change for the adverse transition of 13.03% (−3.35, 34.29). The median percentage change of triglyceride in adverse transitions was only 3.67% (−18.91, 25.00). As for transitions from 2014 to 2016, the Kruskal Wallis test showed statistically significant differences in percentage change in blood pressure (χ2 = 156.4, ρ < 0.001 for systolic, χ2 = 145.8, ρ < 0.001 for diastolic), body composition (χ2 = 37.2, ρ < 0.001 for body fat) and blood lipid (χ2 = 17.98, ρ < 0.001 for HDL). The median for the percent change were much higher in adverse transitions with 16.49% (5.00, 24.44) for systolic, 15.00% (1.67, 27.50) for diastolic and 4.43% (−6.43, 15.02) for body fat. As for percentage change in body lipid, the median in improved transitions was greatest for triglyceride at −11.11% (−25.00, 16.67).

TABLE 3
www.frontiersin.org

Table 3. Median percentage of change in biological risk factors in groups with different transition patterns.

Discussion

This study investigated the clustering of biological CVD risk factors in a young population of adolescents in Malaysia, a country suffering from the heavy burden of mortality from CVD for more than 40 years (33). Clustering was defined according to eight biological CVD risk factors: systolic blood pressure, diastolic blood pressure, body fat, waist circumference, HDL cholesterol, LDL cholesterol, triglyceride and TC:HDL ratio. Many studies included only single risk factors such as blood pressure (17, 34), blood lipids (17, 35, 36), body composition (3739), fasting blood glucose (35) or physical fitness (16, 40, 41) but only a few considered combinations of risk factors to form clustered cardiovascular risks (1519, 42). The combination of risk factors that form meaningful clustering differ from one population to another and is chosen based on theoretical grounds for maximal discrimination between clusters (13, 27). Although metabolic syndrome is a form of clustering (19, 42), the pre-identified risk factors do not necessarily contribute to the clustering of the sample studied. For example, in this study, fasting blood glucose, a pre-requisite for metabolic syndrome, was found to contribute the least (13, 43) in explaining the differences between high, medium and low risk clusters for each screening year and was therefore omitted in clustering. This was perhaps due to low predictive values of the risk factor among the younger population.

The results of the present study suggest that clustering of biological CVD risk factors occur among adolescents and can be segmented into three risk clusters; low, medium and high. The young and relatively healthy population of adolescents aged 13–17 years old account for the comparatively small number of subjects in high risk clusters (n = 345 in 2012, n = 170 in 2014, and n = 147 in 2016). Nevertheless, it is important to take note that the standardized mean values of the final cluster centers of the high risk clusters were distinctively higher (Z = 0.618 in 2012, Z = 0.730 in 2014, and Z = 0.665 in 2016) compared to the mean cluster centers of the low and medium risk clusters. Furthermore, the post-hoc multiple comparison test of differences concurred with statistically significant differences between the high-risk clusters and other lower risk clusters. Thus, we can conclude that students belonging to high risk clusters are more susceptible to attaining cardiovascular diseases during adulthood.

Once the clusters were identified, it was interesting to track the number of subjects who maintained their cluster membership or moved toward higher or lower ranks over the 5-years (2012–2016) tracking period. The adolescent cohort showed stability among younger boys but noticeable transitions among older boys. Contrarily, substantial drifts in cluster transitions observed among younger girls turned out to be slightly better among older girls. The different patterns of tracking among boys and girls probably adhere to the different physiological growth between genders. These findings concur with a 10-years study of trends of risk factors among school children in Serbia (44) that found the proportion of waist circumference increase to be higher among younger girls compared to older girls. As for boys, a later take-off in growth spurt cause rampant transitions only later in adolescent period. However, in principal, the overall tracking pattern between clusters in both transition periods was quite low. Basically, this tells us that both boys and girls in this population do not remain in their ranks of risk cluster but keep changing ranks over time. The possible explanation could be that in addition to maturation and physiological development, lifestyle changes especially in physical activity and dietary intake among boys and girls vary erratically causing instability in rank orders of the clusters.

On top of tracking the movements or transitions of clusters, it was even more important to categorize and monitor these transitions as either stagnant (stable), adverse or improved. The findings from this longitudinal observation of the transitions are intriguing. Notably, the number of Malaysian adolescents who remained at high risk clusters and had adverse transitions over the 5-years cluster tracking period was reasonably high especially in the risk factors for body composition. The proportion of adolescents from the MyHeARTs study who maintained in high risk cluster ranks over 5 years were about 13.5–14%. The Cardiovascular Risk in Young Finns study (17) found 25% of adolescents remaining in high risk tertiles over 6 years whereas the Bogalusa Heart study (18) found about 61% of them staying in high risk quartiles over 8 years. Although a smaller proportion of Malaysian adolescents remained in high risk cluster over time as compared to adolescents from Finland and United States, these results may not be comparable due to different clustering methodologies and combinations of risk factors.

In the present study, as the adolescents grew older, the percentage of those in adverse transitions increased from 15.5 to 19.5%. Particularly among the teenage boys, the percentage increased from 19.9 to 22.8% and among the teenage girls, it increased from 13.8 to 18.2%. These group of adolescents may be many folds at higher risk of CVD in the future. As such, it is essential to monitor the developmental record of these adolescents until young adulthood. The observed trend among these school-going adolescents could be due to various factors accrued during their teenage age. Among the few are sedentary lifestyle that includes lack of exercise, hours of passive television watching, video gaming and long digital hours involving gadgets and social media (45). Depending on personal preferences, the level of physical inactivity for each adolescent differ by gender. This perhaps explain the higher number of adverse transitions among boys compared to girls. Apart from that, unhealthy dietary intake and imbalanced nutrition contribute vastly to high risk and adverse transitions. In general, adolescent diet has been reported to be high in fat and sugar and low in vitamins and minerals (23, 46). Large quantities of fast food consumption and energy dense food is a reason for high-calorie diet in most adolescents (47). A systematic review on Malaysian adolescents revealed that high consumption of energy, macronutrients and skipping meals have an effect on cardio-metabolic health (48). Overall, the interaction of sedentary lifestyle and unhealthy nutrition are deemed to be major influences in the increasing prevalence of risk profiles among the younger generation (49). Regular health screenings should be conducted in schools to monitor adolescents at potential risk of poor health. Also, reward programmes could be implemented for health promoting and nutrient-friendly schools. Parents play an equally important role in inculcating healthy eating habits and promoting an active lifestyle among children.

Strengths and Limitations

This study is one of the first to investigate the longitudinal clustering of CVD risk factors among adolescents in Malaysia. The selection of relevant risk factors and the clustering method of the present study makes it unique and have some important features in comparison to other studies that investigated clustering of biological CVD risk factors. Most studies clustered individuals based on different criteria, such as cumulative risk scoring by percentile or cut-off points (9, 50, 51). These methods have limitations that incur loss of information due to restricted threshold values (9, 51). Even if a child falls behind by merely 1 unit measurement of a high risk factor, the child falls out of the high-risk group. This somewhat reduces capturing the actual number of children closest to the higher extreme end of the distribution. The hierarchical and k-means clustering is not based on dichotomization of high risk factors and simple additive risk scoring. Instead, both steps allow synergistical interaction of risk factors in multiple combinations to classify subjects into different risk clusters. K-means clustering splits the combined information of the risk factors into risk clusters where all data points in each cluster are closest to its' cluster center that relates to the mean of the cluster (52). The combined methods merge the two most similar observations with the smallest increase in overall within-cluster variance and simultaneously maximizes between-cluster variance (25, 27). These clustering methods serve as an alternative to risk factor clustering (13). To the best of our knowledge, none of the past studies have taken the approach of clustering a single individual with multiple risk factors of CVD as done in this study. This makes this study unique. The hierarchical and non-hierarchical clustering method empower other clustering techniques and are found to be more robust in terms of segregating adolescents into high, medium and low risk groups (27). The strength of this study lies in the method used and overcomes the limitation of categorization of risk factors that leads to loss of power and reduced stability (9). Since the proportion of Malay students were more than 75% in each screening year, results from this study may not mirror the racial composition of Malaysian adolescents. The low representation of the Chinese and Indian adolescents can be seen as a limitation to this study.

Conclusion

Reliable and meaningful clustering of multiple risk factors of CVD is of practical importance especially since it allows us to identify adolescents at high risk of CVD. These adolescents who remain in high risk profiles and move to higher risk clusters over time are at greater risk of developing cardiovascular diseases in adulthood. Findings from the study is hoped to be useful in forming relevant strategies to reduce the rate of cardiovascular diseases in adulthood by starting preventive measures during childhood. The perception that the young do not need medical attention is something of the past and should be corrected to be given equal care.

Data Availability Statement

The dataset analyzed for the current study is not publicly available due to participants confidentiality and privacy. Requests to access the datasets should be directed to the corresponding author.

Ethics Statement

The study was approved by the Ethics Committee, University of Malaya Medical Centre (ethical approval number-14-376-20486).

Consent For Publication

All participants, along with their parents, provided oral assent, and written consent prior to participating.

Author Contributions

HM, TS, NA-S, and MJ designed the study. HM and MJ collected the data. NT and KC conducted the analysis. NT, KC, and HM interpreted the data. NT wrote the manuscript. All authors read and approved the final manuscript.

Funding

The MyHeARTs was supported by the following grant—University Malaya's Research Programme (UMRP022A-14HTM) and IF017-2017.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

The authors are very grateful to the students and their parents who gave their time to the study. We would also like to acknowledge all members of the MyHeARTs group not listed as co-authors of this paper.

References

1. Organization WH Global Health Observatory Data Repository: Prevalence of Overweight Age Standardized. (2019).

2. NHMS. Report of the Third National Health and Morbidity Survey (NHMS V). Kuala Lumpur: Institute for Public Health, Ministry of Health Malaysia (2015).

Google Scholar

3. Criqui MH, Barrett-Connor E, Holdbrook MJ, Austin M, Turner JD. Clustering of cardiovascular disease risk factors. Prev Med. (1980) 9:525–33. doi: 10.1016/0091-7435(80)90047-X

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Webber LS, Voors AW, Srinivasan SR, Frerichs RR, Berenson GS. Occurrence in children of multiple risk factors for coronary artery disease: the Bogalusa Heart Study. Prev Med. (1979) 8:407–18. doi: 10.1016/0091-7435(79)90018-5

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Khoury P, Morrison JA, Kelly K, Mellies M, Horvitz R, Glueck CJ. Clustering and interrelationships of coronary heart disease risk factors in schoolchildren, ages 6–19. Am J Epidemiol. (1980) 112:524–38. doi: 10.1093/oxfordjournals.aje.a113022

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Berenson GS, Srinivasan SR, Bao W, Newman W, Tracy RE, Wattigney WA. Association between multiple cardiovascular risk factors and atherosclerosis in children and young adults. N Engl J Med. (1998) 338:1650–6. doi: 10.1056/NEJM199806043382302

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Chen W, Bao W, Begum S, Elkasabany A, Srinivasan SR, Berenson GS. Age-related patterns of the clustering of cardiovascular risk variables of syndrome X from childhood to young adulthood in a population made up of black and white subjects: the Bogalusa Heart Study. Diabetes. (2000) 49:1042–8. doi: 10.2337/diabetes.49.6.1042

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Raitakari O, Porkka K, Viikari J, Ronnemaa T, Akerblom H. Clustering of risk factors for coronary heart disease in children and adolescents. the cardiovascular risk in Young Finns Study. Acta Paediatrica. (1994) 83:935–40. doi: 10.1111/j.1651-2227.1994.tb13176.x

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Twisk JW, Kemper HC, Van Mechelen W, Post GB. Clustering of risk factors for coronary heart disease: the longitudinal relationship with lifestyle. Annal Epidemiol. (2001) 11:157–65. doi: 10.1016/S1047-2797(00)00202-7

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Kemper HC, Snel J, Verschuur R, Storm-Van Essen L. Tracking of health and risk indicators of cardiovascular diseases from teenager to adult: Amsterdam Growth and Health Study. Prev Med. (1990) 19:642–55. doi: 10.1016/0091-7435(90)90061-N

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Smith CA, Pratt M. Cardiovascular disease. In: Brownson PLRRC, Davis JR, editor. Chronic Disease Epidemiology and Control. Washington, DC: American Public Health Association (1993). p. 83–108.

Google Scholar

12. Raitakari OT, Juonala M, Kähönen M, Taittonen L, Laitinen T, Mäki-Torkko N, et al. Cardiovascular risk factors in childhood and carotid artery intima-media thickness in adulthood: the cardiovascular risk in Young Finns Study. Jama. (2003) 290:2277–83. doi: 10.1001/jama.290.17.2277

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Camhi SM, Katzmarzyk PT. Tracking of cardiometabolic risk factor clustering from childhood to adulthood. Int J Pediatric Obes. (2010) 5:122–9. doi: 10.3109/17477160903111763

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Ovesen L. Adolescence: a critical period for long-term tracking of risk for coronary heart disease? Ann Nutr Metab. (2006) 50:317. doi: 10.1159/000094294

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Andersen LB, Haraldsdóttir J. Tracking of cardiovascular disease risk factors including maximal oxygen uptake and physical activity from late teenage to adulthood: an 8-year follow-up study. J Intern Med. (1993) 234:309–15. doi: 10.1111/j.1365-2796.1993.tb00748.x

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Andersen LB, Hasselstrøm H, Grønfeldt V, Hansen SE, Karsten F. The relationship between physical fitness and clustered risk, and tracking of clustered risk from adolescence to young adulthood: 8 years follow-up in the Danish Youth and Sport Study. Int J Behav Nutr Phys Act. (2004) 1:6. doi: 10.1186/1479-5868-1-6

CrossRef Full Text | Google Scholar

17. Raitakari OT, Porkka KV, Räsänen L, Rönnemaa T, Viikari JS. Clustering and 6 year cluster-tracking of serum total cholesterol, HDL-cholesterol and diastolic blood pressure in children and young adults the cardiovascular risk in young finns study. J Clin Epidemiol. (1994) 47:1085–93. doi: 10.1016/0895-4356(94)90094-9

CrossRef Full Text | Google Scholar

18. Bao W, Srinivasan SR, Wattigney WA, Berenson GS. Persistence of multiple cardiovascular risk clustering related to syndrome X from childhood to young adulthood: the Bogalusa Heart Study. Archiv Intern Med. (1994) 154:1842–7. doi: 10.1001/archinte.154.16.1842

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Eisenmann JC, Welk GJ, Wickel EE, Blair SN. Stability of variables associated with the metabolic syndrome from adolescence to adulthood: the Aerobics Center Longitudinal Study. Am J Hum Biol. (2004) 16:690–6. doi: 10.1002/ajhb.20079

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Hazreen MA, Su TT, Jalaludin MY, Dahlui M, Chinna K, Ismail M, et al. An exploratory study on risk factors for chronic non-communicable diseases among adolescents in Malaysia: overview of the Malaysian Health and Adolescents Longitudinal Research Team study (The MyHeART study). BMC Publ Health. (2014) 14(Suppl.3):S6. doi: 10.1186/1471-2458-14-S3-S6

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Fewtrell MS, Kennedy K, Singhal A, Martin RM, Ness A, Hadders-Algra M, et al. How much loss to follow-up is acceptable in long-term randomised trials and prospective studies? Archiv Dis Childhood. (2008) 93:458–61. doi: 10.1136/adc.2007.127316

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Sim PY, Su TT, Abd Majid H, Nahar AM, Jalaludin MY. A comparison study of portable foot-to-foot bioelectrical impedance scale to measure body fat percentage in Asian adults and children. BioMed Res Int. (2014) 2014:475659. doi: 10.1155/2014/475659

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Bel-Serrat S, Mouratidou T, Santaliestra-Pasías AM, Iacoviello L, Kourides YA, Marild S, et al. Clustering of multiple lifestyle behaviours and its association to cardiovascular risk factors in children: the IDEFICS study. Eur J Clin Nutr. (2013) 67:848. doi: 10.1038/ejcn.2013.84

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Anderberg MR. Cluster Analysis for Applications: Probability and Mathematical Statistics: A series of Monographs and Textbooks. Vol. 19. New York, NY: Academic press (2014).

Google Scholar

25. Sarstedt M, Mooi E. A Concise Guide to Market Research. The Process, Data, and Methods Using IBM SPSS Statistics. Berlin; Heidelberg: Springer-Verlag (2014). doi: 10.1007/978-3-642-53965-7_2

CrossRef Full Text | Google Scholar

26. Blashfield RK, Aldenderfer MS. The methods and problems of cluster analysis. In: Nesselroade JR, Cattell RB, editors. Handbook of Multivariate Experimental Psychology. Boston, MA: Springer (1988). p. 447–73. doi: 10.1007/978-1-4613-0893-5_14

CrossRef Full Text | Google Scholar

27. Milligan GW, Cooper MC. Methodology review: clustering methods. Appl Psychol Measure. (1987) 11:329–54. doi: 10.1177/014662168701100401

CrossRef Full Text | Google Scholar

28. Tinsley HE, Brown SD. Handbook of Applied Multivariate Statistics and Mathematical Modeling. San Diego, CA: Academic press (2000). doi: 10.1016/B978-012691360-6/50002-1

CrossRef Full Text | Google Scholar

29. Ottevaere C, Huybrechts I, Benser J, De Bourdeaudhuij I, Cuenca-Garcia M, Dallongeville J, et al. Clustering patterns of physical activity, sedentary and dietary behavior among European adolescents: the HELENA study. BMC Public Health. (2011) 11:328. doi: 10.1186/1471-2458-11-328

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Sabbe D, De Bourdeaudhuij I, Legiest E, Maes L. A cluster-analytical approach towards physical activity and eating habits among 10-years-old children. Health Edu Res. (2007) 23:753–62. doi: 10.1093/her/cyl135

CrossRef Full Text | Google Scholar

31. Viera AJ, Garrett JM. Understanding interobserver agreement: the kappa statistic. Fam Med. (2005) 37:360–3.

PubMed Abstract | Google Scholar

32. Robson P, Gallagher A, Livingstone M, Cran G, Strain J, Savage J, et al. Tracking of nutrient intakes in adolescence: the experiences of the Young Hearts Project, Northern Ireland. Br J Nutr. (2000) 84:541–8. doi: 10.1017/S0007114500001859

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Rampal L, Rampal S, Azhar M, Rahman A. Prevalence, awareness, treatment and control of hypertension in Malaysia: a national study of 16,440 subjects. Publ Health. (2008) 122:11–18. doi: 10.1016/j.puhe.2007.05.008

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Bao W, Threefoot SA, Srinivasan SR, Berenson GS. Essential hypertension predicted by tracking of elevated blood pressure from childhood to adulthood: the Bogalusa Heart Study. Am J Hypertens. (1995) 8:657–65. doi: 10.1016/0895-7061(95)00116-7

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Pagnan A, Ambrosio GB, Vincenzi M, Mormino Maiolino Gerin L, et al. Precursors of atherosclerosis in children: the Cittadella study. Follow-up and tracking of total serum cholesterol, triglycerides, and blood glucose. Prev Med. (1982) 11:381–90. doi: 10.1016/0091-7435(82)90042-1

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Viikari J, Rönnemaa T, Seppänen A, Mamiemi J, Porkka K, Räsänen L, et al. Serum lipids and lipoproteins in children, adolescents and young adults in 1980–1986. Ann Med. (1991) 23:53–9. doi: 10.3109/07853899109147931

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Garnett SP, Baur LA, Srinivasan S, Lee JW, Cowell CT. Body mass index and waist circumference in midchildhood and adverse cardiovascular disease risk clustering in adolescence. Am J Clin Nutr. (2007) 86:549–55. doi: 10.1093/ajcn/86.3.549

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Kvaavik E, Tell GS, Klepp KI. Predictors and tracking of body mass index from adolescence into adulthood: follow-up of 18 to 20 years in the Oslo Youth Study. Archiv Pediatr Adoles Med. (2003) 157:1212–8. doi: 10.1001/archpedi.157.12.1212

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Smoak CG, Burke GL, Webber LS, Harsha DW, Srinivasan SR, Berenson GS. Relation of obesity to clustering of cardiovascular disease risk factors in children and young adults: the Bogalusa Heart Study. Am J Epidemiol. (1987) 125:364–72. doi: 10.1093/oxfordjournals.aje.a114543

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Raitakan OT, Porkka KVK, Taimela S, Telama R, Räsänen L, Vllkari JS. Effects of persistent physical activity and inactivity on coronary risk factors in children and young adults the cardiovascular risk in young Finns study. Am J Epidemiol. (1994) 140:195–205. doi: 10.1093/oxfordjournals.aje.a117239

CrossRef Full Text | Google Scholar

41. Janz KF, Dawson JD, Mahoney LT. Tracking physical fitness and physical activity from childhood to adolescence: the Muscatine study. Med Sci Sports Exercise. (2000) 32:1250–7. doi: 10.1097/00005768-200007000-00011

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Katzmarzyk PT, Pérusse L, Malina RM, Bergeron J, Després J, Bouchard C. Stability of indicators of the metabolic syndrome from childhood and adolescence to young adulthood: the Quebec Family Study. J Clin Epidemiol. (2001) 54:190–5. doi: 10.1016/S0895-4356(00)00315-2

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Cook S, Weitzman M, Auinger Nguyen M, Dietz WH. Prevalence of a metabolic syndrome phenotype in adolescents: findings from the third National Health and Nutrition Examination Survey, 1988-1994. Archiv Pediatr Adoles Med. (2003) 157:821–7. doi: 10.1001/archpedi.157.8.821

PubMed Abstract | CrossRef Full Text | Google Scholar

44. Simeunović S, Nedeljković S, Milinčić Z, Vukotić M, Novaković I, Nikolić D. Anthropometric and lipid parameters trends in school children: one decade of YUSAD Study. Srp Arh Celok Lek. (2011) 139:465–9. doi: 10.2298/SARH1108465S

PubMed Abstract | CrossRef Full Text | Google Scholar

45. DiClemente RJ, Santelli JS, Crosby RA. Adolescent Health: Understanding and Preventing Risk Behaviors. San Francisco, CA: John Wiley & Sons (2009).

Google Scholar

46. Zalilah MS, Khor G, Mirnalini K, Norimah A, Ang M. Dietary intake, physical activity and energy expenditure of Malaysian adolescents. Singapore Med J. (2006) 47:491.

PubMed Abstract | Google Scholar

47. Majid HA, Ramli L, Ying S, Su TT, Jalaludin MY, Mohsein NASA. Dietary intake among adolescents in a middle-income country: an outcome from the malaysian health and adolescents longitudinal research team study (the MyHeARTs Study). PLoS ONE. (2016) 11:e0155447. doi: 10.1371/journal.pone.0155447

PubMed Abstract | CrossRef Full Text | Google Scholar

48. Mohammadi S, Jalaludin MY, Su TT, Dahlui M, Mohamed MNA, Majid HA. Dietary and physical activity patterns related to cardio-metabolic health among Malaysian adolescents: a systematic review. BMC Public Health. (2019) 19:251. doi: 10.1186/s12889-019-6557-z

PubMed Abstract | CrossRef Full Text | Google Scholar

49. DŽelajlija DD, Spasić SS, Kotur-Stevuljevic JM, Bogavac-Stanojevic NB. Cardiovascular risk factors in 7–13 years old children from Vojvodina (Serbia). J Med Biochem. (2016) 35:293–301. doi: 10.1515/jomb-2016-0006

PubMed Abstract | CrossRef Full Text | Google Scholar

50. Gu D, Gupta A, Muntner Hu S, Duan X, Chen J, et al. Prevalence of cardiovascular disease risk factor clustering among the adult population of China: results from the International Collaborative Study of Cardiovascular Disease in Asia (InterAsia). Circulation. (2005) 112:658–65. doi: 10.1161/CIRCULATIONAHA.104.515072

PubMed Abstract | CrossRef Full Text | Google Scholar

51. Twisk JW, Boreham C, Cran G, Savage JM, Strain J, van Mechelen W. Clustering of biological risk factors for cardiovascular disease and the longitudinal relationship with lifestyle of an adolescent population: the Northern Ireland Young Hearts Project. J Cardiovasc Risk. (1999) 6:355–62. doi: 10.1177/204748739900600601

PubMed Abstract | CrossRef Full Text | Google Scholar

52. Steinley D. K-means clustering: a half-century synthesis. Br J Math Statist Psychol. (2006) 59:1–34. doi: 10.1348/000711005X48266h

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: clustering, tracking, risk factors, cardiovascular diseases (CVD), adolescents

Citation: Thangiah N, Chinna K, Su TT, Jalaludin MY, Al-Sadat N and Majid HA (2020) Clustering and Tracking the Stability of Biological CVD Risk Factors in Adolescents: The Malaysian Health and Adolescents Longitudinal Research Team Study (MyHeARTs). Front. Public Health 8:69. doi: 10.3389/fpubh.2020.00069

Received: 15 December 2019; Accepted: 21 February 2020;
Published: 17 March 2020.

Edited by:

Hatim A. Omar, University of Kentucky, United States

Reviewed by:

Stephanie Jean Stockburger, University of Kentucky, United States
Lokesh Tiwari, All India Institute of Medical Sciences (Patna), India

Copyright © 2020 Thangiah, Chinna, Su, Jalaludin, Al-Sadat and Majid. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Hazreen Abdul Majid, hazreen@ummc.edu.my

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.