- Genoplan Korea, Seoul, Republic of Korea
Epigenetic clocks have been developed to track both chronological age and biological age, which is defined by physiological biomarkers and the risk of adverse health outcomes. Epigenetic age acceleration (EAA) has been found to predict various diseases, aging-related factors, and mortality. However, epigenetic clocks have predominantly been developed with individuals of European or Hispanic ancestry, and their association with health outcomes and environmental factors has not been sufficiently assessed in East Asian populations. Here, we investigated nine epigenetic clocks: five trained on chronological age (first-generation) and four on biological age (second-generation), using DNA methylation data from blood samples of South Koreans. EAAs of second-generation epigenetic clocks reflected the risk of chronic diseases (type 2 diabetes and hypertension), levels of health-related blood markers (alanine aminotransferase, aspartate aminotransferase, high density lipoprotein, triglyceride, and high sensitivity C-reactive protein), and lung functions (percentage of predicted FEV1 and percentage of predicted FVC), while EAAs of first generation clocks did not. Using follow-up data, we also found that EAAs of second-generation clocks were associated with the time to onset risks of chronic diseases. Health behavior factors (drinking, smoking, exercise, body mass index, and waist-hip ratio), socioeconomic status (income level and educational attainment), and psychosocial status were associated with EAAs of second-generation clocks, while only smoking status was associated with EAAs of first-generation clocks. We conducted validation analyses in an independent South Korean cohort and replicated the association of EAAs with health outcomes and environmental factors. Age acceleration of epigenetic clocks is influenced by various environmental factors and can serve as an effective predictor of health in South Korea.
Introduction
Over the past decade, there has been a substantial increase in the development and application of epigenetic predictors in healthcare research (Yousefi et al., 2022). DNA methylation changes dynamically in response to a variety of exogenous and endogenous factors, and can be used to estimate age through predictors called “epigenetic clocks” (Horvath and Raj, 2018). Epigenetic age acceleration (EAA), typically derived by regressing the epigenetic age of clocks on chronological age, assesses whether individuals are aging faster or slower than their chronological age. EAA provides valuable insights into health outcomes, including morbidity and mortality (Horvath and Raj, 2018; Jain et al., 2022), and has been linked to health behaviors such as body mass index (BMI), alcohol consumption, smoking, and physical activity (Quach et al., 2017; Ryan et al., 2020).
First-generation epigenetic clocks, exemplified by Horvath’s clock and Hannum’s clock (Horvath, 2013; Hannum et al., 2013), were trained on chronological age and used to estimate chronological age (Horvath, 2013; Hannum et al., 2013; Bernabeu et al., 2023; Zhang et al., 2019). Second-generation epigenetic clocks have been developed to track biological age based on diverse age-related metrics, encompassing clinical biomarkers (PhenoAge) (Levine et al., 2018), pace of aging (DunedinPACE) (Belsky et al., 2022), and all-cause mortality (GrimAge and Zhang Y’s clock) (Lu et al., 2019; Zhang et al., 2017). Because they focus on different parts of aging, “second-generation” clocks outperform “first-generation” clocks in predicting both life span and health span (Zhang et al., 2019; Levine et al., 2018; Bell et al., 2019; Faul et al., 2023). Recently, principal component (PC)-based clocks have been created to mitigate noise and unreliability from individual CpG sites using PCs (Higgins-Chen et al., 2022).
With the increasing number of epigenetic clocks, various studies have compared these clocks with each other and have identified associations with health indicators and mortality. For example, Faul et al. (2023) and Crimmins et al. (2021) utilized epigenetic clocks in US adults and identified that second-generation clocks are linked to health behaviors, morbidity, and mortality. McCrory et al. (2021) and Maddock et al. (2020) demonstrated an association of PhenoAge and GrimAge with health span in British population, while Lu et al. (2019) found a predictive utility of GrimAge for morbidity and mortality in a cohort comprising individuals of European, American, and Hispanic ancestry.
Most epigenetic clocks have been primarily developed using data from European, African, or Hispanic individuals (Horvath, 2013; Hannum et al., 2013; Bernabeu et al., 2023; Zhang et al., 2019; Levine et al., 2018; Belsky et al., 2022; Lu et al., 2019; Zhang et al., 2017). Studies investigating their effects on health outcomes or the influence of environmental factors have also predominantly focused on these ancestry groups (Faul et al., 2023; Crimmins et al., 2021; McCrory et al., 2021; Maddock et al., 2020). However, notable differences exist in epigenetic clocks among various ethnic groups (Crimmins et al., 2021; Horvath et al., 2016), and very few studies have investigated their application in the East Asian population.
To assess the performance of epigenetic clocks in the East Asian population, we examined the association of various epigenetic clocks with health outcomes and environmental factors in 1,925 South Korean samples. We employed five “first-generation” epigenetic clocks [PCHorvath (Horvath et al., 2018), PCHannum (Hannum et al., 2013), ZhangQ (Zhang et al., 2019), Bernabeu (Bernabeu et al., 2023), and iCAS-DNAmAge (Zheng et al., 2024)], along with four “second-generation” epigenetic clocks [PCPhenoAge (Levine et al., 2018), PCGrimAge (Lu et al., 2019), DunedinPACE (Belsky et al., 2022), and ZhangY (Zhang et al., 2017)]. We explored the association of EAAs from each clock with health outcomes, health behaviors, and the time to onset of chronic diseases, and confirmed these findings in an independent South Korean cohort. Our findings highlight the utility of epigenetic clocks in predicting adverse health outcomes in East Asians.
Results
Basic characteristics of participants
For the analysis, we used data from Korea Association Resource (KARE) of the Korean Genome and Epidemiology Study (KoGES) (Kim et al., 2017). The basic characteristics of the 1,925 KARE participants are shown in Table 1. The chronological age ranged from 47 to 78 years, with 1,006 (52.3%) males and 919 (47.7%) females. Except for the psychosocial wellbeing index (PWI) and hypertension, significant mean differences in characteristics were observed between males and females (t-test p-value < 0.05). Females generally exhibited better health across various indicators such as T2D prevalence and blood marker levels compared to males. Females also demonstrated healthier behavior, including a lower waist-hip ratio (WHR), less drinking and smoking, and regular exercise, but their BMI tended to be higher than males.
We calculated epigenetic ages from eight epigenetic clocks, comprising five first-generation clocks [Bernabeu (Bernabeu et al., 2023), iCAS-DNAmAge (Zheng et al., 2024), ZhangQ (Zhang et al., 2019), PCHannum (Hannum et al., 2013), and PCHorvath (Horvath et al., 2018)] and four second-generation clocks [PCGrimAge (Lu et al., 2019), PCPhenoAge (Levine et al., 2018), DunedinPACE (Belsky et al., 2022), and ZhangY (Zhang et al., 2017)]. The accuracy of age prediction was assessed through Pearson correlation analysis between the epigenetic clocks and chronological age (Supplementary Table S1). All epigenetic clocks showed correlations with chronological age, with the Bernabeu clock displaying the highest predictive accuracy (r = 0.96). Although iCAS-DNAmAge was developed in East Asians, it showed a relatively low correlation (r = 0.79). Additionally, the epigenetic ages of each clock were strongly correlated with each other (Supplementary Figure S1). Bernabeu and ZhangQ exhibited the strongest correlation between them (r = 0.98), despite the substantial differences in the CpG sites used.
EAAs of each clock were generated by regressing epigenetic age on chronological age. Except EAA of iCAS-DNAmAge, all EAAs exhibited correlations among themselves (Figure 1), and showing relatively strong correlations (0.39–0.82) observed within the same generation and comparatively weaker correlations (−0.1–0.77) observed between different generations. The strongest correlation was observed between PCHorvathEAA and PCHannumEAA (r = 0.82), while iCAS-DNAmAgeEAA showed the no correlation with PCGrimAgeEAA (r = 0) and a negative correlation with ZhangY. Males exhibited a significantly accelerated aging rate compared to females across all epigenetic clocks, except iCAS-DNAmAge (t-test p-value < 0.05, Supplementary Table S2).
Figure 1. Correlation matrix of epigenetic age accelerations. Positive correlations are denoted by red shades, with lighter shades signifying weaker correlation values.
Epigenetic age acceleration and health outcomes
To explore which EAAs of epigenetic clocks reflect health outcomes, we conducted regression analyses adjusting for covariates: age, sex, and 10PCs, which were included to account for noise in individual CpG sites. We estimated the effects of EAAs on various health outcomes, including T2D, hypertension, alanine aminotransferase (ALT), aspartate aminotransferase (AST), high density lipoprotein (HDL), triglyceride (TG), and high sensitivity C-reactive protein (hs-CRP), percentage of predicted FEV1 (FEV1% PRED), and percentage of predicted FVC (FVC% PRED). The significance of the association results was determined using a p-value threshold adjusted for multiple testing via Bonferroni correction [p-value < 0.05/(nine health outcomes × number of clocks) = 1.11E-03 for first-generation clocks and 1.39E-03 for second-generation clocks].
None of the EAAs from the first-generation epigenetic clocks were significantly associated with any diseases or health indicators (p-value > 1.11E-03, Supplementary Table S3). Only some EAAs showed a weak positive association with the risk of T2D and hypertension, as well as with an increase in AST levels, satisfying the uncorrected p-value threshold [Odds ratio (OR) > 1 or Beta >0, p-value < 0.05]. On the other hand, EAAs of the second-generation clocks showed significant associations (p-value < 1.39E-03), suggesting that higher EAAs have a detrimental effect on health (Figure 2). The faster aging in all second-generation epigenetic clocks was associated with an increased risk of T2D. In addition, higher DunedinPACE was associated with a higher risk of hypertension (OR > 1), as well as elevated levels of AST, ALT, TG, and hs-CRP (Beta > 0). Conversely, it was associated with decreased levels of HDL, FEV1% PRED, and FVC% PRED (Beta < 0). PCGrimAgeEAA and ZhangY showed a positive association with levels of TG, and a negative association with FEV1% PRED. Higher PCPhenoAgeEAA indicated a significant association with an increased risk of hypertension and elevated AST levels.
Figure 2. Forest plots for epigenetic age accelerations (EAAs) of second-generation clocks and health outcomes in KARE. Odds ratios (ORs) with 95% confidence intervals (CIs) or beta values with standard errors (SEs) were displayed along with their corresponding p-values. All regression models adjusted for age, sex, and 10 principal components, with all EAAs scaled to a mean of 0 and a standard deviation of 1. Significant results were highlighted with a blue background based on Bonferroni-corrected p-values [0.05/(4 × 9) = 1.39E-03].
Since EAAs of epigenetic clocks were associated with the risk of T2D and hypertension, we further investigated their association with time to onset. Using follow-up data, we conducted analyses with 123 cases of T2D and 721 controls, as well as 277 cases of hypertension and 494 controls. In Kaplan-Meier curves, each EAA was categorized into two groups: age deceleration and age acceleration. DunedinPACE and ZhangY were classified into age acceleration and deceleration groups based on their top 50% and bottom 50% (Supplementary Figure S2). The EAAs from the first-generation epigenetic clocks did not exhibit any association with the onset of T2D and hypertension (p-value > 0.05). Among the second-generation clocks, DunedinPACE showed a significant association with the onset of T2D (p-value < 0.05/(Two diseases × number of clocks) = 5E-03 for first-generation clocks and 6.25E-03 for second-generation clocks), while PCGrimAgeEAA, DunedinPACE, and ZhangY satisfied uncorrected p-value (p-value < 0.05) in their association with the onset of hypertension. For Cox regression analysis, EAAs from first-generation clocks did not show an association with the onset of T2D (p-value > 6.25E-3, Table 2). iCAS-DNAmAge is significantly associated with the development of hypertension, but its acceleration was associated with a reduced risk of hypertension onset (Hazard ratio < 1). In contrast, higher values of DunedinPACE, PCGrimAgeEAA, and ZhangY were significantly associated with a higher risk of T2D onset (Hazard ratio > 1, p-value < 6.25E-03). Increased values of DunedinPACE and ZhangY were associated with an increased risk of hypertension onset as well.
Table 2. Cox regression analysis of epigenetic age acceleration for onset of type 2 diabetes and hypertension.
Epigenetic age acceleration and environmental factors
To investigate the impact of environmental factors on the aging of epigenetic clocks, we conducted regression analyses adjusting for covariates: age, sex, and 10PCs. We estimated the effects of nine factors on EAAs: BMI, WHR, income level, educational attainment, PWI, drinking status, smoking status, pack-years, and regular exercise. Significant factors were identified using a p-value threshold adjusted for Bonferroni correction [p-value < 0.05/(nine environmental factors × number of clocks) = 1.11E-03 for first-generation clocks and 1.39E-03 for second-generation clocks].
Most environmental factors did not impact the acceleration of first-generation clocks (Supplementary Table S4). Only smoking had a significant effect on the acceleration of PCHorvath and PCHannum (p-value < 1.11E-03). In contrast, EAAs of second-generation clocks were influenced by various environmental factors (Figure 3). Increases in WHR and smoking were associated with accelerated aging across all second-generation clocks (Beta > 0, p-value < 1.39E-03). Higher socioeconomic status (income level and educational attainment) slowed down the aging of PCGrimAge and ZhangY, whereas drinking and an increase in pack-years sped up their aging. PCGrimAge was also decelerated by regular exercise and good psychosocial status (PWI), while DunedinPACE increased with higher BMI and an increase in pack-years, as well as poor psychosocial status. As depicted in Figure 3; Supplementary Table S3, when environmental factors significantly influenced the aging of epigenetic clocks, they affected them in the same direction.
Figure 3. Forest plots for epigenetic age accelerations (EAAs) of second-generation clocks and environmental factors in KARE. Beta values with standard errors (SEs) were displayed along with their corresponding p-values. All regression models adjusted for age, sex, and 10 principal components, with all EAAs scaled to a mean of 0 and a standard deviation of 1. Significant results were highlighted with a blue background based on Bonferroni-corrected p-values [0.05/(4 × 9) = 1.39E-03].
Validation in independent Korean samples
We conducted validation studies on the association of EAAs in an independent Korean cohort, Health Examinees (HEXA) of the KoGES (Kim et al., 2017; Health Examinees Study Group, 2015). The basic characteristics of the 822 HEXA participants were presented in Supplementary Table S5. We derived epigenetic ages using the same eight epigenetic clocks and subsequently calculated EAAs based on them. Descriptive statistics are presented in Supplementary Table S6; Supplementary Figure S3. Females exhibited a significantly decelerated aging rate compared to males across epigenetic clocks (t-test p-value < 0.05), except Bernabeu (p-value = 5.01E-02, Supplementary Table S6).
Consistent with the previous findings (Supplementary Table S3), EAAs from the first-generation epigenetic clocks did not exhibit significant associations with any health outcomes in HEXA (p-value > 0.05/(five EAAs × seven health outcomes) = 1.43E-03, Supplementary Table S7). Similarly, associations between EAAs of the second-generation clocks and health outcomes demonstrated consistent effect directions in HEXA (p-value < 0.05/(four EAAs × seven health outcomes) = 1.79E-03, Figure 2; Table 3). Accelerated aging in all second-generation clocks was consistently associated with an increased risk of T2D. PCGrimAgeEAA and ZhangY were significantly associated with TG levels, while DunedinPACE showed significant associations with hypertension, AST, ALT, HDL, TG, and hs-CRP. Among the previously significant findings, only the association between PCPhenoAge and AST did not reach the corrected p-value threshold in HEXA (p-value = 4.37E-03). Despite conducting analyses on time to disease onset using follow-up data, no statistically significant results were identified (Supplementary Figure S4; Supplementary Table S8).
While EAAs from the first-generation clock showed a significant association solely with smoking status previously (Supplementary Table S4), no significant associations with any environmental factors were observed in HEXA [p-value > 0.05/(five EAAs × eight environmental factors) = 1.25E-03, Supplementary Table S9]. In contrast, the associations between EAAs of the second-generation clocks and environmental factors were validated in HEXA (Table 3). Smoking consistently sped up the aging of all second-generation clocks, whereas higher socioeconomic status slowed it down. DunedinPACE increased with higher BMI and WHR but decreased with regular exercise. Higher WHR and drinking accelerated the aging of PCGrimAge and ZhangY, while regular exercise decelerated the aging of PCGrimAge and PCPhenoAge. PWI was the only factor that did not show a validated significant association with EAAs of second-generation clocks.
Effects of epigenetic age acceleration on health outcomes independent of lifestyle factors
We identified that the EAA of second-generation clocks affects health outcomes and is influenced by environmental factors. Since environmental factors affect both health outcomes and epigenetic age, we performed regression analyses with additional adjustments for lifestyle factors, such as body mass index, waist-hip ratio, smoking status, drinking status, and regular exercise, to evaluate the independent effects of EAAs on health outcomes (Table 4). Even after accounting for these factors, the detrimental effect of higher EAA on health outcomes remained significant in KARE (p-value < 0.05). While the statistical significance and effect sizes of EAAs on health outcomes were diminished in most cases, the impact of PCGrimAge on the risk of T2D became slightly stronger (OR 1.14 to 1.16, p-value 6.91E-15 to 1.33E-15). In HEXA, the influence of EAAs on health outcomes was maintained (p-value < 0.05), except for DunedinPACE on hypertension and AST (p-value > 0.05). Similar to KARE, the statistical significance and effect sizes of EAAs were reduced across all cases.
Effects of epigenetic clocks developed in East Asian ancestry
To identify the effects of clocks developed in same ancestry, we calculated three epigenetic clocks, PCHorvath, PCHannum, and PCPhenoAge, trained on a Japanese sample using the Transfer Elastic Net method (Tomo and Nakaki, 2024). In both KARE and HEXA, these three clocks showed correlations with the original clocks (Supplementary Figure S5) and demonstrated stronger correlations with chronological age compared to the original clocks (Supplementary Tables S1, S6, S10). Regarding associations with health outcomes, we observed slightly more significant p-values, such as the effects of PCPhenoAgeEAA on T2D, hypertension, AST, ALT, and FVC% PRED in the KARE cohort (Figure 2; Supplementary Table S11). Conversely, p-values for associations between PCPhenoAgeEAA and most health outcomes were less significant in HEXA (Table 3; Supplementary Table S11). In terms of environmental factors (Supplementary Table S12), a significant association between PCHannumEAA and BMI was observed in both KARE and HEXA, which was not evident with the original clocks. Furthermore, while the association between WHR and PCPhenoAgeEAA became more significant, other associations, such as with smoking status in KARE and with income, education, and regular exercise in HEXA, showed reduced significance. No significant results were observed in the survival analysis, consistent with the findings for the original clock (Supplementary Figure S6; Supplementary Table S13).
Discussion
Recently, the utilization of epigenetic predictors in healthcare research has experienced remarkable growth. We explored the impact of EAAs on disease risk and health indicators, as well as the influence of various environmental factors on the aging of epigenetic clocks. In the current study, we found that faster aging of second-generation epigenetic clocks was significantly associated with adverse health outcomes, including increased risk of chronic diseases, variations in blood markers levels, and reduced lung functions. Moreover, acceleration of these clocks was associated with the onset of chronic diseases in follow-up data. Unhealthy lifestyles, such as smoking, drinking, high BMI and WHR, and poor psychosocial status, were identified as factors that accelerated the aging of second-generation clocks. Conversely, favorable socioeconomic status and regular exercise were associated with slowed aging. Furthermore, we identified the effects of epigenetic age acceleration on health outcomes, independent of lifestyle factors.
We utilized five first-generation and four second-generation epigenetic clocks. Although these clocks were developed outside of East Asian populations, they exhibited a strong correlation with chronological age and associations with health and lifestyle factors, and, as is well known, they accelerated more in men than in women (Horvath et al., 2016). In contrast, iCAS-DNAmAge, the only epigenetic clock developed using data from East Asian populations, displayed a relatively weak correlation with chronological age and EAA of other clocks (Supplementary Table S1; Figure 1), while accelerating more in women than in men (Supplementary Table S2). Additionally, iCAS-DNAmAge is significantly associated with the time to onset risk of hypertension; however, its acceleration appears to reduce this risk (HR < 1). The three epigenetic clocks trained on an East Asian population showed stronger associations with certain health outcomes compared to the original clocks (Supplementary Tables S11, S12). Our findings suggest that epigenetic clocks developed for different ancestries can be valuable for individuals of East Asian ancestry. Additionally, clocks specifically trained for East Asian populations could enhance predictive performance, and second-generation epigenetic clocks are expected to be more efficient in predicting health outcomes than first-generation clocks.
Epigenetic clocks have only recently been developed, leaving much to be understood from a mechanistic perspective. Biological age is a multifaceted phenomenon, influenced by accumulated cellular damage, environmental exposures, genomic instability, and epigenetic alterations (López-Otín et al., 2023; Zhang et al., 2023). Epigenetic clocks derived from DNA methylation integrate multiple independent mechanisms of age-related change, thus each epigenetic clock reflects different mechanisms (Levine et al., 2022). Moreover, second-generation clocks are trained using a range of health indicators rather than solely age. For instance, PhenoAge captures organismal age and the functional state of many organ systems and tissues (Levine et al., 2018), while DunedinPACE measures the pace of aging (Belsky et al., 2022). PCGrimAge and ZhangY exhibit all-cause mortality, although ZhangY uses CpGs that are not related to aging (Lu et al., 2019; Zhang et al., 2017). These factors can influence our results that each clock exhibits varying performance as a health predictor and is affected differently by diverse environmental factors.
As observed in previous studies (Zhang et al., 2019; Levine et al., 2018; Bell et al., 2019; Faul et al., 2023), second-generation epigenetic clocks exhibited higher accuracy in predicting health outcomes compared to first-generation clocks. Furthermore, accelerated aging of second-generation clocks was associated with increased morbidity rates and shifts in biomarker levels indicating deteriorating health conditions, which in turn correlate with higher mortality rates. For example, T2D and hypertension are common comorbidities and age-related chronic metabolic diseases linked with elevated mortality rates (Petrie et al., 2018; Aune et al., 2021; Rockl et al., 2017). AST and ALT are well-known biomarkers used to indicate liver injury, while TG and hs-CRP are well-established risk factors for cardiovascular disease. Elevated levels of these biomarkers have been shown to increase mortality risk (Chen et al., 2022; Liu et al., 2013; Li et al., 2017). HDL, commonly known as “good cholesterol,” is linked to higher mortality risk at both low and high levels (Zhong et al., 2020). FEV1 and FVC are main results of pulmonary function tests, and it is known that low predicted percentages of these values are associated with higher mortality rates (Magnussen et al., 2017). Therefore, our findings suggest that second-generation clocks could serve as predictors of mortality in East Asian populations.
The environmental factors examined in our study have been reported to influence the aging of epigenetic age. Consistent with our findings, high BMI and WHR accelerate this process (Lin et al., 2021), as does smoking and drinking (Ryan et al., 2020), while regular exercise decelerates it (Quach et al., 2017). Poor socioeconomic and psychosocial status also speeds up epigenetic aging (Schmitz et al., 2022; Palma-Gudiel et al., 2020). In addition, adopting a healthier lifestyle, such as regular exercise (Fitzgerald et al., 2021), smoke cessation (Wu et al., 2019), and losing weight (Yaskolka et al., 2021), is known to reverse epigenetic aging. Likewise, changing other environmental factors associated with EAA to healthier options could not only decelerate epigenetic aging but also potentially make one biologically younger.
Overall, this study underscores the utility of epigenetic clocks in the East Asian population. We evaluated different epigenetic clocks and confirmed that the epigenetic age from second-generation clocks provide valuable health predictions, while also suggesting potential for slowing aging through healthier lifestyles. Although we validated our results using an independent cohort, certain aspects remain unvalidated. In the validation cohort, lung function and pack-years could not be analyzed, and the impact of psychosocial status on EAAs did not reach statistical significance (p-value > 0.05). Additionally, the impact of EAAs on disease onset could not be confirmed due to insufficient follow-up cases for T2D (six cases) and failure to meet the statistical threshold in the hypertension analysis. Further investigations with larger sample sizes may clarify the association between them in East Asian populations. Aging is represented by various indicators, and our investigation has shown that each epigenetic clock is linked to different indicators, necessitating a comprehensive interpretation that incorporates multiple epigenetic clocks.
Methods
Study population
The present study utilized two independent, community-based genomic cohort datasets from the KoGES (Kim et al., 2017). The first cohort, KARE, consisted of 10,030 individuals aged 40–69 years residing in the urban area of Ansan and the rural area of Ansung. These participants were initially recruited between 2001 and 2002 and underwent biennial follow-up examinations until 2010. For the validation of the results from KARE, we employed the second cohort, HEXA (Health Examinees Study Group, 2015), which comprised 65,642 participants from urban areas. These individuals were recruited between 2004 and 2013 at 38 hospitals and local health-screening centers, following standardized procedures. Epidemiological data for both cohorts were provided by the Korea Centers for Disease Control and Prevention. Written informed consent for participation in the KoGES cohorts was obtained from all study participants and confirmed by the Institutional Review Board.
Phenotype definition
To classify the participants into T2D case and control groups, we used their responses to the questionnaire on T2D diagnostic history and fasting glucose levels. Participants who answered “Yes” to the questionnaire or had a fasting glucose level above 126 mg/dL were classified as the case group. Conversely, those who answered “No” to the questionnaire and had a fasting glucose level below 126 mg/dL were placed in the control group.
We classified participants into hypertension case and control groups based on their responses to the questionnaire on hypertension diagnosis and their systolic and diastolic blood pressure measurements (SBP and DBP). Participants who answered “Yes” to the questionnaire or had an SBP above 140 mmHg or DBP above 90 mmHg were classified into the case group. Those who answered “No” to the questionnaire and had an SBP below 140 mmHg and DBP below 90 mmHg were placed into the control group. The mean value of two measurements was used for both SBP and DBP.
For the survival analysis, we used follow-up data. Since methylation data from the KARE was generated for the 4th follow-up participants, we examined the data from the 5th to the 9th follow-up surveys. T2D and hypertension cases were classified based on upper criteria. The follow-up time for each participant was determined from the date of the 4th follow-up survey to the date of the follow-up survey at which the participant satisfied the classification criteria. The control groups for T2D and hypertension were defined as participants who met the control criteria in the follow-up data from the 4th to the 9th surveys. The methylation data for HEXA was collected at baseline, and the classification of case and control groups for T2D and hypertension was based on data from the first follow-up survey.
Income level was determined by the average monthly income of the family and divided into eight categories. Education attainment was categorized into nine groups, ranging from not attending school to completing graduate education. PWI comprises 18 items, including 11 positive wellbeing items and 7 negative feeling items such as pain, discomfort, anxiety, and depression. Each question is scored from 0 to 3 for stress level, and the total PWI score is the sum of these subscales. A higher PWI score indicates a higher level of psychosocial stress.
DNA methylation
Methylation data of KoGES were generated from genomic DNA extracted from the blood of the subjects. Genome-wide DNA methylation was profiled using Illumina Infinium Human Methylation 450k BeadChip which measured methylation approximately 450,000 CpG sites (450K), and Illumina Infinium Human Methylation EPIC BeadChip covers over 850,000 CpG sites (EPIC). Illumina intensity data (IDAT) files were processed using R package minfi (Aryee et al., 2014). We screened for samples where fewer than 95% of probes had a detection p-value < 0.01 and applied functional normalization preprocessing (preprocessIllumina function). The beta-value was computed by dividing the intensity of the methylated channel by the sum of the intensities from both the methylated and non-methylated probes. This value reflects the methylation level at each CpG site, ranging between 0 and 1, where higher methylation yields a value closer to 1. For KARE, methylation data from 1,925 subjects were used in the analysis, with 397 from 450K chip and 1,528 from EPIC chip. The data from the 450K and EPIC chips were combined into a single dataset, including only the overlapping CpG sites. Methylation data from 822 HEXA subjects, profiled using EPIC chip, were used in the validation analysis.
Since methylation data of KARE derived from two different chips, we applied the ComBat function from the R package sva to correct for batch effects (Leek et al., 2012). The Supplementary Figure S7 illustrates the pattern of changes in PC after correction.
Epigenetic clocks and epigenetic age acceleration
We employed a total of eight epigenetic clocks trained on blood samples. Most epigenetic clocks were estimated using R package dnaMethyAge from Github (https://github.com/yiluyucheng/dnaMethyAge) (Wang et al., 2024), with missing CpGs filled using median values from the reference dataset. We calculated four first-generation clocks: PCHorvath (Horvath et al., 2018), PCHannum (Hannum et al., 2013), ZhangQ (Zhang et al., 2019), and Bernabeu (Bernabeu et al., 2023), along with four second-generation clocks: PCPhenoAge (Levine et al., 2018), PCGrimAge (Lu et al., 2019), DunedinPACE (Belsky et al., 2022), and ZhangY (Zhang et al., 2017). We prioritized PC-based clocks whenever feasible to minimize noise and improve accuracy (Higgins-Chen et al., 2022). Additionally, iCAS-DNAmAge was calculated using the beta-values of 65 CpGs from the iCAS-DNAmAge paper (Zheng et al., 2024). For the calculation of epigenetic clocks trained using Transfer Elastic Net on Japanese data (Tomo and Nakaki, 2024), we transformed beta-values of CpGs through PCs according to the workflow implemented on Github (https://github.com/MorganLevineLab/PC-Clocks). We then estimated the three clocks trained on Japanese data (PCHorvath, PCHannum, and PCPhenoAge) using the coefficients available at another GitHub repository (https://github.com/t-yui/TransferENet-EpigeneticClock).
Based on the epigenetic age estimates from the clocks, we computed EAA to assess whether individuals are biologically younger or older than their chronological age. EAAs for the clocks were generated by calculating residuals after regressing epigenetic age on chronological age. Since DunedinPACE and ZhangY directly measure the pace of aging and all-cause mortality, respectively, they can be used as EAAs response variables.
Statistical analysis
Student’s t-tests, Pearson correlation analyses, and regression analyses were performed using basic packages of R version 4.2.3. To correct for batch effects, we generated 1 to 10 PCs for the methylation data using the prcomp function in R and adjusted for them in the regression analyses. We selected 1 to 10 PCs, identifying the point where the variance explained by the PCs flattened. Elbow curves of the PCs variance were visualized in Supplementary Figure S8. The Kaplan-Meier curves, scatter plots, forest plots, and visualization of the elbow point for PCs were created using the R package ggplot2. Cox regression analysis was conducted using the R package survival, and the correlation matrix was visualized using the R package ggcorrplot.
Data availability statement
Publicly available datasets were analyzed in this study. This data can be found here: https://nih.go.kr/ko/main/contents.do?menuNo=300563.
Ethics statement
The studies involving humans were approved by Korea National Institute for Bioethics Policy. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.
Author contributions
DK: Conceptualization, Data curation, Formal Analysis, Methodology, Writing–original draft. JK: Validation, Visualization, Writing–original draft. J-WK: Writing–original draft, Investigation. SK: Writing–original draft, Software. YL: Writing–original draft, Investigation. MC: Writing–original draft, Validation, Visualization. B-CL: Writing–review and editing.
Funding
The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.
Acknowledgments
This study was conducted using the resources of Korean Genome and Epidemiology Study (application code: 23102501-01-01).
Conflict of interest
Authors DK, JK, J-WK, SK, YL, MC, and B-CL were employed by Genoplan Korea.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fragi.2024.1493406/full#supplementary-material
References
Aryee, M. J., Jaffe, A. E., Corrada-Bravo, H., Ladd-Acosta, C., Feinberg, A. P., Hansen, K. D., et al. (2014). Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays. Bioinformatics 30 (10), 1363–1369. doi:10.1093/bioinformatics/btu049
Aune, D., Huang, W., Nie, J., and Wang, Y. (2021). Hypertension and the risk of all-cause and cause-specific mortality: an outcome-wide association study of 67 causes of death in the national health interview survey. Biomed. Res. Int. 2021, 9376134. doi:10.1155/2021/9376134
Bell, C. G., Lowe, R., Adams, P. D., Baccarelli, A. A., Beck, S., Bell, J. T., et al. (2019). DNA methylation aging clocks: challenges and recommendations. Genome Biol. 20 (1), 249. doi:10.1186/s13059-019-1824-y
Belsky, D. W., Caspi, A., Corcoran, D. L., Sugden, K., Poulton, R., Arseneault, L., et al. (2022). DunedinPACE, a DNA methylation biomarker of the pace of aging. Elife 11, e73420. doi:10.7554/eLife.73420
Bernabeu, E., McCartney, D. L., Gadd, D. A., Hillary, R. F., Lu, A. T., Murphy, L., et al. (2023). Refining epigenetic prediction of chronological and biological age. Genome Med. 15 (1), 12. doi:10.1186/s13073-023-01161-y
Chen, W., Wang, W., Zhou, L., Zhou, J., He, L., Li, J., et al. (2022). Elevated AST/ALT ratio is associated with all-cause mortality and cancer incident. J. Clin. Lab. Anal. 36 (5), e24356. doi:10.1002/jcla.24356
Crimmins, E. M., Thyagarajan, B., Levine, M. E., Weir, D. R., and Faul, J. (2021). Associations of age, sex, race/ethnicity, and education with 13 epigenetic clocks in a nationally representative U.S. Sample: the health and retirement study. J. Gerontol. A Biol. Sci. Med. Sci. 76 (6), 1117–1123. doi:10.1093/gerona/glab016
Faul, J. D., Kim, J. K., Levine, M. E., Thyagarajan, B., Weir, D. R., and Crimmins, E. M. (2023). Epigenetic-based age acceleration in a representative sample of older Americans: associations with aging-related morbidity and mortality. Proc. Natl. Acad. Sci. U. S. A. 120 (9), e2215840120. doi:10.1073/pnas.2215840120
Fitzgerald, K. N., Hodges, R., Hanes, D., Stack, E., Cheishvili, D., Szyf, M., et al. (2021). Potential reversal of epigenetic age using a diet and lifestyle intervention: a pilot randomized clinical trial. Aging (Albany NY) 13 (7), 9419–9432. doi:10.18632/aging.202913
Hannum, G., Guinney, J., Zhao, L., Zhang, L., Hughes, G., Sadda, S., et al. (2013). Genome-wide methylation profiles reveal quantitative views of human aging rates. Mol. Cell. 49 (2), 359–367. doi:10.1016/j.molcel.2012.10.016
Health Examinees Study Group (2015). The Health Examinees (HEXA) study: rationale, study design and baseline characteristics. Asian Pac J. Cancer Prev. 16 (4), 1591–1597. doi:10.7314/apjcp.2015.16.4.1591
Higgins-Chen, A. T., Thrush, K. L., Wang, Y., Minteer, C. J., Kuo, P. L., Wang, M., et al. (2022). A computational solution for bolstering reliability of epigenetic clocks: implications for clinical trials and longitudinal tracking. Nat. Aging 2 (7), 644–661. doi:10.1038/s43587-022-00248-2
Horvath, S. (2013). DNA methylation age of human tissues and cell types. Genome Biol. 14 (10), R115. doi:10.1186/gb-2013-14-10-r115
Horvath, S., Gurven, M., Levine, M. E., Trumble, B. C., Kaplan, H., Allayee, H., et al. (2016). An epigenetic clock analysis of race/ethnicity, sex, and coronary heart disease. Genome Biol. 17 (1), 171. doi:10.1186/s13059-016-1030-0
Horvath, S., Oshima, J., Martin, G. M., Lu, A. T., Quach, A., Cohen, H., et al. (2018). Epigenetic clock for skin and blood cells applied to Hutchinson Gilford Progeria Syndrome and ex vivo studies. Aging (Albany NY) 10 (7), 1758–1775. doi:10.18632/aging.101508
Horvath, S., and Raj, K. (2018). DNA methylation-based biomarkers and the epigenetic clock theory of ageing. Nat. Rev. Genet. 19 (6), 371–384. doi:10.1038/s41576-018-0004-3
Jain, P., Binder, A. M., Chen, B., Parada, H., Gallo, L. C., Alcaraz, J., et al. (2022). Analysis of epigenetic age acceleration and healthy longevity among older US women. JAMA Netw. Open 5 (7), e2223285. doi:10.1001/jamanetworkopen.2022.23285
Kim, Y., Han, B. G., and Ko, G. E.Sg (2017). Cohort profile: the Korean Genome and Epidemiology study (KoGES) consortium. Int. J. Epidemiol. 46 (2), 1350. doi:10.1093/ije/dyx105
Leek, J. T., Johnson, W. E., Parker, H. S., Jaffe, A. E., and Storey, J. D. (2012). The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics 28 (6), 882–883. doi:10.1093/bioinformatics/bts034
Levine, M. E., Higgins-Chen, A., Thrush, K., Minteer, C., and Niimi, P. (2022). Clock work: deconstructing the epigenetic clock signals in aging, disease, and reprogramming. bioRxiv. 2022. 02.13.480245.
Levine, M. E., Lu, A. T., Quach, A., Chen, B. H., Assimes, T. L., Bandinelli, S., et al. (2018). An epigenetic biomarker of aging for lifespan and healthspan. Aging (Albany NY) 10 (4), 573–591. doi:10.18632/aging.101414
Li, Y., Zhong, X., Cheng, G., Zhao, C., Zhang, L., Hong, Y., et al. (2017). Hs-CRP and all-cause, cardiovascular, and cancer mortality risk: a meta-analysis. Atherosclerosis 259, 75–82. doi:10.1016/j.atherosclerosis.2017.02.003
Lin, W. Y., Wang, Y. C., Teng, I. H., Liu, C., and Lou, X. Y. (2021). Associations of five obesity metrics with epigenetic age acceleration: evidence from 2,474 Taiwan Biobank participants. Obes. (Silver Spring) 29 (10), 1731–1738. doi:10.1002/oby.23255
Liu, J., Zeng, F. F., Liu, Z. M., Zhang, C. X., Ling, W. H., and Chen, Y. M. (2013). Effects of blood triglycerides on cardiovascular and all-cause mortality: a systematic review and meta-analysis of 61 prospective studies. Lipids Health Dis. 12, 159. doi:10.1186/1476-511X-12-159
López-Otín, C., Blasco, M. A., Partridge, L., Serrano, M., and Kroemer, G. (2023). Hallmarks of aging: an expanding universe. Cell. 186 (2), 243–278. doi:10.1016/j.cell.2022.11.001
Lu, A. T., Quach, A., Wilson, J. G., Reiner, A. P., Aviv, A., Raj, K., et al. (2019). DNA methylation GrimAge strongly predicts lifespan and healthspan. Aging (Albany NY) 11 (2), 303–327. doi:10.18632/aging.101684
Maddock, J., Castillo-Fernandez, J., Wong, A., Cooper, R., Richards, M., Ong, K. K., et al. (2020). DNA methylation age and physical and cognitive aging. J. Gerontol. A Biol. Sci. Med. Sci. 75 (3), 504–511. doi:10.1093/gerona/glz246
Magnussen, C., Ojeda, F. M., Rzayeva, N., Zeller, T., Sinning, C. R., Pfeiffer, N., et al. (2017). FEV1 and FVC predict all-cause mortality independent of cardiac function - results from the population-based Gutenberg Health Study. Int. J. Cardiol. 234, 64–68. doi:10.1016/j.ijcard.2017.02.012
McCrory, C., Fiorito, G., Hernandez, B., Polidoro, S., O'Halloran, A. M., Hever, A., et al. (2021). GrimAge outperforms other epigenetic clocks in the prediction of age-related clinical phenotypes and all-cause mortality. J. Gerontol. A Biol. Sci. Med. Sci. 76 (5), 741–749. doi:10.1093/gerona/glaa286
Palma-Gudiel, H., Fananas, L., Horvath, S., and Zannas, A. S. (2020). Psychosocial stress and epigenetic aging. Int. Rev. Neurobiol. 150, 107–128. doi:10.1016/bs.irn.2019.10.020
Petrie, J. R., Guzik, T. J., and Touyz, R. M. (2018). Diabetes, hypertension, and cardiovascular disease: clinical insights and vascular mechanisms. Can. J. Cardiol. 34 (5), 575–584. doi:10.1016/j.cjca.2017.12.005
Quach, A., Levine, M. E., Tanaka, T., Lu, A. T., Chen, B. H., Ferrucci, L., et al. (2017). Epigenetic clock analysis of diet, exercise, education, and lifestyle factors. Aging (Albany NY) 9 (2), 419–446. doi:10.18632/aging.101168
Rockl, S., Brinks, R., Baumert, J., Paprott, R., Du, Y., Heidemann, C., et al. (2017). All-cause mortality in adults with and without type 2 diabetes: findings from the national health monitoring in Germany. BMJ Open Diabetes Res. Care 5 (1), e000451. doi:10.1136/bmjdrc-2017-000451
Ryan, J., Wrigglesworth, J., Loong, J., Fransquet, P. D., and Woods, R. L. (2020). A systematic review and meta-analysis of environmental, lifestyle, and health factors associated with DNA methylation age. J. Gerontol. A Biol. Sci. Med. Sci. 75 (3), 481–494. doi:10.1093/gerona/glz099
Schmitz, L. L., Zhao, W., Ratliff, S. M., Goodwin, J., Miao, J., Lu, Q., et al. (2022). The socioeconomic gradient in epigenetic ageing clocks: evidence from the multi-ethnic study of atherosclerosis and the health and retirement study. Epigenetics 17 (6), 589–611. doi:10.1080/15592294.2021.1939479
Tomo, Y., and Nakaki, R. (2024). Transfer elastic Net for developing epigenetic clocks for the Japanese population. Mathematics 12 (17), 2716. doi:10.3390/math12172716
Wang, Y., Grant, O. A., Zhai, X., McDonald-Maier, K. D., and Schalkwyk, L. C. (2024). Insights into ageing rates comparison across tissues from recalibrating cerebellum DNA methylation clock. Geroscience 46 (1), 39–56. doi:10.1007/s11357-023-00871-w
Wu, X., Huang, Q., Javed, R., Zhong, J., Gao, H., and Liang, H. (2019). Effect of tobacco smoking on the epigenetic age of human respiratory organs. Clin. Epigenetics 11 (1), 183. doi:10.1186/s13148-019-0777-z
Yaskolka, M. A., Keller, M., Bernhart, S. H., Rinott, E., Tsaban, G., Zelicha, H., et al. (2021). Lifestyle weight-loss intervention may attenuate methylation aging: the CENTRAL MRI randomized controlled trial. Clin. Epigenetics 13 (1), 48. doi:10.1186/s13148-021-01038-0
Yousefi, P. D., Suderman, M., Langdon, R., Whitehurst, O., Davey Smith, G., and Relton, C. L. (2022). DNA methylation-based predictors of health: applications and statistical considerations. Nat. Rev. Genet. 23 (6), 369–383. doi:10.1038/s41576-022-00465-w
Zhang, J., Wang, S., and Liu, B. (2023). New insights into the genetics and epigenetics of aging plasticity. Genes (Basel) 14 (2), 329. doi:10.3390/genes14020329
Zhang, Q., Vallerga, C. L., Walker, R. M., Lin, T., Henders, A. K., Montgomery, G. W., et al. (2019). Improved precision of epigenetic clock estimates across tissues and its implication for biological ageing. Genome Med. 11 (1), 54. doi:10.1186/s13073-019-0667-1
Zhang, Y., Wilson, R., Heiss, J., Breitling, L. P., Saum, K. U., Schottker, B., et al. (2017). DNA methylation signatures in peripheral blood strongly predict all-cause mortality. Nat. Commun. 8, 14617. doi:10.1038/ncomms14617
Zheng, Z., Li, J., Liu, T., Fan, Y., Zhai, Q. C., Xiong, M., et al. (2024). DNA methylation clocks for estimating biological age in Chinese cohorts. Protein Cell 15 (8), 575–593. doi:10.1093/procel/pwae011
Zhong, G. C., Huang, S. Q., Peng, Y., Wan, L., Wu, Y. Q., Hu, T. Y., et al. (2020). HDL-C is associated with mortality from all causes, cardiovascular disease and cancer in a J-shaped dose-response fashion: a pooled analysis of 37 prospective cohort studies. Eur. J. Prev. Cardiol. 27 (11), 1187–1203. doi:10.1177/2047487320914756
Keywords: epigenetic clock, lifestyle, methylation, healthcare, aging
Citation: Kim DJ, Kang JH, Kim J-W, Kim Sb, Lee YK, Cheon MJ and Lee B-C (2024) Assessing the utility of epigenetic clocks for health prediction in South Korean. Front. Aging 5:1493406. doi: 10.3389/fragi.2024.1493406
Received: 09 September 2024; Accepted: 13 November 2024;
Published: 02 December 2024.
Edited by:
Sandeep Kumar Singh, Indian Scientific Education and Technology Foundation, IndiaReviewed by:
Giovanni Fiorito, IRCCS Giannina Gaslini Institute, ItalyMaria Florencia Tevy, INTA, University of Chile, Chile
Myriam Richaud, UMR5247 Institut des Biomolécules Max Mousseron (IBMM), France
Copyright © 2024 Kim, Kang, Kim, Kim, Lee, Cheon and Lee. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Byung-Chul Lee, aW9AZ2Vub3BsYW4uY29t