.
O P E N A C C E S S S O U R C E : Aging Cell
Abstract
Epigenetic clocks, developed using DNA methylation data, have been widely used to quantify biological aging in multiple tissues/cells. However, many existing epigenetic clocks are weakly correlated with each other, suggesting they may capture different biological processes. We utilize multi‐omics data from diverse human tissue/cells to identify shared features across eleven existing epigenetic clocks. Despite the striking lack of overlap in CpGs, multi‐omics analysis suggested five clocks (Horvath1, Horvath2, Levine, Hannum, and Lin) share transcriptional associations conserved across purified CD14+ monocytes and dorsolateral prefrontal cortex. The pathways enriched in the shared transcriptional association suggested links between epigenetic aging and metabolism, immunity, and autophagy. Results from in vitro experiments showed that two clocks (Levine and Lin) were accelerated in accordance with two hallmarks of aging—cellular senescence and mitochondrial dysfunction. Finally, using multi‐tissue data to deconstruct the epigenetic clock signals, we developed a meta‐clock that demonstrated improved prediction for mortality and robustly related to hallmarks of aging in vitro than single clocks.
1 INTRODUCTION
Developing biomarkers that can quantify or approximate biological aging holds enormous promise for both basic and translational biomedical research (Kennedy et al., 2014; Lopez‐Otin, Blasco, Partridge, Serrano, & Kroemer, 2013; Sierra & Kohanski, 2017). One of the most promising biomarkers of aging that has emerged is the epigenetic clock. Some subset of DNA methylation (DNAm) has been shown to change predictably over the life span (Florath, Butterbach, Muller, Bewerunge‐Hudler, & Brenner, 2014; Johansson, Enroth, & Gyllensten, 2013; Rakyan et al., 2010; Teschendorff et al., 2010). Utilizing this observation, epigenetic clocks quantitatively combine DNAm levels at tens to hundreds of genomic locations into composite methylation‐based age predictors that often exhibit extremely high correlations with chronological age, upwards of r = 0.98 in full age range samples (a brief review of various epigenetic clocks (Table S1) can be found in Appendix S1 and other literature (Field et al., 2018)). Epigenetic clocks can also be contrasted against individuals' chronological ages to assess inter‐individual and/or inter‐tissue variability in the rate of aging (Horvath & Raj, 2018), and for a number of clocks, divergence between epigenetic and chronological age (referred to as age acceleration) has been shown to translate to differential susceptibility to death (Chen et al., 2016; Levine et al., 2018; Marioni et al., 2015; Zhang et al., 2017) and disease (Ambatipudi et al., 2017; Levine, Hosgood, et al., 2015; Levine, Lu, Bennett, & Horvath, 2015; Levine et al., 2018; Yang et al., 2016; Zheng et al., 2016).
Despite their shared theoretical interpretation, epigenetic clocks vary in their associations with health outcomes and each other. This does not seem to simply reflect a difference in the validity of the various clocks, but instead may suggest that they capture different aspects of the biological aging process, which likely stems from the differences in the outcomes and populations used to train them. For instance, most clocks were trained exclusively using samples in whole blood, whereas two clocks comprised multi‐tissue samples (Appendix S1). Additionally, while most were developed as “chronological age predictors,” three were trained using other phenotypes that reflect the effect of aging on health characteristics. The clock by Zhang et al. (2017) was developed as a predictor of all‐cause mortality; the clock by Yang et al. (2016) was trained to approximate mitotic rate; and the clock by Levine et al. (2018) was developed to approximate a multi‐system clinical aging measure that strongly correlates with age, but differentiates same‐aged individuals based on morbidity and mortality risk.
While the clocks share both similarities and differences, we lack an understanding of the underlying processes that they capture. We hypothesized that multiple distinct epigenetic aging phenomena exist, which are captured by the various clocks to varying degrees and accuracies, and that by identifying these overlapping signals we could construct a more reliable and valid biomarker of aging. In this study, we first used multi‐omics data from diverse human tissue/cell types, coupled with in vitro experiments focusing on hallmarks of aging, to delineate a comprehensive picture of the shared and contrasting features captured across eleven existing epigenetic clocks (Figure 1). Building on these observations, we decomposed these existing epigenetic clocks and recombined their conserved features into a single “meta‐clock,” for which we demonstrated improved prediction for mortality and more robust aging associations in vivo and in vitro.
Figure 1. Roadmap of this comparative analysis. To simplify the description, we used the last name of the first author to refer to each clock. The upper part shows the timeline of eleven epigenetic clocks included in this study. The next two parts include hypothesis development and testing. In hypothesis development part, we did comparative analysis for eleven epigenetic clocks mainly in four aspects. In hypothesis test part, we deconstructed the core signals across them, and developed and validated a novel meta‐clock
2 RESULTS
2.1 Contrasting clock characteristics across tissues/cells
The eleven epigenetic clocks considered in this study comprise 1600 CpGs, the majority of which (n = 1427) are specific to only one clock (Table S2). The lack of overlap in genomic locations between the clocks remains even when considering larger genomic regions (Figure S1), suggesting that clocks are not simply selecting adjacent CpGs and instead are drawing markers from entirely different regions. However, one CpG (cg09809672 in EDARADD) was included in seven of the eleven clocks. Surprisingly, the CpG (cg16867657) in ELOVL2 was only included in three clocks—Hannum, Horvath2, and Garagnani (the single CpG clock).
CpGs in the various clocks are also distinct in terms of their multi‐tissue age associations (Figure S2). Despite the fact that the Hannum clock was trained exclusively in whole blood, the CpGs comprising it show the strongest and most consistent age correlations across tissue and cell types—with approximately half of CpGs showing strong positive age correlations and the other half showing strong negative age correlations. Interestingly, CpGs in the original pan‐tissue clock by Horvath (which we refer to as Horvath1) show moderate tissue consistency, with a large proportion of CpGs exhibiting very weak to no age correlation. Given that Horvath1 was trained using 52 distinct tissues and cell types, we hypothesize that many of these non‐age‐related CpGs may actually be reflecting (and adjusting out) tissue differences. Similarly, the Levine clock also contains an abundance of CpGs with weak age correlations. The Levine clock—trained in whole blood—aimed to capture aging differences among individuals of the same chronological age. Thus, many CpGs in the Levine clock may reflect more stable differences in processes that influence aging, but do not themselves change with age in a predictable manner. Examples may include innate or acquired differences in resilience mechanisms (Levine & Crimmins, 2016), mTOR signaling (Kennedy & Lamming, 2016), inherent inflammatory/immune responsiveness (Brodin & Davis, 2017), etc.
When considering the overall clock scores, rather than the individual CpGs that comprise them, we get a slightly different picture. As shown in Figure 2, Horvath1 exhibits the strongest age correlation across pooled tissues and cells (r = 0.94), which is not entirely surprising given that it was trained to do just that. This is followed by Horvath2 (r = 0.85), which was the only other age predictor trained on more than one tissue type. While it is often misstated that the clocks developed using DNAm from whole blood do not “work” in other tissue and cell types, we found that the clocks by Hannum, Levine, Lin, and Weidner also exhibit fairly robust multi‐tissue age correlations (r = 0.68, 0.53, 0.67, and 0.46, respectively). Further, the reduced correlations appear to be due to tissue differences in the slope and intercept, rather than a lack of age correlations within tissues (Figure S3). This does not suggest that epigenetic changes are tissue‐specific, but perhaps that tissues have innate differences in vulnerability to epigenetic changes that manifest as differences in the rate of epigenetic aging as a function of chronological age. Interestingly, age slopes in these clocks appear to be most distinct when comparing brain to other tissues—with brain showing a much slower increase in epigenetic age over chronological age.
Figure 2. Age correlations for the clock scores across tissue and cell types. Pearson's correlations were used to assess associations between chronological age (x‐axis) and DNAmAge (y‐axis) by pooling 16 distinct tissue and cell types. Epigenetic clocks are denoted using the last name of the first author. DLPFX, dorsolateral prefrontal cortex
2.2 Conserved transcriptional signature across six epigenetic clocks
To examine the potential functional implications of epigenetic aging assessed by various clocks, we linked the clocks to differential gene expression. As an initial step, clocks were clustered based on their shared transcriptomic expression patterns in both purified CD14+ monocytes and dorsolateral prefrontal cortex (DLPFC) using the log2FC values from 8589 genes as input (Figure 3a,b). Results suggested that the accelerations (i.e., chronological age‐adjusted residuals) of five clocks (Hannum, Lin, Levine, Horvath1, and Horvath2) have similar transcriptional signals in both monocytes and DLPFC. When comparing the relative levels of differential expression associated with these five clocks, we found associations to be strongest for the Hannum clock in monocytes (Figure 3c), and the Horvath2 clock in DLPFC (Figure 3d).
Figure 3. Transcriptomic pattern of 11 existing epigenetic clocks. Hieratical clustering of epigenetic clocks was performed based on the log2FC values for age‐adjusted associations with 8589 genes in monocytes (a) and DLPFC (b). © Comparisons of the strength of differential expression associations between clocks, for five clocks (reference clock = Horvath1) in monocytes. The x‐axis represents the log2FC for the association between Horvath1 and 8589 genes. The y‐axis represents the log2FC for the association between five epigenetic clocks (Yang, Hannum, Lin, Levine, and Horvath2) and 8589 genes, with clocks distinguished by colors. The slope represents the fitted line of the association between the log2FC for Horvath1 and the log2FC for each of the other five clocks, respectively. Thus, slope > 1 suggests the respective clock has stronger gene expression signals compared to Horvath1; a slope < 1 suggests Horvath1 has stronger gene expression signals compared to the respective clock; and a slope = 1 suggests comparable gene expression associations between Horvath1 and the respective clock. (d) Comparisons of the strength of differential methylation for six clocks (Levine, Weidner, VidalBralo, Hannum, Lin, and Horvath2) in DLPFC relative to Horvath1. Selected GO terms (e) and KEGG pathways (f) by the enrichment analysis for co‐expression modules (identified via WGCNA) that were shown to be associated with multiple epigenetic clocks in monocytes and/or DLPFC. Modules are denoted by color (turquoise, yellow, green, and red). For each module, the five most enriched biological processes are shown, based on q value (FDR). DLPFX, dorsolateral prefrontal cortex
Next, we employed consensus weighted gene co‐expression network analysis (WGCNA), to identify gene co‐expression modules shared across monocytes and DLPFC. Modules can be thought of as tightly clustered genes that appear to operate as a network. We identified 16 co‐consensus expression modules (CM, Figure S4) shared in monocytes and DLPFC. We then calculated a summary value for each module, known as the eigengene, and tested the associations between module eigengenes and epigenetic clocks (accounting for age). Several module eigengenes displayed robust and consistent clock correlations across the two tissues (Figure S5). The co‐expression modules with the most robust clock associations after adjusting for age were the turquoise module (inverse association), green module (inverse association), yellow module (positive association), and red module (positive association). We then performed Gene Ontology (GO) enrichment analysis using the genes assigned to each of these modules, individually. We find that the turquoise module is enriched for genes involved in “cellular respiration” and “mitochondrial translation,” suggesting that epigenetic aging is associated with decreased transcription for genes involved in these biological processes. The green module, for which decreased transcription was also associated with epigenetic clocks was enriched for genes involved in “neutrophil‐mediated immunity.” Two modules for which increased expression was associated with epigenetic aging were the yellow module and the red module. The yellow module was enriched for genes involved in ‘chromatin organization’ and “histone modifications,” while the red module was enriched for genes involved in “regulation of autophagosome maturation” and “negative regulation of autophagy” (Figure 3e). Complete results for associations of eigengenes with epigenetic clocks, as well as module‐specific GO and KEGG enrichment are available in Tables S3 and S4.
2.3 Epigenetic clocks distinguish cancer vs normal tissues
Given the link between epigenetics aging and cancer (Ambatipudi et al., 2017; Levine, Hosgood, et al., 2015; Teschendorff et al., 2010; Yang et al., 2016; Zheng et al., 2016), we examined whether epigenetic clocks differ in terms of distinguishing tumor vs. normal tissues using data (GSE53051) from four different tumor/tissue types (breast, colon, lung, and pancreas cancer) (Timp et al., 2014). Results (Figure 4) showed that tumor samples had accelerated epigenetic aging relative to normal tissue using two (Levine and Yang) of the six epigenetic clocks shown to have conserved transcriptional signals in monocytes. Hannum showed slight acceleration in tumor vs normal for all tissues, except lung, while the other three clocks did not show consistent patterning for tumor vs. normal samples. The findings were replicable in two independent datasets for breast and colon cancer (Table S5).
Figure 4. Epigenetic clocks distinguish cancer vs. normal tissues. DNAmAge (adjusted for tissue type and age of the donor) was compared between tumor (red color) and normal tissue (blue color) for breast, colon, lung, and pancreas. Bars indicate standard errors
2.4 In vitro evidence linking epigenetic clocks, cellular senescence, and mitochondrial depletion
To link epigenetic clocks to two well‐known hallmarks of aging—cellular senescence and mitochondrial dysfunction—we examined DNAm data from both cultured fibroblasts (GSE91069) and 143B cells (GSE100249). Results showed (Figure 5a–f) that the clocks by Lin and Levine displayed a sequential increase as cells transition from early passage (EP) to near senescent (NS), and finally to replicative senescence (RS)—NS vs. EP (p‐value = 6.8E−8, 1.3E−4, and 9.7E−4, respectively), and RS vs. EP (p‐value = 4.5E−9, 6.5E−7, and 1.9E−6, respectively). Of these, only Levine showed a suggestive increase when comparing oncogene‐induced senescence (OIS) vs. EP (p = 0.056). Conversely, the only clock to show significant increase in OIS vs. EP was Hannum (p = 4.0E−3); however, paradoxically, Hannum suggested RS and NS had lower epigenetic acceleration relative to EP.
Figure 5. Epigenetic clocks, cellular senescence, and mitochondrial DNA depletion. (a–f) DNAmAge was estimated in BJ fibroblasts using early passage (EP) control samples, near senescent cells (NS), terminally passaged replicative senescent cells (RS), and oncogene‐induced senescent (OIS) cells via HRAS. (g–l) DNAmAge was estimated and compared between control (rho+) and mitochondrial DNA depleted (rho−) 143B cells. Bars indicate standard errors
.../...
.