Literature DB >> 18714388

Transcriptome-wide assessment of human brain and lymphocyte senescence.

Mun-Gwan Hong1, Amanda J Myers, Patrik K E Magnusson, Jonathan A Prince.   

Abstract

BACKGROUND: Identifying biological pathways that vary across the age spectrum can provide insight into fundamental mechanisms that impact disease and frailty in the elderly. Few methodological approaches offer the means to explore this question on as broad a scale as gene expression profiling. Here, we have evaluated mRNA expression profiles as a function of age in two populations; one consisting of 191 individuals with ages-at-death ranging from 65-100 years and with post-mortem brain mRNA measurements of 13,216 genes and a second with 1240 individuals ages 15-94 and lymphocyte mRNA estimates for 18,519 genes. PRINCIPAL
FINDINGS: Among negatively correlated transcripts, an enrichment of mitochondrial genes was evident in both populations, providing a replication of previous studies indicating this as a common signature of aging. Sample differences were prominent, the most significant being a decrease in expression of genes involved in translation in lymphocytes and an increase in genes involved in transcription in brain, suggesting that apart from energy metabolism other basic cell processes are affected by age but in a tissue-specific manner. In assessing genomic architecture, intron/exon sequence length ratios were larger among negatively regulated genes in both samples, suggesting that a decrease in the expression of non-compact genes may also be a general effect of aging. Variance in gene expression itself has been theorized to change with age due to accumulation of somatic mutations and/or increasingly heterogeneous environmental exposures, but we found no evidence for such a trend here. SIGNIFICANCE: Results affirm that deteriorating mitochondrial gene expression is a common theme in senescence, but also highlight novel pathways and features of gene architecture that may be important for understanding the molecular consequences of aging.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18714388      PMCID: PMC2515343          DOI: 10.1371/journal.pone.0003024

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

A decline in cell function with advancing age is a ubiquitous characteristic of all organisms. In humans, the effects of aging become manifest on a variety of levels that extend from an accumulation of DNA mutations to lipid oxidation, protein modification, cell loss, and ultimately death that is primarily due to increased susceptibility to age-related diseases [1]. Apart from overt changes, such as declining muscle strength, extensive metabolic alterations also occur with aging, one of the most prominent being impaired glucose tolerance [2]. Two central evolutionary theories hypothesize that the detrimental effects of aging are due to an accumulation of mutations, or antagonistic pleiotropy, whereby genes with beneficial effects early in life become deleterious with age [3]. These are not necessarily exclusive, and there is at present relatively strong evidence for both in studies of model organisms and in natural populations [3], [4]. Whatever the cause, there is value in charting the molecular sequelae of aging on as broad a scale as possible. Few other methodological approaches lend themselves as well to this as mRNA expression profiling. There have been a handful of studies that have attempted to catalogue how mRNA expression changes with age, the largest of which have been performed in kidney [5] and muscle samples [6]. An intriguing conclusion from the latter study is that there may be a common set of genes that change equivalently in different tissues. For example genes that make up the mitochondrial electron transport chain appear to decrease with age in different tissues, and this is supported in that decreases are also evident in mice and flies [6]. Importantly however, these studies remain relatively small in scale and few in number thus meriting larger studies in additional populations and tissues. The effects of aging are particularly pronounced in the human brain where characteristic changes in morphology include a reduction in both neuronal size and synaptic density [7], [8]. On a behavioral level, decreases in motor and cognitive function are hallmarks of normal aging [9]. Dementia is the most prevalent disorder of the human brain, affecting 20–25 million people worldwide and representing a tremendous burden in terms of years of suffering with disability [10]. To date, there has been only one study, involving 30 individuals, on the effects of aging on gene expression at a transcriptome-level in the normal human brain [11]. There the authors described a number of pathways affected by aging and also concluded that genes for which down-regulation was evident had an over-representation of mutation in gene promoters. A recent study examined the effects of gene polymorphism on gene expression in a relatively large set of human brain samples [12]. An attractive feature of that study was that individuals, prior to death, were free of neurological disease. Our primary focus in the present study has been to use the above sample to investigate the question of whether general changes in gene expression occur in the aging human brain. This has been complemented with an analysis of lymphocyte mRNA expression in order to explore for common molecular themes in different tissues as well as to enable an assessment of a broader age range than that typically available in post-mortem samples.

Results

We began by performing several validation steps towards the initial goal of evaluating if age-related change in gene expression could be detected in the human brain. First, evidence was sought for systematic outlier effects among the 2,096,975 individual expression level estimates. We identified all expression values that were more than 3.4644 standard deviations from the mean expression value for that particular transcript (see methods for an explanation for choosing this threshold). With this, we expected to see approximately 1,086 outliers by chance in the entire set assuming normal distributions of the log10 transformed data. There were 7,079 outliers identified in this way, and these were eliminated from all further analyses. Second, the focus of the original study was primarily upon brain cortex, with 5 subsets being represented, 3 cortical, one group of 6 cerebellum samples and an additional group of samples for which no specific region could be assigned. We assessed whether these classifications differed systematically with regards to transcript detection rates and global expression levels using ANOVA (see methods). Across the various brain region categories there were no significant differences for any these covariates. Simple linear regression was also performed for age-at-death and pmi versus global expression level. For this, there was marginal evidence that age-at-death correlated with global expression (P = 0.080, r2 = 0.016). For pmi however, the trend was significant, but the direction of the curve suggested that expression was increasing with increasing pmi (P = 0.0039. r2 = 0.046). Of note, linear regression of the 2 global expression covariates (see methods) and detection rate metrics (one reflecting all 24,357 transcripts probed by the Illumina HumanRefseq-8 chip and the other reflecting the 14,078 transcripts detected in this brain sample) all showed strong correlation (r2 in excess of 0.5 for all 6 comparisons). Based upon all of the above tests, we resolved to use the global average expression level that includes all 14,078 transcripts as a primary metric of quality as a covariate for initial analyses. For the brain sample, we regressed age-at-death on transcript levels one at a time for all 14,078 transcripts that had been detected in 5 or more individuals in the original study. The results of this are shown in Tables 1 and 2, where we document 54 transcripts that exhibited significant change after Bonferroni correction for 14,078 tests. For purposes of illustration, regression lines for the top 6 scoring genes (3 up-regulated and 3 down-regulated) are also displayed in Figure 1. For this analysis, the most significant finding was for the SVOP gene (encoding a synaptic vesicle protein), and the single strongest effect in terms of variance explained was for the TAC3 gene (encoding tachykinin 3). The smallest number of individuals for which a significant correlation with age was detected was 133 (also for the TAC3 gene). We noted that 3 of the top scoring 54 genes were also present in a list of 532 putative “housekeeping” genes (see [13]), these being PIN1, GUK1, and ARPC2. This proportion is not significantly different from what would be expected by chance (3 vs. 51 compared to 532 vs. 6968). At this stage, we also scrutinized this list of 54 genes more closely for the potential impact of covariates. For this, multiple regression models were fitted as above, but for each transcript in turn we also included terms for pmi, brain region, or gender. All 54 genes remained highly significant when any of these were individually tested (not shown). Among the top 54 transcripts, all were detectable in more than 100 individuals, reinforcing the importance of power in analyses such as this. There were however a few cases where a small number of observations gave rise to apparently large effects. The gene that ranked as the 139th most significant finding (falling below the multiple test correction threshold), ZIC3, had only 14 expression level observations and an r2 = 0.76 (P = 2.6×10−5). This highlighted the possibility that genes with lower maximum expression levels might be changing to a larger degree with age, but technologies with lower detection thresholds and/or larger sample sizes would be required to identify them.
Table 1

Genes whose expression decreases with age in the human brain.

SymbolDescriptionChr.Pos P valuea R2 b
SVOPSV2 related protein homolog (rat)12q4.3(10−9)0.14
HBQ1hemoglobin, theta 116p8.6(10−8)0.13
MRPL28mitochondrial ribosomal protein L2816p1.0(10−7)0.14
TAC3tachykinin 3 (neuromedin K, neurokinin beta)12q1.0(10−7)0.18
C6orf154chromosome 6 open reading frame 1546p1.5(10−7)0.13
SLC25A6solute carrier family 25, member A6Xp/Yp1.9(10−7)0.14
ST3GAL2ST3 beta-galactoside alpha-2,3-sialyltransferase 216q2.2(10−7)0.15
PIN1peptidylprolyl cis/trans isomerase, NIMA-interacting 119p3.2(10−7)0.13
ARPC2actin related protein 2/3 complex, subunit 2, 34 kDa2q3.2(10−7)0.14
KCNF1potassium voltage-gated channel, subfamily F, member 12p4.1(10−7)0.12
GSSglutathione synthetase20q4.3(10−7)0.11
LOC255849hypothetical LOC2558494.5(10−7)0.12
HMGB3high-mobility group box 3Xq4.8(10−7)0.13
TMEM121transmembrane protein 12114q6.1(10−7)0.14
CAMK2N1calcium/calmodulin-dependent protein kinase II inhibitor 11p6.4(10−7)0.12
OLFM1olfactomedin 19q7.4(10−7)0.13
KCNIP1Kv channel interacting protein 15q9.2(10−7)0.12
FABP3fatty acid binding protein 31p1.1(10−6)0.08
GUK1guanylate kinase 11q1.2(10−6)0.13
COPS7ACOP9 constitutive photomorphogenic homolog subunit 7A (Arabidopsis)12p1.3(10−6)0.07
VIPvasoactive intestinal peptide6q1.4(10−6)0.11
PQLC1PQ loop repeat containing 118q1.6(10−6)0.13
FLJ34048hypothetical transcript1.6(10−6)0.12
CYP46A1cytochrome P450, family 46, subfamily A, polypeptide 114q2.0(10−6)0.11
ATG7ATG7 autophagy related 7 homolog (S. cerevisiae)3p2.1(10−6)0.11
CXCL14chemokine (C-X-C motif) ligand 145q2.3(10−6)0.13
NXPH1neurexophilin 17p2.6(10−6)0.11
C17orf76chromosome 17 open reading frame 762.7(10−6)0.17
NPM3nucleophosmin/nucleoplasmin, 310q3.1(10−6)0.08
LHX6LIM homeobox 69q3.2(10−6)0.07
FRMPD2FERM and PDZ domain containing 210q3.4(10−6)0.07
HSD11B1Lhydroxysteroid (11-beta) dehydrogenase 1-like19p3.5(10−6)0.12
SMAD3SMAD family member 315q3.5(10−6)0.09

P-value with the null hypothesis β 1 = 0 in linear regression model Y = β 0+β 1 x 1+β 2 x 2+ε, where Y is transcript expression, x 1 age-at-death, x 2 global expression, and ε random error. bCoefficient of determination of linear regression model Y = β 0+β 1 x 1+ε, where each variable is as described above.

Table 2

Genes whose expression increases with age in the human brain.

SymbolDescriptionChr.Pos P valuea R2 b
ELF2E74-like factor 2 (ets domain transcription factor)4q5.3(10−9)0.16
C1orf162chromosome 1 open reading frame 1621p6.0(10−9)0.13
LRCH4leucine-rich repeats and calponin homology (CH) domain containing 47q2.9(10−8)0.09
MTUS1mitochondrial tumor suppressor 18p6.8(10−8)0.12
RUFY1RUN and FYVE domain containing 15q7.0(10−8)0.10
RDH5retinol dehydrogenase 5 (11-cis/9-cis)12q1.4(10−7)0.06
TYK2tyrosine kinase 219p1.8(10−7)0.10
CLK1CDC-like kinase 12q2.1(10−7)0.14
TXNIPthioredoxin interacting protein1q2.1(10−7)0.10
SLC16A9solute carrier family 16 (monocarboxylic acid transporters), member 910q4.8(10−7)0.06
ADORA3adenosine A3 receptor1p5.1(10−7)0.10
UCKL1uridine-cytidine kinase 1-like 120q7.4(10−7)0.11
CTDSP2nuclear LIM interactor-interacting factor 212q1.1(10−6)0.05
HLA-DPB1major histocompatibility complex, class II, DP beta 16p1.2(10−6)0.09
PATL1protein associated with topoisomerase II homolog 1 (yeast)11q1.4(10−6)0.12
GDPD3glycerophosphodiester phosphodiesterase domain containing 316p1.4(10−6)0.07
BHLHB3basic helix-loop-helix domain containing, class B, 312p1.7(10−6)0.07
RNASE4ribonuclease, RNase A family, 414q1.9(10−6)0.06
PLEKHM1pleckstrin homology domain containing, family M (with RUN domain) member 117q2.5(10−6)0.04
FAM46Afamily with sequence similarity 46, member A6q3.4(10−6)0.06
CALCOCO1calcium binding and coiled-coil domain 112q3.5(10−6)0.09

P-value with the null hypothesis β 1 = 0 in linear regression model Y = β 0+β 1 x 1+β 2 x 2+ε, where Y is transcript expression, x 1 age-at-death, x 2 global expression, and ε random error. bCoefficient of determination of linear regression model Y = β 0+β 1 x 1+ε, where each variable is as described above.

Figure 1

Scatter plots and fitted linear regression lines of the most significantly associated down-regulated (a–c) and up-regulated (d–f) genes with age-at-death in brain samples.

P-value with the null hypothesis β 1 = 0 in linear regression model Y = β 0+β 1 x 1+β 2 x 2+ε, where Y is transcript expression, x 1 age-at-death, x 2 global expression, and ε random error. bCoefficient of determination of linear regression model Y = β 0+β 1 x 1+ε, where each variable is as described above. P-value with the null hypothesis β 1 = 0 in linear regression model Y = β 0+β 1 x 1+β 2 x 2+ε, where Y is transcript expression, x 1 age-at-death, x 2 global expression, and ε random error. bCoefficient of determination of linear regression model Y = β 0+β 1 x 1+ε, where each variable is as described above. We contrasted the individual transcript results of this analysis with the first study to address the impact of aging on expression at the transcriptome level [11]. In that study, 30 individuals were included with an age range of 26–106. Our set was scrutinized for the 148 genes previously highlighted to be associated with aging (see specifically table 1 from [11]). Only one gene from our set that survived strict multiple testing correction was also present in their set, this being MRPL28. With a relaxed uncorrected significance threshold in our set of 0.005, we found 26 overlapping genes. This proportion is significant given that we observed 1,141 transcripts in our set that were significant at the 0.005 threshold, suggesting that some replication exists, albeit limited (26 vs. 122 compared to 1,141 vs. 12,937). In our set, the majority of significant genes exhibited a decrease in expression with advancing age. An intriguing aspect of this comparison was that among the 26 overlapping genes, there were 9 for which expression increased with age in the original study, and for all 9 these were increased with age in the present study. Due to the consistency in the direction of the effects, this might be considered a stronger case for replication. Seven of these 9 genes were previously considered as a class of stress response genes. We explored an expanded list of genes that exceeded a relaxed significance threshold (uncorrected P<0.05) for enrichment or deficit based upon Gene Ontology (GO) terms and the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. The entire set of genes was divided into two groups based upon whether expression was increasing or decreasing with age. We then searched for over- or under-represented terms and pathways in each list of genes using the DAVID web application [14]. For this we created a diminished non-redundant set of 13,216 base genes from the total set of 14,078 by eliminating multiple transcripts representing the same gene (see methods). The enriched terms (Bonferroni corrected P<0.05) are shown in Tables 3 and 4, excluding the ancestor terms that became significant mainly by an overrepresented descendant in the GO structure. A larger list of all overrepresented terms and pathways with a less stringent threshold (uncorrected P<0.01) is shown in Tables S1 and S2. In summarizing these results, the genes whose products are involved in DNA-binding were the most significantly overrepresented group among genes whose expression increases with age (Table 4). Genes with products involved in the regulation of DNA-dependent transcription and genes encoding proteins located in or a subcomponent of the nucleus were also significantly enriched. Among all genes negatively correlated with age, the most significantly enriched groups included genes involved in nervous system development, mitochondrial genes, and those whose products are constituents of the cytoplasm (Table 3). We also highlighted the degree of enrichment in terms of a fold change (the number of genes that give rise to this number are shown in Tables S1 and S2). Thus, in quantitative terms the most enriched genes occurred in the set of negatively correlated transcripts, where dendrite genes and genes associated with neuronal projections were enriched more than 3 fold (Table 3). In contrast, although highly significant in some cases, genes whose expression is increasing with age are enriched a maximum of 1.80 fold (Table 4). As a comparison, the probabilities for the enriched terms to be underrepresented by chance within the opposite group of genes are also shown in the last two columns of Tables 3 and 4. For example, from Table 4, we show that the term “nucleus” which is significantly enriched in the set of genes that are increasing with age is also in deficit in the set of genes that are decreasing with age.
Table 3

Terms in the Gene Ontology and KEGG pathway databases over-represented among genes that decreased expression with advancing age in the human brain (1,450 genes in total 13,216).

CategoryTerm P valuea BonferroniFoldb Increasing groupc
Foldb P valued
GO Bio.Processnervous system development7.8(10−9)3.8(10−5)1.730.766.7(10−3)
synaptic transmission7.4(10−7)3.5(10−3)2.070.283.1(10−7)
oxidative phosphorylation1.4(10−6)6.7(10−3)2.730.155.3(10−5)
GO Cell.Componentcytoplasmic part9.0(10−11)7.2(10−8)1.290.764.2(10−11)
mitochondrion1.5(10−10)1.2(10−7)1.650.547.7(10−10)
neuron projection2.3(10−10)1.8(10−7)3.410.401.0(10−2)
synapse1.8(10−9)1.4(10−6)2.550.426.4(10−4)
mitochondrial membrane part8.7(10−9)6.9(10−6)2.950.141.8(10−5)
mitochondrial inner membrane3.1(10−8)2.5(10−5)2.110.455.4(10−5)
mitochondrial respiratory chain4.9(10−6)3.9(10−3)2.990.114.3(10−4)
dendrite5.2(10−6)4.2(10−3)3.600.171.1(10−2)
GO Mol.Functionhydrogen ion transmembrane transporter activity1.8(10−6)4.7(10−3)2.700.585.1(10−2)
KEGG PathwayOxidative phosphorylation1.9(10−9)3.8(10−7)2.660.251.4(10−4)

EASE-score, P-value of a modified Fisher's exact test for overrepresentation [35]. bFold enrichment in each gene group compared to the base set. cGene group with expression that increases with advancing age-at-death (same genes used in Table 4, 1943 genes). dHypergeometric test for underrepresentation using annotated genes from a total set of 13,216 genes as base population [36].

Table 4

Terms in the Gene Ontology and KEGG pathway databases over-represented among genes that increased expression with advancing age in the human brain (1,943 genes in total 13,216).

CategoryTerm P valuea BonferroniFoldb Decreasing groupc
Foldb P valued
GO Bio.Processregulation of transcription, DNA-dependent2.1(10−13)1.0(10−9)1.430.566.1(10−13)
chromosome organization and biogenesis1.4(10−7)6.8(10−4)1.800.531.4(10−3)
DNA packaging4.7(10−6)2.2(10−2)1.780.491.9(10−3)
DNA metabolic process6.4(10−6)3.0(10−2)1.450.597.6(10−5)
GO Cel.Componentnucleus1.2(10−13)9.6(10−11)1.260.753.7(10−11)
GO Mol.FunctionDNA binding6.8(10−14)1.7(10−10)1.440.521.1(10−14)
zinc ion binding1.7(10−11)4.3(10−8)1.390.646.3(10−9)
transcription regulator activity1.1(10−6)2.7(10−3)1.380.651.8(10−5)

EASE-score, P-value of a modified Fisher's exact test for overrepresentation [35]. bFold enrichment in each gene group compared to the base set. cGene group with expression that decreases with advancing age-at-death (same genes used in Table 3, 1450 genes). dHypergeometric test for underrepresentation using annotated genes from a total set of 13,216 genes as base population [36].

EASE-score, P-value of a modified Fisher's exact test for overrepresentation [35]. bFold enrichment in each gene group compared to the base set. cGene group with expression that increases with advancing age-at-death (same genes used in Table 4, 1943 genes). dHypergeometric test for underrepresentation using annotated genes from a total set of 13,216 genes as base population [36]. EASE-score, P-value of a modified Fisher's exact test for overrepresentation [35]. bFold enrichment in each gene group compared to the base set. cGene group with expression that decreases with advancing age-at-death (same genes used in Table 3, 1450 genes). dHypergeometric test for underrepresentation using annotated genes from a total set of 13,216 genes as base population [36]. We extended the above analytical scheme to a second sample consisting of 1240 individuals with lymphocyte mRNA measures for 19,648 transcripts representing 18,519 genes. We note that a rigorous standardization procedure had been applied to this set previously [15] and we thus elected not to perform any additional transformations, nor to include any additional covariates (see methods). Linear regression was performed as above on all 19,648 transcripts versus age for this sample. Given its size, and compared to the brain sample, there were many more transcripts that exceeded a strict Bonferroni significance threshold, and we chose to present only the top 50 (25 negatively and 25 positively correlated with age) which are shown in Tables 5 and 6, arbitrarily truncated from a total of 1080 (612 negative and 468 positive). The top 3 scoring negatively regulated genes and top 3 scoring positively regulated genes are shown in Figure 2. The most significant finding in this set was for the LRRN3 (encoding a membrane protein with unknown function), and this was also the gene upon which age exhibited the largest effect (28% variance explained). Interestingly, the second largest effect observed in this set was 14% variance explained, representing in our view an anomalous drop from the top ranking gene. As was done for the brain sample, we again divided this list into up- and down- regulated genes and performed term/pathway based analyses. An arbitrary significance level of 0.01 was chosen from the linear regression results to establish these 2 lists (this differed from the brain sample for which a threshold of 0.05 was set due to the number of genes exceeding significance in lymphocytes and a 2000 gene limit set by the DAVID web tool). The total set of 19,648 transcripts for which regression was performed, was reduced to 13,231 for which annotations could be obtained towards analysis using the DAVID web application. The results of this are shown in Tables 7 and 8 where Bonferroni corrected (P<0.05) enriched terms and pathways are documented. As was done for the brain sample, an expanded list of enriched terms and pathways according to relaxed uncorrected significance threshold of P<0.01 is shown in Tables S3 and S4. The main highlight in our view is the highly significant enrichment of mitochondrial genes in the negatively regulated group (Table 7), providing a replication of what was seen in the brain. In general, the magnitude of the fold-change in term/pathway enrichment was larger in the set of negatively regulated genes, also in agreement with what was seen in brain.
Table 5

Genes whose expression decreases with age in human lymphocytes.

SymbolDescriptionChr.Pos P valuea R2
LRRN3leucine rich repeat neuronal 37q5.1(10−94)0.29
FCGBPFc fragment of IgG binding protein19q7.5(10−44)0.14
FBLN2fibulin 23p8.1(10−44)0.14
NRCAMneuronal cell adhesion molecule7q3.3(10−43)0.14
ITM2Cintegral membrane protein 2C2q1.8(10−39)0.13
PDE9Aphosphodiesterase 9A21q4.9(10−37)0.12
ZNF154zinc finger protein 15419q5.3(10−36)0.12
ZSCAN18zinc finger and SCAN domain containing 1819q4.7(10−35)0.12
SATB1SATB homeobox 13p9.8(10−35)0.11
FLNBfilamin B, beta (actin binding protein 278)3p1.1(10−34)0.11
FAM134Bfamily with sequence similarity 134, member B5p3.1(10−34)0.11
SCDstearoyl-CoA desaturase (delta-9-desaturase)10q8.8(10−34)0.11
SREBF1sterol regulatory element binding transcription factor 117p4.5(10−33)0.11
CCR7chemokine (C-C motif) receptor 717q6.3(10−33)0.11
PHGDHphosphoglycerate dehydrogenase1p1.1(10−32)0.11
LEF1lymphoid enhancer-binding factor 14q5.3(10−32)0.11
NPM3nucleophosmin/nucleoplasmin, 310q9.8(10−32)0.11
OXNAD1oxidoreductase NAD-binding domain containing 13p2.1(10−31)0.10
TNNT3troponin T type 3 (skeletal, fast)11p3.4(10−30)0.10
PLEKHG416q5.8(10−30)0.10
MGC9913hypothetical protein MGC99131.5(10−29)0.10
SLC7A6solute carrier family 7 (cationic amino acid transporter, y+ system), member 616q4.4(10−29)0.10
CD27CD27 molecule12p6.1(10−29)0.10
AEBP1AE binding protein 17p1.0(10−28)0.10
MGC29506hypothetical protein MGC295065q1.2(10−28)0.09

P-value with the null hypothesis β 1 = 0 in linear regression model Y = β 0+β 1 x 1+ε, where Y is transcript expression, x 1 age, and ε random error.

Table 6

Genes whose expression increases with age in human lymphocytes.

SymbolDescriptionChr.Pos P valuea R2
IGFBP3insulin-like growth factor binding protein 37p1.7(10−70)0.22
SYT11synaptotagmin XI1q7.4(10−60)0.19
GZMHgranzyme H (cathepsin G-like 2, protein h-CCPX)14q2.0(10−48)0.16
JAKMIP1janus kinase and microtubule interacting protein 14p5.4(10−45)0.15
RCAN2regulator of calcineurin 26p1.5(10−43)0.14
CRIP1cysteine-rich protein 1 (intestinal)14q1.9(10−39)0.13
PATL2protein associated with topoisomerase II homolog 2 (yeast)1.2(10−38)0.13
MSCmusculin (activated B-cell factor-1)8q5.9(10−32)0.11
GDPD5glycerophosphodiester phosphodiesterase domain containing 511q3.1(10−30)0.10
APOBEC3Hapolipoprotein B mRNA editing enzyme, catalytic polypeptide-like 3H22q2.3(10−29)0.10
CCL5chemokine (C-C motif) ligand 517q7.1(10−29)0.10
GFI1growth factor independent 1 transcription repressor1p2.4(10−28)0.09
MANEALmannosidase, endo-alpha-like1p2.9(10−28)0.09
KIF21Akinesin family member 21A12q3.1(10−27)0.09
GPR137BG protein-coupled receptor 137B1q5.4(10−27)0.09
PDGFRBplatelet-derived growth factor receptor, beta polypeptide5q3.7(10−26)0.09
PCBP4poly(rC) binding protein 43p8.1(10−26)0.09
B3GAT1beta-1,3-glucuronyltransferase 1 (glucuronosyltransferase P)11q1.1(10−25)0.08
LLGL2lethal giant larvae homolog 2 (Drosophila)17q4.7(10−25)0.08
LAG3lymphocyte-activation gene 312p5.1(10−25)0.08
PPP2R2Bbeta isoform of regulatory subunit B55, protein phosphatase 2 isoform b5q1.5(10−24)0.08
hypothetical gene supported by BC0400609.1(10−24)0.08
PRSS23protease, serine, 2311q1.5(10−23)0.08
B4GALT5UDP-Gal:betaGlcNAc beta 1,4- galactosyltransferase, polypeptide 520q1.9(10−22)0.07
MXRA7matrix-remodelling associated 717q3.0(10−22)0.07

P-value with the null hypothesis β 1 = 0 in linear regression model Y = β 0+β 1 x 1+ε, where Y is transcript expression, x 1 age, and ε random error.

Figure 2

Scatter plots and fitted linear regression lines of the most significantly associated down-regulated (a–c) and up-regulated (d–f) genes with age in lymphocyte samples.

Table 7

Terms in the Gene Ontology and KEGG pathway databases over-represented among genes that decreased expression with advancing age in human lymphocytes (1,878 genes in total 13,232).

CategoryTerm P valuea BonferroniFoldb Increasing groupc
Foldb P valued
GO Bio.Processtranslation4.2(10−21)2.0(10−17)2.190.608.7(10−4)
cellular biosynthetic process6.7(10−21)3.2(10−17)1.830.791.5(10−2)
gene expression4.0(10−18)1.9(10−14)1.360.703.1(10−11)
ribosome biogenesis and assembly1.5(10−8)7.2(10−5)2.900.235.9(10−3)
tRNA metabolic process2.9(10−8)1.4(10−4)2.570.524.7(10−2)
RNA metabolic process2.2(10−7)1.0(10−3)1.240.716.9(10−9)
RNA processing8.9(10−7)4.2(10−3)1.610.232.5(10−11)
tRNA processing4.3(10−6)2.0(10−2)2.820.632.2(10−1)
rRNA processing8.4(10−6)3.9(10−2)2.800.161.1(10−2)
GO Cell.Componentribosome2.0(10−23)1.5(10−20)3.020.091.1(10−8)
ribosomal subunit4.2(10−21)3.3(10−18)3.590.081.1(10−5)
ribonucleoprotein complex1.6(10−20)1.3(10−17)2.190.223.5(10−12)
organelle lumen9.0(10−16)7.0(10−13)1.710.552.2(10−7)
mitochondrion5.0(10−12)3.9(10−9)1.620.607.4(10−6)
small ribosomal subunit2.7(10−11)2.2(10−8)3.720.001.1(10−3)
large ribosomal subunit1.4(10−10)1.1(10−7)3.450.144.6(10−3)
cytosolic part3.1(10−10)2.5(10−7)2.600.638.0(10−2)
nucleolus5.6(10−10)4.4(10−7)2.420.478.7(10−3)
intracellular organelle part4.9(10−9)3.8(10−6)1.240.831.4(10−4)
mitochondrial part6.4(10−9)5.0(10−6)1.720.631.9(10−3)
mitochondrial matrix1.5(10−8)1.2(10−5)2.250.391.8(10−3)
mitochondrial small ribosomal subunit4.5(10−5)3.4(10−2)4.250.001.1(10−1)
GO Mol.Functionstructural constituent of ribosome9.4(10−22)2.4(10−18)3.080.101.2(10−7)
nucleic acid binding2.4(10−19)6.0(10−16)1.400.641.1(10−14)
RNA binding2.6(10−19)6.7(10−16)1.950.341.3(10−11)
methyltransferase activity3.0(10−6)7.6(10−3)2.110.331.3(10−3)
oxidoreductase activity (NAD or NADP)5.2(10−6)1.3(10−2)2.600.763.1(10−1)
KEGG PathwayRibosome1.1(10−17)2.2(10−15)3.970.003.4(10−5)

EASE-score, P-value of a modified Fisher's exact test for overrepresentation [35]. bFold enrichment in each gene group compared to the base set. cGene group with expression that increases with advancing age (same genes used in Table 8, 1430 genes). dHypergeometric test for underrepresentation using annotated genes from a total set of 13,232 genes as a base population [36].

Table 8

Terms in the Gene Ontology and KEGG pathway databases over-represented among genes that increased expression with advancing age in human lymphocytes (1,430 genes in total 13,232).

CategoryTerm P valuea BonferroniFoldb Decreasing groupd
Foldb P valuee
GO Bio.Processsignal transduction2.1(10−10)1.0(10−6)1.340.728.8(10−11)
immune response9.2(10−10)4.3(10−6)1.841.026.0(10−1)
defense response7.0(10−9)3.3(10−5)1.900.762.7(10−2)
response to external stimulus1.3(10−8)6.2(10−5)1.840.661.4(10−3)
cytoskeleton organization and biogenesis6.8(10−7)3.2(10−3)1.780.582.4(10−4)
positive regulation of cellular process9.6(10−7)4.5(10−3)1.510.974.1(10−1)
response to wounding7.2(10−6)3.3(10−2)1.800.623.0(10−3)
cell adhesion7.6(10−6)3.5(10−2)1.640.716.0(10−3)
GO Cel.Componentplasma membrane7.0(10−11)5.5(10−8)1.420.621.0(10−13)
membrane part1.1(10−7)8.9(10−5)1.210.793.5(10−10)
cytoskeleton1.4(10−7)1.1(10−4)1.560.691.5(10−4)
integral to plasma membrane2.3(10−6)1.8(10−3)1.490.691.0(10−4)
GO Mol.Functionprotein binding9.6(10−10)2.4(10−6)1.160.933.3(10−3)
actin binding9.1(10−7)2.3(10−3)2.070.574.4(10−3)
signal transducer activity1.3(10−6)3.2(10−3)1.370.794.8(10−4)
GTPase activity1.4(10−6)3.5(10−3)2.240.622.7(10−2)
receptor binding1.5(10−6)3.7(10−3)1.680.761.8(10−2)
GTP binding1.5(10−5)3.9(10−2)1.790.862.0(10−1)
KEGG PathwayRegulation of actin cytoskeleton3.1(10−9)6.0(10−7)2.440.526.4(10−3)
Natural killer cell mediated cytotoxicity1.3(10−5)2.6(10−3)2.250.262.3(10−4)
Focal adhesion3.2(10−5)6.4(10−3)2.020.551.2(10−2)

EASE-score, P-value of a modified Fisher's exact test for overrepresentation [35]. bFold enrichment in each gene group compared to the base set. cGene group with expression that decreases with advancing age (same genes used in Table 7, 1878 genes). eHypergeometric test for underrepresentation using annotated genes from a total set of 13,232 genes as a base population [36].

P-value with the null hypothesis β 1 = 0 in linear regression model Y = β 0+β 1 x 1+ε, where Y is transcript expression, x 1 age, and ε random error. P-value with the null hypothesis β 1 = 0 in linear regression model Y = β 0+β 1 x 1+ε, where Y is transcript expression, x 1 age, and ε random error. EASE-score, P-value of a modified Fisher's exact test for overrepresentation [35]. bFold enrichment in each gene group compared to the base set. cGene group with expression that increases with advancing age (same genes used in Table 8, 1430 genes). dHypergeometric test for underrepresentation using annotated genes from a total set of 13,232 genes as a base population [36]. EASE-score, P-value of a modified Fisher's exact test for overrepresentation [35]. bFold enrichment in each gene group compared to the base set. cGene group with expression that decreases with advancing age (same genes used in Table 7, 1878 genes). eHypergeometric test for underrepresentation using annotated genes from a total set of 13,232 genes as a base population [36]. Based upon the identified lists of age-related genes for both brain and lymphocyte samples, we also explored for differences in basic genomic architecture. Our guide for this was a detailed analysis recently presented on characteristics of housekeeping genes [13]. The results of this analysis are presented in Tables 9 and 10, where we document the differences between the up and down-regulated sets in comparison with the characteristics of the non-regulated genes. In summarizing these results, while we found some striking differences, for example with coding sequence length in brain, we note that this was not replicated in lymphocytes. Instead we highlight a single category that does appear to be replicating, namely the ratio of average intron to average exon sequence length. We used second-order factorial ANOVA models to perform combined analyses for all variables. For this, the intron/exon sequence ratio is highly significant (F2,24158 = 41.57, P = 9.5×10−19) with no evidence of interaction between the two sets (P = 0.69). This can be taken in context with the same analysis for coding sequence length, where the tissue by group (group defined by up, down, or non-regulated genes) interaction term was highly significant (P<0.0001). For all other combined analyses, the main group effect was either non-significant or the interaction term was highly significant (P<0.0001).
Table 9

Genomic architecture comparisons for age associated genes in human brain samples.

Negative asso.a Positive asso.b UnregulatedP-value
(n = 1,369)(n = 1,746)(n = 9,405)
pre-mRNA length75652±456053683±201862344±11861.9(10−1)
Coding sequence length1311±282057±441722±187.0(10−60)
Number of exons9.2±0.213.0±0.311.2±0.11.1(10−22)
Total intron length74662±463651237±203360622±12021.4(10−1)
5′ UTR length215±6227±6210±23.0(10−2)
3′ UTR length1155±341219±291202±133.6(10−1)
Average intron of ea. transcript8651±4855312±2496648±1441.5(10−9)
Average exon of ea. transcript424±14435±13429±59.9(10−1)
Average intron / average exon28.7±1.516.4±0.721.9±0.55.7(10−10)

All data are base pair ±SEM. P-values were calculated from log10 transformed data using ANOVA. aNegatively associated genes were those whose expression decreases with age-at-death (P<0.05). bPositively associated genes were those for which expression increased with age-at-death (P<0.05). Unregulated genes were those not significantly correlated with age.

Table 10

Genomic architecture comparisons for age associated genes in human lymphocyte samples.

Negative asso.a Positive asso.b UnregulatedP-value
(n = 2,364)(n = 1,980)(n = 7,898)
pre-mRNA length51908±175549489±188054879±11151.2(10−6)
Coding sequence length1724±521628±511710±186.9(10−3)
Number of exons11.5±0.210.7±0.211.1±0.12.1(10−5)
Total intron length49705±176447549±189353364±11357.5(10−9)
5′ UTR length195±4211±5208±21.0(10−3)
3′ UTR length1104±251134±281185±153.9(10−4)
Average intron of ea. transcript5497±2095463±2565651±1101.5(10−6)
Average exon of ea. transcript381±10420±11435±67.7(10−11)
Average intron / average exon20.5±1.017.7±0.719.2±0.47.1(10−10)

All data are base pair ±SEM. P-values were calculated from log10 transformed data using ANOVA. aNegatively associated genes were those whose expression decreases with age (P<0.05). bPositively associated genes were those for which expression increased with age (P<0.05). Unregulated genes were those not significantly correlated with age.

All data are base pair ±SEM. P-values were calculated from log10 transformed data using ANOVA. aNegatively associated genes were those whose expression decreases with age-at-death (P<0.05). bPositively associated genes were those for which expression increased with age-at-death (P<0.05). Unregulated genes were those not significantly correlated with age. All data are base pair ±SEM. P-values were calculated from log10 transformed data using ANOVA. aNegatively associated genes were those whose expression decreases with age (P<0.05). bPositively associated genes were those for which expression increased with age (P<0.05). Unregulated genes were those not significantly correlated with age. Finally, we used these two samples to pursue the question of whether variance in gene expression itself changes with age. Our hypothesis was that variance might increase with age as a consequence of accumulating somatic mutation [11] and/or increasingly heterogeneous environmental exposures. For the brain sample, an F-test for equality of variances was conducted on all 14,078 transcripts by dividing the sample in two groups according to the median age. We observed only 10 genes that exceeded a strict multiple test correction threshold and in each case there was evidence that the distributions of these genes deviated from normality. There was no overlap in this set for the highest scoring candidates with genes found to change with age in the linear regression analyses. There were 6 cases where an age-related gene also had a borderline significant F-test (P<0.05). This proportion was not significantly different than what could be found in the entire set (not shown). For the lymphocyte set, we took a slightly different approach due to its size and rather than dividing the sample at the median chose to examine decile bins and applied a Levene's robust variance test to explore for differences across age groups. There was again no strong evidence that variance differed across these groupings, the most significant finding being for the NLP gene (P = 2.3×10−6), and with only 2 genes in total attaining significance after multiple test correction. As a final closing note, we enumerated the number of statistical tests used in this study in its entirety, arriving at an approximate number of 110,000. This may be used as a reference for any of the un-corrected P-values that are presented.

Discussion

We consider the most important finding in this study to be that expression levels of genes involved in mitochondrial processes are decreasing with age, as originally proposed by Zahn et al. and also supported by Miller et al. [6], [16]. In the former study, a key observation was that this is apparent in multiple tissues as well as in species other than humans. The present analysis provides a confirmation of this as a common characteristic of aging, in that this evident in both brain and lymphocytes. For these two samples there are also tissue specific themes that have emerged that we consider a validation of the quality of the expression phenotypes originally obtained. In the brain sample, consistent with previous results [11], a decrease in genes involved in synaptic function was observed, which follows from the documented changes in synaptic function that occur with age [8]. In lymphocytes, there was evidence that genes involved in the immune response increase expression with age, which might be regarded as a reflection of chronic persisting viruses such as cytomegalovirus (CMV) [17]. While we acknowledge central differences in these samples that include ethnicity, the age ranges, sample size, and the cell types represented, in this discussion we highlight these and other pathways with a comparison of the two different tissues as a guide. A key distinction in contrasting the study by Zahn et al. was the specific focus on genes of the mitochondrial respiratory chain [6]. While many of these are included in our emergent lists of genes of the mitochondrion (see Tables S1 and S3), results indicate that the effects of age on mitochondrial function may be broader. For example, in the brain the most significant finding across all pathways was for cytoplasm genes (Table 3). Although this categorization includes all genes of the mitochondrion, and indeed our results suggest it is the latter grouping that contributes to the strength of the statistic, there remain additional genes that may be enriched in this set (see Table S1). Oxidative phosphorylation itself did emerge as a highly significant term associated with aging in the brain sample, but not in lymphocytes, indicating that there are likely to be tissue specific differences in the component genes related to mitochondrial function (Tables 3 and 7). Nonetheless, these data together with previous studies provide an intriguing foundation to investigate if this is a primary event in the aging process or derivative of a general decrease in energy metabolism and activity that accompanies old age (e.g. [18]). That transcript levels are both decreasing and increasing is an important indication that the aging process does not lead to a unidirectional decline in expression. While this was evident in both brain and lymphocytes, in each case the relative fold-enrichment of gene categories was higher among negatively correlated transcripts. Thus, the up-regulated categories consisted of more unique transcripts in general than those that were down-regulated. In brain we noted several pathways that appear to have component genes that increase in expression with advancing age, the foremost among these encompassing genes that encode nuclear proteins. The most significant categorizations that emerged included a large contingent of genes related to DNA-binding and transcription, the most abundant being zinc-finger proteins. Although genes related to transcription were highlighted previously, results suggested a mix of both positive and negative regulation [11]. In contrast however, Miller et al. did observe an overlap of genes related to transcription that were increasing in relation to both ageing and to Alzheimer's disease (AD) [16]. In lymphocytes, immune response was the most significant category associated with increasing expression. However, there are several issues that are impossible to resolve for this latter result, especially regarding which cell types are represented in this sample (e.g. [19]). For example, this might relate to an accumulation of highly differentiated T cells due to acceleration by persistent CMV infection but this would require extensive further study [17]. There were also some interesting highlights among the many individual genes that were up-regulated with age in lymphocytes. In particular the most significant up-regulated gene IGFBP3 is intriguing in the context that another member of the insulin-like growth factor binding protein family, IGFBP7 has recently been implicated as an inducer of apoptosis in human melanoma cell lines [20], [21]. IGFBP3 itself has also been shown to be increased in senescent human fibroblasts [22]. Those findings together with results of the present study provide support for the role of insulin-like growth factor signaling in cellular senescence. For the brain sample, we were intrigued by the emergent pattern that expression of cytoplasm genes may be decreasing with age, while expression of nuclear genes may be increasing with age. While this might reflect the large number of mitochondrial and transcription genes in these particular sets, we cannot ignore the possibility that a basic change in cell morphology might be at play. One explanation may be that neuronal number remains stable with age, but synaptic vesicle density decreases [7], [8]. This relative change might give rise to our observations here. In support of this, genes related to neuronal projections (which includes dendrite genes) and synapse genes are major categories that appear to decrease with age (see Table 3). Also in support of this, the highest ranking individual gene, SVOP, encoding the SV2 related protein, is localized to the synaptic vesicle and appears to be an ion transporter. We had anticipated that more neuronal specific genes would exhibit a decreasing pattern, but noted that even some common neuronal reference genes, such as ENO2 (ranking 7597th), were not associated with age. Thus, in contrast to the enrichment of neuronal projection genes mentioned above (and see Table 3), this might support the concept that a decrease in neuronal number is not a major feature of aging [23]. A potentially important source of confounding in analyses of post-mortem brain samples is the mode of death, which we have not examined in more detail in this study. In particular, agonal state and pH have been highlighted as contributors to brain mRNA expression [24]–[26]. The present study has strong similarities with the study by Li et al., (2004)[25]. However, the relationship between pH and age was previously explored, but there was no evidence of a significant correlation [24], [27]. Interestingly, although it was a small sample Vawter et al. also noted significant correlations of age with mitochondrial genes regardless of agonal state (see specifically supplementary table 4 from [24]). Nonetheless, one explanation might be that transcription factor activation and mitochondrial deactivation is a natural response to hypoxic stress which may be more common in elderly individuals (e.g. [25], [28]). Another possibility might be that both aging and extended hypoxia dependent on the mode of death share similarities in terms of gene regulation. For the lymphocyte sample, the most significantly enriched terms were found in the class of negatively regulated genes, where the ribosomal and translation machinery appears to be strongly affected by age. This could potentially contribute to “immunosenescence”, whereby the immune system decays with time, rendering elderly individuals more susceptible to infectious diseases [29]. While changes at the expression level for this class of genes have to our knowledge not been previously reported on this scale, the concept that protein synthesis is involved in immunosenescence is not new [30]. This result is in our view the most striking difference between lymphocytes and brain. Although this suggests a more general decreases in metabolism in aging lymphocytes, we consider this an important reminder that tissue specificity can play a major role in gene expression profiling [31]. Another important feature of this study is the relative degree of stability in gene expression across the age spectrum. To put this in context, the strongest effects of age on any genes were 18% variance explained for TAC3 in brain and 29% for LRRN3 in lymphocytes, but in examining Tables 1, 2, 5 and 6, it is apparent that effect sizes drop off rapidly. For each of these top-ranked genes there was an approximately 2-fold change in average expression levels over the age ranges. This is comparable to well documented changes in hormone levels with age, such as for IGF1 [2], [32]. Also of note, genes claimed to be changing with age previously exhibited differences between young and older groups in the range of 2–3 fold [11]. This again might suggest that cell-loss and/or changes in organ function/morphology are more important than broad changes in gene expression in the decrease in health that typically accompanies increasing age. The possibility however cannot be excluded that relatively small changes in gene expression have a large impact on cell function. As an example, disorders such as Parkinson's disease (PD) can be caused by simple gene dosage effects, as has now been shown for the SNCA gene which encodes alpha-synuclein [33]. Although rare, we noted several genes that may be changing to a larger degree, but which fell below a strict significance threshold, for example ZIC3 in brain. However, there were no similar cases in the much more powerful lymphocyte sample. Finally, we also explored basic gene architecture characteristics in an attempt to provide some insight into why expression might vary with age. The only parameter that emerged as significant and equivalent in both brain and lymphocytes was the intron/exon length ratio, this representing a metric of gene “compactness” [13]. This might be interpreted as suggesting that non-compact genes are more susceptible to mutations that disrupt regulation, and thus lead to decreasing expression with advancing age. However, we cannot exclude the possibility that what we are seeing is simply a result of the gene pathways that are over-represented, these being enriched for compact or non-compact genes. In other words, if specific pathways are affected by age, the genes that represent those pathways may have similar features. We consider this nonetheless an intriguing finding in that it is replicating in different tissues. In summary, we validate previous findings that a decrease in mitochondrial gene expression appears to be a common theme in the aging process. Whether this is a primary event that causes a decline in health with advancing age or a result of a general decrease in metabolism in the elderly remains a topic for further investigation. We also highlight additional novel pathways that may be age dependent but with dramatic differences between tissues, in particular with genes related to transcription and to translation. These results may provide a valuable foundation for understanding the molecular consequences of aging and emphasize the development of catalogues of senescence-related genes in additional tissues.

Materials and Methods

Human Samples

The primary sample used in this study consisted of 191 individuals with ages-at-death data ranging from 65–100 years for which brain autopsy tissue was obtained. Expression level data were obtained using the Illumina HumanRefSeq-8 expression BeadChip platform for a total of 14,078 transcripts in which expression was detected in 5 or more individuals. Detailed descriptions of the human samples as well as the expression profiling protocol are provided in the original publication for which this sample was presented [12]. To create a working dataset, expression phenotypes and covariate data were merged that included age-at-death, post-mortem interval (pmi), gender, brain region, and transcript detection rates. From this, we generated 2 additional covariates for 1) total average expression level from all transcripts and 2) total average expression level for all transcripts that were detected in all individuals (5269 transcripts fulfilled the latter criteria). All individual transcript levels, the global average transcript levels, and pmi were log10 transformed prior to inclusion in analyses. The second sample consisted of 1240 individuals ranging in age from 15-94 in which fresh blood lymphocytes had been obtained and mRNA extracted. Expression phenotyping was conducted using the Illumina Sentrix Human Whole Genome (WG-6) Series I BeadChip platform. In total there were 19,648 individual transcript measures, representing approximately 18,500 genes. Details of the sample and expression protocols have been provided previously in the original publication describing this sample [15]. For our analysis, we used the normalized expression phenotypes without further transformation.

Correlation of Age and Transcript Level

We classified the extremes as outliers that are expected to be observed once or more in 1930 individual log transformed transcript estimates with assumption of independence and normal distribution of 193 measures for each transcript.where α′ is the alpha level for each measure, α is 0.05 and k 193. Based on normal distribution assumption,where Φ is standard normal cumulative distribution fuction, z is z-score of each measure. The normality of log transformed observed values for each transcript was tested by means of a Shapiro-Wilk W test. To assess differences of transcript detection rates or global expression levels across brain regions, as well as to test differences in characteristics of genomic architecture between age-related gene sets, ANOVA was used. Contingency tables (for example comparing the proportions of housekeeping genes) were evaluated by means of a standard chi-square test. All statistical analyses not related to pathway based tests were performed using STATA se 9.0. For the brain sample, the dependency of expression level on age-at-death was tested by fitting a linear regression model for each transcript:where Y is the log10 transformed expression level of the transcript, x 1 age-at-death, x 2 log10 of global expression, β intercept, β and β slope of each independent variable, and ε random error. For the lymphocyte sample, the same test was applied but without the global expression covariate according to the following:where Y is the normalized expression level of each transcript, x 1 age, β intercept, β slope of age, and ε random error. We also allowed for non-independence of expression measures given the relatedness of family members in the lymphocyte sample [34].

Gene Ontology and KEGG Pathway Analyses

The Entrez Gene IDs, symbols, and descriptions of genes for all tested transcripts were attained by Entrez Programming Utilities (eUtils) using GI number with the aid of a Perl script. This facilitated a search for replaced sequence identifiers and extracted information of interest by scanning the output text files from Entrez eUtils on the transcripts. Using the attained Entrez Gene IDs as identifiers of genes, we obtained the total lists of both Gene Ontology terms and KEGG pathway descriptors, with which our selected set of genes was annotated and analyzed using the DAVID functional annotation tool [14] with the most recent data update (January, 2008). The EASE scores [35] from the DAVID tool were used in trimming to over-represented term lists, with Bonferroni corrections of the scores from the tool as an inclusion criteria of Tables 3, 4, 7 and 8. We scrutinized ancestor-descendant relationships in the Gene Ontology structure among the terms in the enrichment lists on the basis of is_a, part_of, and regulates relationship by scanning the master ontology file which was updated in April, 2008. Our goal with this was to determine if ancestor terms had emerged as significant primarily because of enrichment of one of their descendant terms. We detail our strategy for this in Figure S1. Among the GO terms in the list, those which had no descendants were labeled as ‘end-terms’. For every term excluding end-terms, we created the artificial descendants. These were intended to represent the complement of the set of genes annotated with a descendant term in the list with respect to all genes in the ancestor. The artificial term for every descendant of every ancestor in the list was then tested to determine if it was over-represented by applying a hypergeometric test [36]. The ancestors for which all artificial descendant terms were over-represented at P<0.005 were labeled ‘significant ancestor terms’. The terms whose artificial descendants for end-terms or significant ancestors were over-represented were added to the set of ‘significant ancestor terms’. After modifying the set, the terms in the set were checked if they fulfilled inclusion criteria (enriched complement at P<0.005) and the set was updated iteratively until there was no change. The end-terms and significant ancestor terms are listed in Tables S1, S2, S3, S4. The remaining terms in the enrichment list follow in the same tables that are significantly over-represented mainly by a highly enriched descendant. P-values of the enrichment test for the artificial terms are also shown in the Tables S1, S2, S3, S4.

Genomic Architecture of Age-related Gene Sets

The genomic positions of start and end points of transcripts, their coding sequences, and their exons and introns in both brain and lymphocyte gene sets were collected from the UCSC genome browser using Genbank accession numbers (e.g. NM_018711) as identifiers. Information on some of the transcripts that were detected in both samples was not available from the UCSC browser due to the records being suppressed. Therefore, the number of transcripts in the genomic architecture analysis was reduced to 12,520 and 12,242 for brain and lymphocyte samples, respectively. Based on the positional data, lengths of pre-mRNA sequences, coding sequence, total intron, 5′UTR, 3′UTR, average intron and average exon of each transcript, as well as the number of exons and the ratio of intron per coding sequence length were calculated and transformed on a log10 scale. For comparative analyses of gene characteristics, the transcripts were divided into 3 groups, up-regulated genes (positively correlated with age at a P<0.05 threshold), down-regulated (negatively correlated with age at a P<0.05 threshold), and a set of genes that were not significantly altered over the age spectrum. Differences between groups were assessed using ANOVA. Combined analyses were performed on both sets using second order factorial ANOVA with sample source as a covariate.

URLs

Entrez Gene: http://www.ncbi.nlm.nih.gov/sites/entrezdbgene; Entrez eUtils: http://eutils.ncbi.nlm.nih.gov/entrez/eutils; DAVID 2008: http://david.abcc.ncifcrf.gov/; Gene Ontology (GO): http://www.geneontology.org/; UCSC genome browser: http://genome.ucsc.edu/ Overrepresented GO term analysis (0.06 MB PDF) Click here for additional data file. Terms in the Gene Ontology and KEGG pathway databases enriched among genes that decreased expression with advancing age in brain (1450 genes in total 13,216) (0.15 MB PDF) Click here for additional data file. Terms in the Gene Ontology and KEGG pathway databases enriched among genes that increased expression with advancing age in brain (1943 genes in total 13,216) (0.09 MB PDF) Click here for additional data file. Terms in the Gene Ontology and KEGG pathway databases enriched among genes that decreased expression with advancing age in the human lymphocytes (1878 genes in total 13,232) (0.13 MB PDF) Click here for additional data file. Terms in the Gene Ontology and KEGG pathway databases enriched among genes that increased expression with advancing age in the human lymphocytes (1430 genes in total 13,232) (0.12 MB PDF) Click here for additional data file.
  36 in total

1.  Global functional profiling of gene expression.

Authors:  Sorin Draghici; Purvesh Khatri; Rui P Martins; G Charles Ostermeier; Stephen A Krawetz
Journal:  Genomics       Date:  2003-02       Impact factor: 5.736

2.  A test of evolutionary theories of aging.

Authors:  Kimberly A Hughes; Julie A Alipaz; Jenny M Drnevich; Rose M Reynolds
Journal:  Proc Natl Acad Sci U S A       Date:  2002-10-17       Impact factor: 11.205

3.  DAVID: Database for Annotation, Visualization, and Integrated Discovery.

Authors:  Glynn Dennis; Brad T Sherman; Douglas A Hosack; Jun Yang; Wei Gao; H Clifford Lane; Richard A Lempicki
Journal:  Genome Biol       Date:  2003-04-03       Impact factor: 13.583

Review 4.  Regulation of pH in the mammalian central nervous system under normal and pathological conditions: facts and hypotheses.

Authors:  Marta Obara; Monika Szeliga; Jan Albrecht
Journal:  Neurochem Int       Date:  2007-10-26       Impact factor: 3.921

5.  T cell subset-specific susceptibility to aging.

Authors:  Marta Czesnikiewicz-Guzik; Won-Woo Lee; Dapeng Cui; Yuko Hiruma; David L Lamar; Zhi-Zhang Yang; Joseph G Ouslander; Cornelia M Weyand; Jörg J Goronzy
Journal:  Clin Immunol       Date:  2008-01-28       Impact factor: 3.969

Review 6.  Aging of the immune system as a prognostic factor for human longevity.

Authors:  Anis Larbi; Claudio Franceschi; Dawn Mazzatti; Rafael Solana; Anders Wikby; Graham Pawelec
Journal:  Physiology (Bethesda)       Date:  2008-04

Review 7.  Unexpected pieces to the senescence puzzle.

Authors:  Karen Cichowski; William C Hahn
Journal:  Cell       Date:  2008-06-13       Impact factor: 41.582

8.  A survey of genetic human cortical gene expression.

Authors:  Amanda J Myers; J Raphael Gibbs; Jennifer A Webster; Kristen Rohrer; Alice Zhao; Lauren Marlowe; Mona Kaleem; Doris Leung; Leslie Bryden; Priti Nath; Victoria L Zismann; Keta Joshipura; Matthew J Huentelman; Diane Hu-Lince; Keith D Coon; David W Craig; John V Pearson; Peter Holmans; Christopher B Heward; Eric M Reiman; Dietrich Stephan; John Hardy
Journal:  Nat Genet       Date:  2007-11-04       Impact factor: 38.330

9.  A systems level analysis of transcriptional changes in Alzheimer's disease and normal aging.

Authors:  Jeremy A Miller; Michael C Oldham; Daniel H Geschwind
Journal:  J Neurosci       Date:  2008-02-06       Impact factor: 6.167

Review 10.  Mechanisms of ageing: public or private?

Authors:  Linda Partridge; David Gems
Journal:  Nat Rev Genet       Date:  2002-03       Impact factor: 53.242

View more
  41 in total

1.  Multicenter validation of the diagnostic accuracy of a blood-based gene expression test for assessing obstructive coronary artery disease in nondiabetic patients.

Authors:  Steven Rosenberg; Michael R Elashoff; Philip Beineke; Susan E Daniels; James A Wingrove; Whittemore G Tingley; Philip T Sager; Amy J Sehnert; May Yau; William E Kraus; L Kristin Newby; Robert S Schwartz; Szilard Voros; Stephen G Ellis; Naeem Tahirkheli; Ron Waksman; John McPherson; Alexandra Lansky; Mary E Winn; Nicholas J Schork; Eric J Topol
Journal:  Ann Intern Med       Date:  2010-10-05       Impact factor: 25.391

Review 2.  Developing Peripheral Blood Gene Expression-Based Diagnostic Tests for Coronary Artery Disease: a Review.

Authors:  Brian Rhees; James A Wingrove
Journal:  J Cardiovasc Transl Res       Date:  2015-06-25       Impact factor: 4.132

Review 3.  The RNA world of human ageing.

Authors:  J C Gomez-Verjan; E R Vazquez-Martinez; N A Rivero-Segura; R H Medina-Campos
Journal:  Hum Genet       Date:  2018-11-01       Impact factor: 4.132

Review 4.  Gene expression in the human brain: the current state of the study of specificity and spatiotemporal dynamics.

Authors:  Oksana Yu Naumova; Maria Lee; Sergei Yu Rychkov; Natalia V Vlasova; Elena L Grigorenko
Journal:  Child Dev       Date:  2012-11-12

5.  Pervasive Effects of Aging on Gene Expression in Wild Wolves.

Authors:  Pauline Charruau; Rachel A Johnston; Daniel R Stahler; Amanda Lea; Noah Snyder-Mackler; Douglas W Smith; Bridgett M vonHoldt; Steven W Cole; Jenny Tung; Robert K Wayne
Journal:  Mol Biol Evol       Date:  2016-04-15       Impact factor: 16.240

6.  Human aging is characterized by focused changes in gene expression and deregulation of alternative splicing.

Authors:  Lorna W Harries; Dena Hernandez; William Henley; Andrew R Wood; Alice C Holly; Rachel M Bradley-Smith; Hanieh Yaghootkar; Ambarish Dutta; Anna Murray; Timothy M Frayling; Jack M Guralnik; Stefania Bandinelli; Andrew Singleton; Luigi Ferrucci; David Melzer
Journal:  Aging Cell       Date:  2011-07-19       Impact factor: 9.304

7.  Genotype×age interaction in human transcriptional ageing.

Authors:  Jack W Kent; Harald H H Göring; Jac C Charlesworth; Eugene Drigalenko; Vincent P Diego; Joanne E Curran; Matthew P Johnson; Thomas D Dyer; Shelley A Cole; Jeremy B M Jowett; Michael C Mahaney; Anthony G Comuzzie; Laura Almasy; Eric K Moses; John Blangero; Sarah Williams-Blangero
Journal:  Mech Ageing Dev       Date:  2012-07-31       Impact factor: 5.432

8.  Genome-wide DNA methylation differences between late-onset Alzheimer's disease and cognitively normal controls in human frontal cortex.

Authors:  Kelly M Bakulski; Dana C Dolinoy; Maureen A Sartor; Henry L Paulson; John R Konen; Andrew P Lieberman; Roger L Albin; Howard Hu; Laura S Rozek
Journal:  J Alzheimers Dis       Date:  2012       Impact factor: 4.472

9.  Mitochondrial DNA damage in a mouse model of Alzheimer's disease decreases amyloid beta plaque formation.

Authors:  Milena Pinto; Alicia M Pickrell; Hirokazu Fukui; Carlos T Moraes
Journal:  Neurobiol Aging       Date:  2013-05-21       Impact factor: 4.673

10.  Transcriptomics of cortical gray matter thickness decline during normal aging.

Authors:  P Kochunov; J Charlesworth; A Winkler; L E Hong; T E Nichols; J E Curran; E Sprooten; N Jahanshad; P M Thompson; M P Johnson; J W Kent; B A Landman; B Mitchell; S A Cole; T D Dyer; E K Moses; H H H Goring; L Almasy; R Duggirala; R L Olvera; D C Glahn; J Blangero
Journal:  Neuroimage       Date:  2013-05-24       Impact factor: 6.556

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.