| Literature DB >> 24098339 |
Kajia Cao1, Paul Ryvkin, Yih-Chii Hwang, F Brad Johnson, Li-San Wang.
Abstract
Several recent gene expression studies identified hundreds of genes that are correlated with age in brain and other tissues in human. However, these studies used linear models of age correlation, which are not well equipped to model abrupt changes associated with particular ages. We developed a computational algorithm for age estimation in which the expression of each gene is treated as a dichotomized biomarker for whether the subject is older or younger than a particular age. In addition, for each age-informative gene our algorithm identifies the age threshold with the most drastic change in expression level, which allows us to associate genes with particular age periods. Analysis of human aging brain expression datasets from three frontal cortex regions showed that different pathways undergo transitions at different ages, and the distribution of pathways and age thresholds varies across brain regions. Our study reveals age-correlated expression changes at particular age points and allows one to estimate the age of an individual with better accuracy than previously published methods.Entities:
Mesh:
Year: 2013 PMID: 24098339 PMCID: PMC3789733 DOI: 10.1371/journal.pone.0074578
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Microarray datasets used in this study.
| ID | tissue | Number of samples | Age range | Gender | GEO ID |
| H1 | Rostral aspect of frontal cortex | 29 | 26∼95 | F: 11 | GDS707 |
| (∼BA10) | M: 18 | ||||
| H2 | Dorsolateral prefrontal cortex | 29 | 25∼79 | F: 7 |
|
| (BA9) | M: 22 | ||||
| H3 | Orbital prefrontal cortex | 27 | 28∼77 | F: 6 |
|
| (BA47) | M: 21 |
: obtained from the authors directly.
Contingency table for probeset g at age a = δ where j≤m samples (m = 1, 2, …n-1).
|
|
| |
|
|
|
|
|
|
|
|
Best number of genes (N) used in age estimation and the difference of median of age is the absolute difference between the median of estimated age and the median of chronological age.
| BA10 | BA9 | BA47 | |
| Error in age estimation (five-fold cross validation) | 14.43±11.13 | 9.48±6.85 | 8.33±7.56 |
| Best | 70 | 50 | 10 |
| Difference of median of age (with actual age) | 9.5 | 0.5 | 0.5 |
| p values (permutation test) | 0.009 | 0.002 | 0.001 |
The significance of the error was determined by obtaining 1,000 randomized cross-validation errors with age information randomly shuffled; the significance of the prediction error is the fraction of the 1,000 randomized errors lower than the actual cross-validation error. The best N varies across regions.
Figure 1Performance of age estimation using the proposed naïve Bayes method and other methods.
The five-fold cross validation results showed age estimation errors of the naïve Bayes model to be 0.14∼4.38 years smaller than in other tested models, thus reducing error by 34%.
Figure 2Histograms of genes with age-regulated transition points within each decade between 25 and 95 years (p-value≤0.005) in three brain regions.
The distribution of age-regulated genes is very different in BA10 compared to BA9 and BA47.
Functional annotation analysis summary.
| BA10 | BA47 | BA9 | ||||||||||||||||||
| Cluster | Description | No. pathways | Y | M | O | Y | M | O | Y | M | O | |||||||||
| Dn | Up | Dn | Up | Dn | Up | Dn | Up | Dn | Up | Dn | Up | Dn | Up | Dn | Up | Dn | Up | |||
|
|
|
|
|
|
|
|
|
| ||||||||||||
| 2 | Fatty acid/membrane metabolism | 4 |
| |||||||||||||||||
|
|
|
|
|
|
| |||||||||||||||
|
|
|
|
|
|
|
|
| |||||||||||||
| 5 | Actin cytoskeleton |
|
| |||||||||||||||||
|
|
|
|
|
|
|
| ||||||||||||||
|
| Translation; DNA damage response |
|
|
| ||||||||||||||||
| 8 | Transcriptional activation |
|
|
|
| |||||||||||||||
| 9 | Neuronal function |
|
|
| ||||||||||||||||
| 10 | Mitochondrial transport |
|
| |||||||||||||||||
| 11 | Cytoskeleton/RNA splicing |
|
| |||||||||||||||||
| 12 | Calcium transport |
|
| |||||||||||||||||
| 13 | Chromatin regulation |
|
| |||||||||||||||||
Figure 3Venn diagram of three human brain regions.
Age-correlated genes (p≤0.005) in three human brain regions have statistically significant overlaps. Shown are the numbers of genes with age-correlated expression in each brain region.
Topological Characteristics of the protein-protein interaction network of age-correlated genes.
| Topological Characteristics in Protein-Protein Interaction Network | ||||||||||||
| Degree | Betweenness Centrality | Closeness Centrality | Clustering Coefficient | |||||||||
| Young | Middle-aged | Old | Young | Middle-aged | Old | Young | Middle-aged | Old | Young | Middle-aged | Old | |
| BA10 | 13.083 | 11.894 | 12.645 | 6.74E-04 | 6.14E-04 | 7.05E-04 | 0.250 | 0.249 | 0.250 | 0.097 | 0.103 | 0.100 |
| Z-score |
|
|
|
|
|
|
|
|
| −1.212 | −0.322 | −0.750 |
| BA9 | 10.237 | 10.774 | 10.844 | 5.15E-04 | 5.45E-04 | 5.30E-04 | 0.247 | 0.247 | 0.250 | 0.094 | 0.101 | 0.094 |
|
|
|
|
| 2.669 |
|
|
|
|
| −1.318 | −0.726 | −1.595 |
| BA47 | 11.254 | 10.516 | 8.573 | 6.29E-04 | 5.29E-04 | 3.78E-04 | 0.247 | 0.246 | 0.244 | 0.102 | 0.102 | 0.087 |
|
|
|
| 0.940 |
|
| 0.453 |
|
| 2.705 | −0.406 | −0.612 | −2.074 |
For each of the three brain regions age-correlated genes divided into three age groups: young (<40 years old), middle-age (between 40 and 59 years old) and old (between 60 and 79 years old).
Characteristics Path Length in-between Groups of age-correlated genes.
| BA10 | BA9 | BA47 | ||||||||
| size | mean |
| size | mean |
| size | mean |
| ||
| Whole Region Overlapped | 1644 | 3.948 | −10.043 | 1703 | 4.003 | −8.834 | 1640 | 4.039 | −6.937 | |
| Group of age correlated genes |
| 833 | 3.895 | −9.014 | 482 | 3.978 | −5.208 | 567 | 4.008 | −4.344 |
|
| 1000 | 3.938 | −8.145 | 1189 | 3.993 | −6.941 | 1197 | 4.018 | −6.998 | |
|
| 770 | 3.897 | −7.929 | 831 | 3.909 | −8.828 | 513 | 4.082 | −2.776 | |
| Y vs. M | 1358 | 3.919 | −11.666 | 1413 | 3.990 | −7.604 | 1384 | 4.017 | −6.858 | |
| M vs. O | 1336 | 3.920 | −12.679 | 1501 | 3.953 | −11.465 | 1463 | 4.056 | −6.000 | |
| O vs. Y | 1300 | 3.901 | −11.170 | 1186 | 3.950 | −9.251 | 1013 | 4.055 | −5.198 | |
The size of each region shows the number of genes having overlap between the protein-protein interaction network and our age-regulated gene study. Size of Y(young) vs. M (middle-aged), M vs. O (old), and O vs. Y showed the union gene number of each group each brain region.