| Literature DB >> 25758824 |
Andrew Anand Brown1, Zhihao Ding2, Ana Viñuela3, Dan Glass3, Leopold Parts2, Tim Spector3, John Winn4, Richard Durbin5.
Abstract
Statistical factor analysis methods have previously been used to remove noise components from high-dimensional data prior to genetic association mapping and, in a guided fashion, to summarize biologically relevant sources of variation. Here, we show how the derived factors summarizing pathway expression can be used to analyze the relationships between expression, heritability, and aging. We used skin gene expression data from 647 twins from the MuTHER Consortium and applied factor analysis to concisely summarize patterns of gene expression to remove broad confounding influences and to produce concise pathway-level phenotypes. We derived 930 "pathway phenotypes" that summarized patterns of variation across 186 KEGG pathways (five phenotypes per pathway). We identified 69 significant associations of age with phenotype from 57 distinct KEGG pathways at a stringent Bonferroni threshold ([Formula: see text]). These phenotypes are more heritable ([Formula: see text]) than gene expression levels. On average, expression levels of 16% of genes within these pathways are associated with age. Several significant pathways relate to metabolizing sugars and fatty acids; others relate to insulin signaling. We have demonstrated that factor analysis methods combined with biological knowledge can produce more reliable phenotypes with less stochastic noise than the individual gene expression levels, which increases our power to discover biologically relevant associations. These phenotypes could also be applied to discover associations with other environmental factors.Entities:
Keywords: aging; factor analysis; gene expression; heritability; linear mixed models
Mesh:
Year: 2015 PMID: 25758824 PMCID: PMC4426370 DOI: 10.1534/g3.114.011411
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
List of 20 pathways most significantly associated with age
| KEGG_ID | Pathway | No. of Genes in Pathway | Number of Age-Associated Genes | Heritability | |
|---|---|---|---|---|---|
| 00900 | Terpenoid Backbone Biosynthesis | 6.23 | 13 | 6 | 0.00 |
| 00980 | Metabolism of Xenobiotics by Cytochrome P450 | 6.47 | 54 | 6 | 0.09 |
| 01040 | Biosynthesis of Unsaturated Fatty Acids | 1.11 | 17 | 6 | 0.25 |
| 00100 | Steroid Biosynthesis | 1.33 | 14 | 12 | 0.41 |
| 00650 | Butanoate Metabolism | 1.51 | 27 | 8 | 0.39 |
| 04146 | Peroxisome | 1.56 | 64 | 17 | 0.45 |
| 00830 | Retinol Metabolism | 1.93 | 48 | 6 | 0.45 |
| 00010 | Glycolysis Gluconeogenesis | 3.59 | 49 | 12 | 0.42 |
| 00051 | Fructose and Mannose Metabolism | 3.99 | 32 | 8 | 0.32 |
| 00290 | Valine Leucine and Isoleucine Biosynthesis | 1.15 | 11 | 3 | 0.00 |
| 00561 | Glycerolipid Metabolism | 2.63 | 38 | 6 | 0.34 |
| 00620 | Pyruvate Metabolism | 4.20 | 35 | 11 | 0.37 |
| 00770 | Pantothenate and COA Biosynthesis | 4.76 | 16 | 4 | 0.48 |
| 00280 | Valine Leucine and Isoleucine Degradation | 5.79 | 35 | 10 | 0.51 |
| 00020 | Citrate Cycle TCA Cycle | 1.12 | 23 | 8 | 0.43 |
| 04916 | Melanogenesis | 3.34 | 93 | 10 | 0.00 |
| 04910 | Insulin Signaling Pathway | 3.70 | 122 | 13 | 0.45 |
| 00565 | Ether Lipid Metabolism | 5.89 | 27 | 3 | 0.00 |
| 00350 | Tyrosine Metabolism | 9.44 | 32 | 4 | 0.34 |
| 00640 | Propanoate Metabolism | 1.03 | 26 | 6 | 0.59 |
List of 20 pathways most significantly associated with age, together with the total number of genes in the pathway, the number of genes within pathways significantly associated with age (, corrected using Bonferroni for the total number of genes in the pathway), and the heritability of the pathway factor.
Figure 1Q-Q plot of observed P values against theoretical P values for factor analysis (red dots) and single gene–based methods (in blue). Permutations (in green) show the results of a combined analysis of 10 permuted datasets. Horizontal lines show Bonferroni significance thresholds accounting for different numbers of tests (186 tests for single gene measures in blue, 930 for factor analysis in red, and 9300 for the combined 10 permutation analyses in green).
Figure 2Histograms showing the proportion of environmental variation explained by age, heritability, and the proportion of variance explained by the unique environment for pathway factors and the individual gene measurements.
Figure 3The relative importance of sources of variation to global, pathway, and gene phenotypes. Measures of variation shown are the proportion of variance explained by unique environment, proportion of variance explained by genetics (heritability), and the proportion of environmental variation explained by age. To show more clearly the differences in relative importance of these measures to different classes of phenotypes, all proportions are scaled such that contribution to gene phenotypes equals one. Numbers above the bars give the absolute, unscaled proportions.