| Literature DB >> 26414678 |
Hilary K Finucane1,2, Brendan Bulik-Sullivan3,4, Alexander Gusev2, Gosia Trynka5,6,7,8,9, Yakir Reshef10, Po-Ru Loh2, Verneri Anttila3,4,8, Han Xu11, Chongzhi Zang11, Kyle Farh3,12, Stephan Ripke3,4, Felix R Day13, Shaun Purcell5,6,14, Eli Stahl14, Sara Lindstrom2, John R B Perry13, Yukinori Okada15,16, Soumya Raychaudhuri5,6,7,8,17, Mark J Daly3,4,8, Nick Patterson8, Benjamin M Neale3,4,8, Alkes L Price2,8.
Abstract
Recent work has demonstrated that some functional categories of the genome contribute disproportionately to the heritability of complex diseases. Here we analyze a broad set of functional elements, including cell type-specific elements, to estimate their polygenic contributions to heritability in genome-wide association studies (GWAS) of 17 complex diseases and traits with an average sample size of 73,599. To enable this analysis, we introduce a new method, stratified LD score regression, for partitioning heritability from GWAS summary statistics while accounting for linked markers. This new method is computationally tractable at very large sample sizes and leverages genome-wide information. Our findings include a large enrichment of heritability in conserved regions across many traits, a very large immunological disease-specific enrichment of heritability in FANTOM5 enhancers and many cell type-specific enrichments, including significant enrichment of central nervous system cell types in the heritability of body mass index, age at menarche, educational attainment and smoking behavior.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26414678 PMCID: PMC4626285 DOI: 10.1038/ng.3404
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
Figure 1Simulation results: null calibration and power. We simulated genetic architectures with positive total SNP-heritability, with and without functional enrichment, for two values of p and a range of values of N·h. (a) Proportion of simulations in which a null of no functional enrichment is rejected, as a function of N·h and p. (b) The z-score of total SNP-heritability depends on N·h and p, but does not depend on the presence or absence of functional enrichment. (c) Proportion of simulations in which a null of no functional enrichment is rejected, as a function of the z-score of total SNP-heritability. Here, the z-score of total SNP-heritability for p = 0.005 did not exceed 7.3 even at maximum N·h.
Figure 2Simulation results: model misspecification. Enrichment is the proportion of heritability in DHS regions divided by the proportion of SNPs in DHS regions. Bars show 95% confidence intervals around the mean of 100 trials. (a) From left to right, the simulated genetic architectures are 1x DHS enrichment, 3x DHS enrichment, and 5.5x DHS enrichment (100% of heritability in DHS SNPs). (b) From left to right, the simulated genetic architectures are 200bp flanking regions causal, coding regions causal, and FANTOM5 Enhancer regions causal. For simulations with coding or FANTOM5 Enhancer as the causal category, we removed the causal category and the 500bp window around that category from the full baseline model in order to simulate enrichment in an unknown functional category.
Figure 3Simulation results for ranking cell-type groups and cell types. For each cell-type group, 500 simulations were performed with baseline enrichment and either realistic enrichment or low enrichment in that cell-type group. Results for the left two columns are aggregated over the ten cell-type groups; results for individual groups are displayed in Supplementary Figure 5. The right two columns represent 500 simulations each of realistic or low enrichment of a single cell-type-specific annotation, H3K4me3 in fetal brain cells.
Figure 4Enrichment estimates for the 24 main annotations, averaged over nine independent traits. Annotations are ordered by size. Error bars represent jackknife standard errors around the estimates of enrichment, and stars indicate significance at P < 0.05 after Bonferroni correction for 24 hypotheses tested. Negative point estimates, significance testing, and the choice of nine independent traits are discussed in the Online Methods and Supplementary Note.
Figure 5Enrichment estimates for selected annotations and traits. Error bars represent jackknife standard errors around the estimates of enrichment.
Enrichment of individual cell types. We report the cell type with the lowest P-value for each trait analyzed.
| Phenotype | Cell type | Tissue | Mark | -log10( |
|---|---|---|---|---|
| Height | Chondregenic dif | Bone | H3K27ac | 6.81 |
| BMI | Fetal brain | Fetal brain | H3K4me3 | 4.48 |
| Age at menarche | Fetal brain | Fetal brain | H3K4me3 | 12.25 |
| LDL | Liver | Liver | H3K4me1 | 4.76 |
| HDL | Liver | Liver | H3K4me1 | 4.51 |
| Triglycerides | Liver | Liver | H3K4me1 | 3.99 |
| Coronary artery disease | Adipose nuclei | Adipose | H3K4me1 | 4.21 |
| Type 2 diabetes | Pancreatic islets | Pancreas | H3K4me3 | 2.87 |
| Fasting glucose | Pancreatic islets | Pancreas | H3K27ac | 3.93 |
| Schizophrenia | Fetal brain | Fetal brain | H3K4me3 | 18.51 |
| Bipolar disorder | Mid frontal lobe | Brain | H3K27ac | 4.42 |
| Anorexia | Angular gyrus | Brain | H3K9ac | 2.61 |
| Years of education | Angular gyrus | Brain | H3K4me3 | 6.63 |
| Ever smoked | Inferior temporal lobe | Brain | H3K4me3 | 3.21 |
| Rheumatoid arthritis | CD4+ CD25− IL17+ stim Th17 | Immune | H3K4me1 | 6.76 |
| Crohn’s disease | CD4+ CD25− IL17+ stim Th17 | Immune | H3K4me1 | 7.59 |
| Ulcerative colitis | CD4+ CD25− IL17+ stim Th17 | Immune | H3K4me1 | 6.37 |
denotes FDR < 0.05.
denotes significant at P < 0.05 after Bonferroni correction for multiple hypotheses. Sample sizes are in Supplementary Table 3.
Figure 6Enrichment of cell-type groups. We report significance of enrichment for each of 10 cell-type groups, for each of 11 traits. The black dotted line at −log10(P) = 3.5 is the cutoff for Bonferroni significance. The grey dotted line at −log10(P) = 2.1 is the cutoff for FDR < 0.05. For HDL, three of the top individual cell types are adipose nuclei, which explains the enrichment of the “Other” category.
Figure 7Comparison to other methods for identifying enriched cell types. In “Null” simulations, there is no enrichment. In “Null (baseline enrichment)” simulations, there is enrichment in the baseline categories, some of which overlap the cell type or cell-type group, but no additional enrichment in the cell type or cell-type group. In the “True enrichment” simulations, there is enrichment in either the CNS cell-type group (top panels) or the fetal brain cell type (bottom babels). In all simulations, N = 14000, h = 0.7. We report the proportion of 100 simulations in which the null is rejected by six methods: GoShifter [6], fgwas [9], Top SNPs [10], PICS [7], stratified LD score (unadjusted), and LD score. LD score (unadj) refers to total unadjusted enrichment, i.e., (Prop. h)/(Prop. SNPs); LD score refers to the coefficient β of the category, controlling for all other categories in the model.