| Literature DB >> 25884492 |
Luiz F Goulart1,2, Francesco Bettella3,4, Ida E Sønderby5,6, Andrew J Schork7,8,9, Wesley K Thompson10, Morten Mattingsdal11,12, Vidar M Steen13,14, Verena Zuber15,16,17, Yunpeng Wang18,19, Anders M Dale20,21,22,23, Ole A Andreassen24,25,26, Srdjan Djurovic27,28,29.
Abstract
BACKGROUND: The genotype information carried by Genome-wide association studies (GWAS) seems to have the potential to explain more of the 'missing heritability' of complex human phenotypes, given improved statistical approaches. Several lines of evidence support the involvement of microRNA (miRNA) and other non-coding RNA in complex human traits and diseases. We employed a novel, genetic annotation-informed enrichment method for GWAS that captures more polygenic effects than standard GWAS analysis, to investigate if miRNA-tagging Single Nucleotide Polymorphisms (SNPs) are enriched of associations with 15 complex human phenotypes. We then leveraged the enrichment using a conditional False Discovery Rate (condFDR) approach to assess any improvement in the detection of individual miRNA SNPs associated with the disorders.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25884492 PMCID: PMC4437677 DOI: 10.1186/s12864-015-1513-5
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1miRNA stratified Q-Q plots for Height, Low Density Lipoprotein (LDL), Crohn’s Disease (CD) and Schizophrenia (SCZ) using Linkage-Disequilibrium (LD)-weighted annotation scores. Shown are Q-Q plots for miRNA SNPs compared to those for all SNPs and intergenic SNPs, a collection of likely null SNPs. The confidence intervals were obtained by sampling ten independent sets of SNP representatives from all LD-blocks (r2 > 0.2) and computing means and confidence intervals for one thousand bins of nominal p-value.
Significance of miRNA enrichment
|
|
|
|---|---|
| BD | 2.4e-02§ |
| BMI | 1.0e-12* |
| CD | 1.9e-06* |
| CPD | 6.3e-01 |
| HDL | 2.2e-20* |
| Height | 3.9e-90* |
| LDL | 2.6e-17* |
| MS | 3.3e-03§ |
| PrCa | 1.1e-01 |
| SBP | 9.8e-08* |
| SCZ | 4.1e-07* |
| T2D | 9.8e-03§ |
| TG | 1.5e-16* |
| UC | 3.2e-06* |
| WHR | 5.8e-06* |
Significance of miRNA enrichment in 15 phenotypes. Reported are the binomial proportion test (BPT) p-values for miRNA target regions compared with intergenic SNPs for all phenotypes. CPD and PrCa do not reach nominal statistical significance; BD, MS, and T2D would not pass strict multiple testing criteria for 15 phenotypes – BD, Bipolar Disorder; BMI, Body Mass Index; CD, Crohn's disease; CPD, Cigarettes per Day; HDL, High density lipoprotein; LDL, Low density lipoprotein; MS, multiple sclerosis; PrCa, prostate cancer; SBP, systolic blood pressure; SCZ, Schizophrenia; T2D, Type 2 Diabetes; TG, triglycerides; UC, Ulcerative Colitis; WHR, Waist to Hip Ratio. § Nominally significant (not significant after controlling for multiple testing); * Significant.
Figure 2Categorical Enrichment for Height, Low Density Lipoprotein (LDL), Crohn’s Disease (CD) and Schizophrenia (SCZ). The relative pattern of enrichment of LD-weighted genic annotation categories, as measured by the mean (z-score2 - 1) normalized by the highest value across categories within each phenotype after intergenic inflation control, remains consistent. Results for all phenotypes are shown in Additional file 1.
Figure 3Independent study replication confirms enrichment in Crohn’s Disease. Shown are plots of (A) stratified true discovery rate (TDR) and (B) cumulative replication rate as functions of the negative decadic logarithm of the p-value. The p-value in the cumulative replication rate plot refers to the p-value in the discovery sample. Replication is intended as achieved for p-value smaller than 0.05 in the replication sample. It is evident from (B) how, given a p-value threshold, miRNA associations replicate at a higher rate in independent samples than just any SNPs or SNPs in intergenic regions. The vertical intercept is the overall replication rate per category. The cumulative replication rate mirrors the TDR as should be expected.
Number of significant miRNA genomic loci
|
|
|
|
|
|---|---|---|---|
|
|
|
| |
| BD | 8 | 2 | 0 |
| BMI | 7 | 5 | 3 |
| CD | 55 | 34 | 16 |
| CPD | 1 | 0 | 0 |
| HDL | 52 | 38 | 18 |
| Height | 189 | 123 | 51 |
| LDL | 62 | 37 | 19 |
| MS | 51 | 42 | 25 |
| PrCa | 16 | 12 | 6 |
| SBP | 12 | 6 | 2 |
| SCZ | 29 | 14 | 6 |
| T2D | 1 | 1 | 1 |
| TG | 84 | 43 | 20 |
| UC | 54 | 36 | 18 |
| WHR | 4 | 3 | 0 |
Comparative table showing number of identified genomic loci with conditional false discovery rate (condFDR < 0.01) compared with false discovery rate (FDR < 0.01) and standard p-value (Bonferroni correction p < 5×10−8). We show that the condFDR method improves the discovery of miRNA SNPs across the 15 phenotypes. The numbers reported here are after pruning SNPs for LD at a threshold of r2 ≤ 0.2. BD, Bipolar Disorder; BMI, Body Mass Index; CD, Crohn's disease; CPD, Cigarettes per Day; HDL, High density lipoprotein; LDL, Low density lipoprotein; MS, multiple sclerosis; PrCa, prostate cancer; SBP, systolic blood pressure; SCZ, Schizophrenia; T2D, Type 2 Diabetes; TG, triglycerides; UC, Ulcerative Colitis; WHR, Waist to Hip Ratio.