| Literature DB >> 23595788 |
Brendan Bulik-Sullivan1, Sara Selitsky, Praveen Sethupathy.
Abstract
Comprehensive analyses of results from genome-wide association studies (GWAS) have demonstrated that complex disease/trait-associated loci are enriched in gene regulatory regions of the genome. The search for causal regulatory variation has focused primarily on transcriptional elements, such as promoters and enhancers. microRNAs (miRNAs) are now widely appreciated as critical posttranscriptional regulators of gene expression and are thought to impart stability to biological systems. Naturally occurring genetic variation in the miRNA regulome is likely an important contributor to phenotypic variation in the human population. However, the extent to which polymorphic miRNA-mediated gene regulation underlies GWAS signals remains unclear. In this study, we have developed the most comprehensive bioinformatic analysis pipeline to date for cataloging and prioritizing variants in the miRNA regulome as functional candidates in GWAS. We highlight specific findings, including a variant in the promoter of the miRNA let-7 that may contribute to human height variation. We also provide a discussion of how our approach can be expanded in the future. Overall, we believe that the results of this study will be valuable for researchers interested in determining whether GWAS signals implicate the miRNA regulome in their disease/trait of interest.Entities:
Keywords: GWAS; complex disease; gene regulation; microRNA; polymorphism
Mesh:
Substances:
Year: 2013 PMID: 23595788 PMCID: PMC3807557 DOI: 10.1002/humu.22337
Source DB: PubMed Journal: Hum Mutat ISSN: 1059-7794 Impact factor: 4.878
Figure 1Summary of the analysis pipeline. The primary data sources (pink rectangles) and the number of trait-associated SNPs (blue rectangles) passing each filter (red text) in the analysis pipeline are shown. *, candidate miRNA regulatory hubs are identified for each trait/disease using a Monte Carlo simulation strategy (Methods).
Selected Examples of Trait/Disease-Associated SNPs (TASs) in the miRNA Regulome
| Trait/disorder | miRNA regulome SNP | Minor allele frequency | miRNA | miRNA regulome element |
|---|---|---|---|---|
| Vertical cup-disc ratio | rs12803915 | 21.4% (EUR) | miR-612 | Pre-miRNA |
| Pulse rate | rs6701558 | 11.2% (ASN) | miR-29b/c | Promoter region |
| Schizophrenia | rs2660302 | 21.8% (EUR) | miR-137 | Promoter region |
| Height | rs113431232 | 3.4% (EUR) | let-7a/d/f | Promoter region |
| Pigmentation | rs35407 | 2.2% (EUR) | miR-27 | |
| Body mass index | rs77632545 | 19.9% (ASN) | miR-181a | |
| Asthma | rs17052784 | 2.6% (EUR) | miR-140-3p | |
| Plasma C-reactive protein levels | rs116971887 | 4.9% (EUR) | miR-194 | |
| Type 1 diabetes autoantibodies | rs3842753 | 50.0% (EUR) | miR-491-5p | |
| Type 2 diabetes | rs1802295 | 50.0% (ASN) | miR-510 |
Of the 42 TASs that are in the miRNA regulome and are not in LD with annotated nonsynonymous or transcriptional SNPs, 10 are shown here (pre-miRNA, n = 1; miRNA promoter, n = 3; miRNA target site, n = 6). Minor allele frequencies (MAFs) are specific to the 1000G super-population (ASN, Asian; EUR, European; AMR, American; AFR, African) that is closest to the ancestry of the case-control cohort in the GWAS that identified the genetic association. MAFs of the miRNA regulome SNPs range from relatively rare (e.g., rs35407, EUR MAF = 0.022) to very common (e.g., rs1802295, ASN MAF = 0.5).
Figure 2A genetic variant in the let-7 promoter may contribute to human height variation. A: Each data point represents a human miRNA and the y-axis shows the log10 of the P value of miRNA target site enrichment among genes implicated in height by GWAS. The dashed line denotes the significance threshold (empirical P = 0.01). B: SNP rs113431232, which is in LD with an index SNP (rs1257763) for height, occurs in a validated E2F4 binding site within the promoter region of the let-7a/d/f miRNA cluster. H3K4me3 (histone H3 lysine 4 tri-methylation) ChIP-seq signal (ENCODE) denotes promoter regions; DNase HS (DNase I hypersensitive site) signal (ENCODE) denotes open chromatin; E2F4 ChIP-seq signal (ENCODE) denotes chromatin occupancy of transcription factor E2F4. Peak E2F4 occupancy signal (red rectangle) occurs within a local dip in the DNase HS signal, which is indicative of a bound transcription factor(s). All ENCODE data shown are from HeLa cells.
| Case–control cohort ancestry information provided by the GWAS | 1000G super-population assignment |
|---|---|
| Indian|India|Europe|Canadian|Canada|Caucasian|White|Framingham|Norwegian|Norway|French|France|Iceland|German|Germany|Scandanavia|Finnish|Finland| UK |Welsh|Wales|Irish|Ireland|Scottish|Scotland|Spanish|Spain|Iberia|Toscani|Tuscan|Utah|CEPH|England|English|Swiss|Switzerland|Australia|Turkish|Turkey|Saami|Belgium|Belgian|Russia|Polish|Poland | EUR (European) |
| Africa|Ghani|Malawi|Yoruba|Esan|Mende|Sierra|Leone|Gambia|Kenya|Luhyai | AFR (African) |
| Mexico|Mexican|Hispanic|Peru|Puerto|Rico|Rican|Colombia|Medellin|Lima | AMR (American) |
| Asia|Chin|Japan|Thai|Korea|Bangladesh|Taiwan|Indonesia|Vietnam|Hong Kong|Kosrae|Micronesia|Papua New Guinea|Han|Dai|Kinh |Tokyo|Xishuangbanna|Beijing | ASN (Asian) |
| Anything else | EUR (European) |