| Literature DB >> 24821702 |
Wen-Hua Wei1, Yunfei Guo2, Alida S D Kindt3, Tony R Merriman4, Colin A Semple3, Kai Wang2, Chris S Haley3.
Abstract
Human serum uric acid concentration (SUA) is a complex trait. A recent meta-analysis of multiple genome-wide association studies (GWAS) identified 28 loci associated with SUA jointly explaining only 7.7% of the SUA variance, with 3.4% explained by two major loci (SLC2A9 and ABCG2). Here we examined whether gene-gene interactions had any roles in regulating SUA using two large GWAS cohorts included in the meta-analysis [the Atherosclerosis Risk in Communities study cohort (ARIC) and the Framingham Heart Study cohort (FHS)]. We found abundant genome-wide significant local interactions in ARIC in the 4p16.1 region located mostly in an intergenic area near SLC2A9 that were not driven by linkage disequilibrium and were replicated in FHS. Taking the forward selection approach, we constructed a model of five SNPs with marginal effects and three epistatic SNP pairs in ARIC-three marginal SNPs were located within SLC2A9 and the remaining SNPs were all located in the nearby intergenic area. The full model explained 1.5% more SUA variance than that explained by the lead SNP alone, but only 0.3% was contributed by the marginal and epistatic effects of the SNPs in the intergenic area. Functional analysis revealed strong evidence that the epistatically interacting SNPs in the intergenic area were unusually enriched at enhancers active in ENCODE hepatic (HepG2, P = 4.7E-05) and precursor red blood (K562, P = 5.0E-06) cells, putatively regulating transcription of WDR1 and SLC2A9. These results suggest that exploring epistatic interactions is valuable in uncovering the complex functional mechanisms underlying the 4p16.1 region.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24821702 PMCID: PMC4159153 DOI: 10.1093/hmg/ddu227
Source DB: PubMed Journal: Hum Mol Genet ISSN: 0964-6906 Impact factor: 6.150
Genome-wide significant (P < 3.8E−13) SNP pairs in ARIC and replication in FHS
| chr | SNP1 | pos1 | SNP2 | Pos2 | Dist | LD ( | ||
|---|---|---|---|---|---|---|---|---|
| 4 | 10 124 239 | rs731069 | 10 152 431 | 28.2 | 0.000 | 6.1E−14 | NA | |
| 4 | 10 124 239 | rs747357 | 10 152 878 | 28.6 | 0.000 | 2.2E−13 | NA | |
| 4 | 10 124 567 | rs731069 | 10 152 431 | 27.9 | 0.000 | 4.0E−14 | 1.5E−07 | |
| 4 | 10 124 567 | rs747357 | 10 152 878 | 28.3 | 0.000 | 1.7E−13 | 1.2E−07 | |
| 4 | 10 126 139 | 10 208 794 | 82.7 | 0.002 | 3.1E−13 | 6.9E−04 |
chr—chromosome of a SNP pair; SNP1 (SNP2), pos1 (pos2)— name and position of the first (second) SNP; dist—distance in kb between two SNPs; LD (r2)— linkage disequilibrium between two SNPs; Pint—P-value of the interaction test; Pint_FHS—interaction P-value of the SNP pair in FHS; NA—not directly replicated in FHS; SNPs in bold were genome-wide significant in GWAS in ARIC.
Figure 1.Genome-wide significant SNP pairs in ARIC (red) and their replication (P < 0.05) in FHS (purple). Each horizontal line represents an interaction between two SNPs located at the start and end of the line; two vertical lines mark the 30-kb window described in the main text; y-axis: interaction P-values in the −log10 scale; x-axis: genomic location in base pair (UCSC hg19/NCBI 37.3); arrow bar showing transcription direction and location of the gene (italic) below the bar; rs3733588 is the lead GWAS SNP.
Figure 2.Local interactions in the 4p16.1 region (red) and those remaining significant (P < 0.05) in conditional tests on the lead SNP rs3733588 (blue) in ARIC. Each horizontal line represents an interaction between two SNPs at the start and end locations; two vertical lines mark the 30-kb window described in the main text; y-axis: interaction P-values in the −log10 scale; x-axis: genomic location in base pair (UCSC hg19/NCBI 37.3); arrow bar showing transcription direction and location of the gene (italic) below the bar.
Statistical construction of the genetic structure of the 4p16.1 region
| SNP1 | pos1 | SNP2 | pos2 | dist | LD ( | Variance explained (%) | |
|---|---|---|---|---|---|---|---|
| rs3733588 | 9 997 303 | 6.9E−60 | 4.5 | ||||
| rs874432 | 9 920 606 | 1.4E−10 | 5.5 | ||||
| rs4697695 | 9 915 850 | 3.1E−04 | 5.7 | ||||
| rs9291683 | 10 324 160 | 4.6E−03 | 5.7 | ||||
| rs11734783 | 10 240 663 | 1.4E−02 | 5.9 | ||||
| rs731069 | 10 152 431 | rs10939766 | 10 204 970 | 52.5 | 0.227 | 1.5E−02 | 5.8 |
| rs4698000 | 10 277 467 | rs11943276 | 10 403 545 | 126.1 | 0.042 | 1.7E−02 | 5.9 |
| rs6813385 | 10 148 828 | rs16894270 | 10 165 779 | 17.0 | 0.142 | 7.2E−03 | 6.0 |
only additive effects considered for first marginal SNPs; SNP1 (SNP2), pos1 (pos2)— name and position the first (second) SNP; dist—distance in kb between two SNPs; LD (r2)— linkage disequilibrium between two SNPs; P-value of a marginal SNP or interaction P-value of an epistatic SNP pair; Variance explained—SUA residual variance explained accumulated; blank cells: no information required.
Enrichment of ENCODE enhancers by genome-wide significant local interactions in the 4p16.1 region in ARIC
| Cell type | All enhancers | Strongest enhancers | |||||||
|---|---|---|---|---|---|---|---|---|---|
| ID | Description | Obs. | Exp. | Fold | Obs. | Exp. | Fold | ||
| HepG2 | Hepatocellular carcinoma | 9 | 1.7 | 5.2 | 4.7E−05 | 5 | 0.6 | 8.3 | 3.4E−04 |
| HUVEC | Umbilical vein endothelial cells | 7 | 2.2 | 3.2 | 6.0E−03 | 2 | 1.1 | 1.8 | 3.1E−01 |
| K562 | Leukaemia | 11 | 2.1 | 5.2 | 5.0E−06 | 10 | 0.7 | 15.2 | <1.0E−06 |
| HMEC | Mammary epithelial cells | 7 | 3 | 2.3 | 3.0E−02 | 2 | 1.2 | 1.7 | 3.4E−01 |
| GM12878 | B-lymphocyte lymphoblastoid | 5 | 2.2 | 2.3 | 6.6E−02 | 5 | 0.8 | 6.2 | 1.2E−03 |
| NHEK | Epidermal keratinocytes | 7 | 2.7 | 2.6 | 1.7E−02 | 4 | 1.2 | 3.3 | 3.1E−02 |
Obs.: observed; Exp.: expected; Fold: fold of enrichment calculated as Obs./Exp.; 42 unique epistatic SNPs from Table 1 and Supplementary Material, Table S4 used in the enrichment analysis.