| Literature DB >> 28166722 |
Prashant K Singh1, Patrick R van den Berg2, Mark D Long1, Angie Vreugdenhil1, Laurie Grieshober3, Heather M Ochs-Balcom3, Jianmin Wang4, Sylvie Delcambre5, Sami Heikkinen6, Carsten Carlberg6, Moray J Campbell7, Lara E Sucheston-Campbell8,9.
Abstract
BACKGROUND: The nuclear hormone receptor superfamily acts as a genomic sensor of diverse signals. Their actions are often intertwined with other transcription factors. Nuclear hormone receptors are targets for many therapeutic drugs, and include the vitamin D receptor (VDR). VDR signaling is pleotropic, being implicated in calcaemic function, antibacterial actions, growth control, immunomodulation and anti-cancer actions. Specifically, we hypothesized that the biologically significant relationships between the VDR transcriptome and phenotype-associated biology could be discovered by integrating the known VDR transcription factor binding sites and all published trait- and disease-associated SNPs. By integrating VDR genome-wide binding data (ChIP-seq) with the National Human Genome Research Institute (NHGRI) GWAS catalog of SNPs we would see where and which target gene interactions and pathways are impacted by inherited genetic variation in VDR binding sites, indicating which of VDR's multiple functions are most biologically significant.Entities:
Keywords: ChIP-seq; Cistrome; DR3 motif; GWAS; Immune function; Linkage disequilibrium; NF-κB; Nuclear receptor superfamily; SNP; VDR
Mesh:
Substances:
Year: 2017 PMID: 28166722 PMCID: PMC5294817 DOI: 10.1186/s12864-017-3481-4
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Schematic workflow and key findings of the study. (Top panel) shows the SNPs and ChIP-seq datasets. Examples of possible overlap of Lead SNPs and SNPs in LD with Lead SNPs at genomic VDR loci are shown with examples of LD blocks. (Bottom panel) shows the number of SNPs identified from individual data sets (GWAS catalog, HapMap3 and 1000 genome project). It also shows the number of SNPs under VDR peaks and SNPs in a DR3-type motif
List of trait SNPs showing significant enrichment for associated traits
| Trait | (A) Trait SNPs (No of SNPs associated with the trait (GWAS + LD) | (B) No. of Trait SNPs under VDR peaks | LD SNP/Lead SNP | Total No. of SNPs under VDR peaks | Corrected | VDR peak set |
|---|---|---|---|---|---|---|
| Esophageal cancer (squamous cell) | 160 | 7 | rs4686848/rs2239612 | 574 | 0.0001 | Combined |
| 160 | 4 | rs2097461/rs2239815 | 149 | 0.002 | GM10855.D3 | |
| Leprosy | 187 | 4 | rs11231770/rs538147 | 78 | 0.0003 | LS180.D3 |
| Mean corpuscular hemoglobin | 148 | 5 | rs4366668/rs6494537 | 574 | 0.02 | Combined |
| 148 | 4 | rs4366668/rs6494537 | 121 | 0.0006 | GM10861.Veh | |
| 148 | 3 | rs4366668/rs6494537 | 81 | 0.008 | GM10855.Veh | |
| Helicobacter pylori serologic status | 22 | 3 | rs10012017/rs10004195 | 574 | 0.008 | Combined |
| 22 | 3 | rs10012017/rs10004195 | 316 | 0.001 | GM10861.D3 | |
| Vitiligo | 463 | 8 | rs10824732/rs11593576 | 574 | 0.02 | Combined |
| 463 | 7 | rs10824732/rs11593576 | 316 | 0.003 | GM10861.D3 | |
| 463 | 5 |
| 149 | 0.007 | GM10855.D3 | |
| Self-reported allergy | 400 | 6 |
| 316 | 0.012 | GM10861.D3 |
| Celiac disease | 576 | 9 | rs1250567/rs1250552 | 574 | 0.016 | Combined |
For each trait, the number of SNPs (GWAS + LD; (B)) under VDR peaks was compared to the total number of SNPs (GWAS + LD; (A)) associated with the trait. The significance of enrichment was tested by a hypergeometric test and the p-values were corrected using Bonferroni (corrected p-value). LD SNP/Lead SNP shows the SNPs under VDR peaks (Lead SNP and/or LD SNPs). The lead SNPs which are also present under VDR peaks are shown in bold. VDR peak set is the the number of data sets in which the VDR peak and SNP relationship was found; Combined = all the peaks from six studies considered together, or cell line and treatment specific data set (D3 = treated with VDR ligand, Veh = vehicle control). Peaks represent the data sets in which the traits showed significant enrichment. This analysis revealed 42 unique SNPs associated with VDR binding and enrichment of a trait
SNPs present in DR3-type motifs contained within VDR peaks and associated with traits
| Chr | Chr Start | Chr End | Strand | Reference Allele | Observed Alleles | SNP under VDR peak (Lead in bold) | Derived From (Lead SNP ID) | Trait | DR3 HOMER score | DR3 motif sequence (Homer) | SNP position in the motif |
|---|---|---|---|---|---|---|---|---|---|---|---|
| chr2 | 8442247 | 8442248 | + | G | A/G |
|
|
|
| AGATCATCGGGGTTT | 11 |
| chr15 | 50946814 | 50946815 | + | A | A/G | rs11070802 | rs2414059 | QT interval | 7 | AGTGCACATGGTGCA | 7 |
| chr3 | 119278467 | 119278468 | + | A | A/G |
|
|
| 6 | AGTTCACTCAGTACT | 5 |
| chr19 | 49867015 | 49867016 | + | C | C/G | rs2288480 | rs2303759 | Multiple sclerosis | 4 | GTGTCACCAAGGGCG | 7 |
| chr4 | 95088557 | 95088558 | + | C | C/T | rs28613890 | rs11097407 | Bipolar disorder and schizophrenia | 8 | GTGGTCATTGAGTTC | 14 |
| chr6 | 109704441 | 109704442 | + | T | C/T | rs3757230 | rs1046943 | Height | 7 | AGGTCATTGGGGAGG | 3 |
| chr6 | 109704450 | 109704451 | + | A | A/C | rs3757231 | rs1046943 | Height | 7 | AGGTCATTGGGGAGG | 12 |
| chr5 | 95265555 | 95265556 | + | G | A/G | rs4563648 | rs7700895 | IgG glycosylation | 5 | GGGTGAGAAAGTTTC | 2 |
| chr1 | 205641342 | 205641343 | + | C | C/G | rs55915134 | rs16856186 | Pulmonary function decline | 6 | TGGGTGAGGGGGGGG | 6 |
| chr11 | 118575418 | 118575419 | - | G | C/T | rs625735 | rs494459 | Height | 9.18 | GTGTCAAAGGGTTCA | 10 |
| chr11 | 128206409 | 128206410 | + | C | C/T |
|
|
|
| AAAGTCAGAGAGGAC | 7 |
| chr15 | 49167995 | 49167996 | + | G | A/G | rs73402209 | rs8023445 | Major depressive disorder | 9.18 | AGGTCAAAGAGGTCG | 14 |
| chr21 | 35348499 | 35348500 | - | C | A/G | rs743418 | rs2032314 | Red blood cell traits | 4 | AGGCTGCTGGGTCCA | 12 |
| chr5 | 150169882 | 150169883 | + | A | A/T | rs74973123 | rs13361189 | Crohn’s disease | 5 | ATGTTAAAGGGTTTA | 7 |
| chr9 | 123664122 | 123664123 | + | A | A/G | rs7859805 | rs881375 | Rheumatoid arthritis | 6 | GGGGCAAAAGGTGTG | 5 |
All the 574 disease- or phenotype-associated SNPs were overlapped with 8,884 VDR ChIP-seq loci containing DR3 motifs to identify SNPs present in DR3 motif. This analysis identified 15 SNPs to be present in DR3-type motifs. Table shows the details of 15 SNPs including homer score, associated Lead SNPs and traits/phenotypes. SNPs in bold survive multiple test correction
SNPs in bold are LEAD SNPs
aSNPs common with 42 significant trait SNPs, bases in red in motif sequence show the position of the SNP in the motif
The distribution of Regulome DB scores for different SNP lists
| Regulome DB score | (A) | (B) | (C) |
|---|---|---|---|
| No. of GWAS + LD SNPs (191482) | No. of (A) SNPs under VDR peaks (572) | No. of (B) SNPs under VDR peaks enriched for trait association (42) | |
| 1a | 21 | 1 | 0 |
| 1b | 272 | 15 | 1 |
| 1c | 6 | 0 | 0 |
| 1d | 148 | 3 | 0 |
| 1e | 7 | 0 | 0 |
| 1f | 3182 | 43 | 5 |
| 2a | 369 | 34 | 2 |
| 2b | 3182 | 109 | 7 |
| 2c | 137 | 8 | 0 |
| Total | 7324 | 213 | 15 |
All the GWAS + LD SNPs were analyzed using RegulomeDB. The table shows the distribution of scores 1 and 2 SNPs under VDR peaks as well as for 42 significant trait associated SNPs
Fig. 2Function of a SNP contained within a VDR binding peak associated with Toll-like receptor 1 (TLR1). The top panel shows the presence of rs5743565 (blocked out), which is in LD with rs2101521 (orange arrow) associated with self-reported allergy in GWAS study. VDR peaks are shown as Red block custom tracts. The rs5743565 is associated with expression of the genes TLR10, TLR6 and TLR1. The bottom panel shows the ENCODE ChIP-seq data for binding of different transcription factors in the region harboring rs5743565 and VDR binding sites (in red). The presence of rs5743565 in the binding region is predicted to alter the association of multiple transcription factors and the expression of associated genes, and therefore been assigned a score in RegulomeDB of < = 2
Fig. 3The co-occurrence of VDR and NF-κB1 binding at sites of significant genetic variation. a The wordcloud characterizes the transcription factors binding at the location of significantly associated Lead SNPs (and not linked to a DR3 motif) were identified by RegulomeDB; 95 different transcription factors, representing eight unique subgroups, overlapped with, and their function was altered by, these SNPs. The font size of transcription factor related to the number of times it was identified to associate with a significant SNP site. b The intersection of VDR and NF-κB ChIP-Seq from CEPH cell lines. The VDR ChIP-seq in the unstimulated and ligand-stimulated states in the CEPH cell lines GM10855 and GM10861 were intersected to generate a consensus cistrome for VDR binding sites in the unstimulated and stimulated states. Similarly, a consensus NF-κB cistrome was generated by intersecting the ChIP-Seq from the cell lines GM12878, GM12891, GM12892, GM15510, GM18505, GM18526, GM18951, GM19099, GM19193. These consensus cistromes were then intersected to reveal the unique and shared binding sites, and the overlap with the identified SNPs. c The effect of rs10174949 genotype on NF-κB binding in ChIP-seq from HapMap cell lines. The altered allele of rs10174949 was predicted to decrease the strength of predicted affinity of NF-κB1 by HaploReg. The ENCODE ChIP-seq data sets of NF-κB for HapMap cell lines for which genotype data was available were downloaded into Integrative Genomics Viewer (IGV). Population ID and the genotype for each cell lines is shown. Cell lines with homozygous reference allele are shown in blue, heterozygous samples are shown in green and sample with homozygous altered allele is shown in red. Samples with the homozygous altered allele (AA) completely lost NF-κB binding compared to the homozygous (GG) or heterozygous (AG) reference alleles
Showing e-QTLS and associated phenotypes for SNPs under VDR peak significantly enriched for trait association
| Chrom | ChromStart | ChromEnd | Strand | Reference Allele | Observed Alleles | SNP under VDR peak (Lead + LD SNP) | RegulomeDB score | e-QTL (affected gene) | Associated GWAS Trait |
|---|---|---|---|---|---|---|---|---|---|
| chr4 | 38805982 | 38805983 | - | T | A/G | rs5743565 | 1b |
| Self-reported allergy |
| chr4 | 38784723 | 38784724 | + | T | A/T | rs10004195 | 1f |
| Helicobacter pylori serologic status |
| chr11 | 64140703 | 64140704 | + | C | C/T | rs10897487 | 1f |
| Leprosy |
| chr11 | 64140736 | 64140737 | + | G | C/G | rs475032 | 1f |
| Leprosy |
| chr12 | 56401084 | 56401085 | + | G | A/G | rs10876864 | 1f |
| Vitiligo |
Five of 42 trait-associated SNPs which were e-QTLs with RegulomeDB score 1, e-QTL gene and traits associated with these SNPs in GWAS studies are shown