| Literature DB >> 27604318 |
Xuan Liao1,2, ChangJun Lan1,2, Dan Liao1,2, Jing Tian1,2, XiuQi Huang1,2.
Abstract
Refractive error (RE) is a complex multifactorial disease. Genome-wide association studies (GWAS) have provided significant insight into the genetic architecture and identified plenty of robust genetic variations or single nucleotide polymorphisms (SNPs) associated with complex disease. A major current challenge is to convert those resources into causal variants and target genes. We used RegulomeDB and HaploReg to annotate regulatory information onto associated SNPs derived from the two largest RE GWAS, and additional SNPs in linkage disequilibrium (LD) with GWAS significant SNPs. Overall 868 SNPs were investigated, out of which 662 returned RegulomeDB scores of 1 to 6. It was observed that 36 out of those SNPs show strong evidence of regulatory effects with a RegulomeDB score of 1, while only four of them were GWAS significant SNPs (CD55/rs1652333, CNDP2/rs12971120, RDH5/rs3138142 and rs3138144). The results encourage us to explore those putative pathogenic variants, both GWAS significant SNPs as well as the SNPs in LD, for future discernment of functional consequence. This study offers the attractive approach for prioritizing potential functional variants by combining ENCODE data and GWAS information, and provide further insights into the pathogenesis and mechanism and ultimately therapeutics.Entities:
Year: 2016 PMID: 27604318 PMCID: PMC5015044 DOI: 10.1038/srep33090
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Annotation of potential regulatory SNPs in RegulomeDB Category of 1.
| Chromosome: Location | RefSNP | Allele | Gene | Position | Histone marked | DNase | eQTL | Motifs altered | Protein bound |
|---|---|---|---|---|---|---|---|---|---|
| 1:207387765 | rs1346720 | T/C | intergenic 69 kb 3′ of | − | − | + | − | SETDB1* | |
| 1:207424726 | rs7545125 | A/G | intergenic 70 kb 5′ of | + | − | + | −; DIx2, Foxo, Ik-2, NF-AT, NF-AT1, YY1 | CEBPB* | |
| 1:207456473 | rs12095015 | T/C | intergenic 38 kb 5′ of | + | + | + | MGA; Foxp1, Irf, SRF, Zec | CDX2, JUN; CJUN | |
| 1:207470459 | G/A | intergenic 24 kb 5′ of | + | − | + | −; Ap-4, E2A, Evi-1, Mef2, Myf, RP58 | MEF-2, TCF21, AP-4, TFAP4, MYF6 | ||
| 1:207472188 | rs2802236 | T/C | intergenic 23 kb 5′ of | + | + | + | HOXA5(Hox-1.3); Eomes, Hoxa5, Pax5, Pax6, Zfp187 | FOXA1, ESR1, EP300, GATA3*; ERALPHA-A | |
| 1:207478979 | rs1572275 | C/A | intergenic 16 kb 5′ of | + | + | + | −; AIRE, Pou5f1 | SMARCC1, RAD21*, POLR2A, CEBPB*, TAF1*, NFYB, TBP*, CHD2, TFAP2A | |
| 1:207483444 | rs2564974 | G/A | intergenic 11 kb 5′ of | + | + | + | Irx-3, Irx3, Irx4, Irx6, TFAP4; CDP, Irx, Myb | STAT1, SMARCC1, TCF7L2, SP1, EP300, TCF12, MYC, ELF1*, RCOR1, MEF2A, TEAD4, SPI1, BHLHE40, JUND*, GATA2*, TAL1*, JUN, USF1, MAX*, GATA1, POLR2A, CBX3, SIRT6*; PU1*, POL2*, BAF155*, CJUN*, CMYC*, INI1* | |
| 1:207494415 | rs2564978 | T/C | intergenic 399 bp 5′ of | + | + | + | FXR/RXR-alpha* | E2F1, MTA3, POLR2A*, SPI1, ELF1*, TAF1, CEBPB*, TBL1XR1, EP300*, TBP*, FOS, TRIM28, MYC*, STAT3*, KDM5B, ZNF143, HDAC1, MAX*, SP4, CDX2, MXI1, GATA1, STAT1, CTCF*, JUNB, PHF8, RAD21*, BACH1, TCF12, GABPB1, ZKSCAN1, IRF1, SIN3A, SETDB1, NFKB1*, HNF4A; PU1, GTF2F1, POL2S2, SMC3, CEBPB | |
| 1:207501210 | rs4844592 | T/A | intronic | + | + | + | Srf, SPI1, SPIC; HDAC2 | POLR2A* | |
| 1:207502533 | rs6700168 | A/C | intronic | + | + | + | IRC900814 | POLR2A*, NFYA, CHD2, MAX, RCOR1, SMC3, ZKSCAN1, ZNF143; POL24H8, GTF2F1, POL2B, POL2S2 | |
| 1:207506328 | rs10864231 | T/G | intronic | − | + | + | −; PL2F, Pax5 | POLR2A*, TAF1; POL24H8 | |
| 1:207507480 | rs1507758 | G/C | intronic | + | + | + | −; Foxp1 | POLR2A*, SETDB1* | |
| 1:207509364 | rs1507760 | C/T | intronic | + | + | + | −; NK-kappaB | POLR2A*, EBF1, CHD1, EP300, CTCF*, GATA1, RAD21*, NFKB1*, TBP, SP1, YY1, TBL1XR1, CHD2, RUNX3, BHLHE40, MXI1; POL24H8, POL2B | |
| 9:71733141 | rs11145326 | C/G | intergenic 3.1kb 5′ of | − | − | + | −; MSX2, Pax6, STAT, Sox | PAX6, SOX15, SOX7, SOX8 | |
| 9:71770938 | rs11145488 | G/A | intronic | + | + | + | −; Nanog, Sox | ||
| 9:71791546 | rs1538583 | A/G | intronic | + | + | + | POU3F2; Foxa, Pbx3, Pou1f1, Pou3f2, STAT | EBF1*, PML, NFIC | |
| 12:56114768 | G/C | intronic | + | + | + | −; BCL, NRSF | MYC, CEBPB, POLR2A, EP300, RUNX3, ZBTB7A, RXRA, MAZ, SMC3, CTCF*, E2F1 | ||
| 12:56115584 | C/T | exon, synonymous-coding | + | + | + | Nr2f2; PLAG1*, RXRA* | POLR2A | ||
| 17:31184630 | rs17183628 | T/C | intronic | + | − | + | Zfp691, ZFP652; FXR, Pax4, VDR | TEAD4 | |
| 17:31187215 | rs17781142 | G/C | intronic | + | + | + | −; LBP-1 | JUN, PML, GATA2*, POLR2A, MAX, RCOR1, CEBPB, STAT2, GATA1*, STAT5A, EBF1*, TAF7, JUND, EP300, CCNT2*, MAZ, IRF1, NFKB1, EBF1, TBL1XR1, MYC* | |
| 17:31239644 | rs10512441 | C/T | intergenic 15 kb 5′ of | − | + | + | HNF3beta; Foxa, Foxj2 | EP300*, POLR2A*, CTCF*, TRIM28, FOXA2*, JUND, GATA3*, UBTF, SRF*, TAF1, YY1, ZBTB7A, ZNF143, SP1, ARID3A, HDAC2, FOXA1*, TBP*, MYC, PML, PHF8, MAZ; KAP1*, HDAC2* | |
| 18:72154930 | rs747176 | G/A | intergenic 8.6 kb 5′ of | + | − | + | −; Mrg, Pax5, Pbx3, TAL1, Tgif1 | ||
| 18:72167123 | rs3764509 | C/G | 5′ UTR | + | + | + | −; Pbx3 | POU2F2, SIN3A*, STAT3*, STAT1*, TAF1*, USF1*, TBP, USF2, SAP30, RUNX3, BHLHE40, CTCF, E2F6*, YY1*, CEBPB*; MAX, POLR2A, POL24H8, SIN3AK20, MYC, NFKB, MXI1, ETS1 | |
| 18:72167801 | rs12605820 | G/A | intronic | + | + | + | Sfpi1; Foxo, HP1, Mef2, TCF12 | ||
| 18:72168485 | rs8084410 | T/C | intronic | + | + | + | TEF*, TEAD1, TEAD4, TEAD3; DMRT4, RFX5, TCF12 | SIN3A | |
| 18:72168607 | rs2303463 | G/A | exon, synonymous-coding | + | + | + | EGR2, EGR3 | ||
| 18:72170298 | rs4891557 | C/T | intronic | + | − | + | ESR1, EWSR1-FLI1; ERalpha-a, EWSR1-FLI1, Irf, RXRA, VDR | ||
| 18:72170396 | rs4891559 | G/A | intronic | + | − | + | −; Bbx, Foxj2 | ||
| 18:72174022 | A/G | intronic | + | + | + | ||||
| 18:72174979 | rs3829640 | A/G | intronic | + | + | + | −; Bcl6b | ||
| 18:72176082 | rs2278161 | T/C | exon, nonsynonymous- coding | + | + | + | −; BDP1 | GATA2*, TAL1*, MYC | |
| 18:72177231 | rs11151960 | G/A | intronic | + | + | + | Sox13, Zic1, Zic2; E2F, Tel2 | PAX5 | |
| 18:72178160 | rs2278159 | T/C | exon, synonymous-coding | + | + | + | −; Maf | ||
| 18:72178299 | rs2278158 | G/T | intronic | + | + | + | −; Ets, TATA | ||
| 18:72179578 | rs734559 | G/A | intronic | + | + | + | Lyf-1, PTF1-beta; CEBPB, HMG-1Y, Ik-1, IK-2, NF-AT, NF-AT1, PTF1-beta, Pou5f1 | ||
| 18:72182964 | rs3794950 | G/A | intronic | + | + | + | −; Jundm2 |
RefSNP ID = reference SNP identification number in NCBI reference assembly, eQTL = expression quantitative trait loci, DNase = deoxyribonuclease.
1GWAS significant SNPs are bolded.
2Promoter and Enhancer histone marks are included.
3Data of motif altered and protein bound are separated by semicolons: before from RegulomeDB and after from Haploreg database. The repeated sites in both databases are marked with an asterisk.
LD for GWAS significant SNPs with functional proxies.
| GWAS significant SNP | Functional Proxy SNPs[ | RegulomeDB Score | |
|---|---|---|---|
| rs10512441 | 0.9 | 1a | |
| rs17183628 | 0.9 | 1d | |
| rs17781142 | 0.9 | 1f | |
| rs1538583 | 1.0 | 1b | |
| rs11145326 | 0.8 | 1f | |
| rs11145488 | 1.0 | 1f | |
| rs3138142 | 1.0 | 1d | |
| rs3138144 | 1.0 | 1f | |
| rs1652333 | 1.0 | 1f | |
| rs2802236 | 1.0 | 1b | |
| rs2564974 | 1.0 | 1b | |
| rs2564978 | 1.0 | 1b | |
| rs12095015 | 1.0 | 1d | |
| rs6700168 | 1.0 | 1d | |
| rs1346720 | 0.8 | 1f | |
| rs7545125 | 1.0 | 1f | |
| rs1572275 | 1.0 | 1f | |
| rs4844592 | 1.0 | 1f | |
| rs10864231 | 1.0 | 1f | |
| rs1507758 | 1.0 | 1f | |
| rs1507760 | 1.0 | 1f | |
| rs12971120 | 1.0 | 1f | |
| rs8084410 | 0.9 | 1b | |
| rs11151960 | 0.8 | 1d | |
| rs747176 | 0.8 | 1f | |
| rs3764509 | 0.9 | 1f | |
| rs12605820 | 0.8 | 1f | |
| rs2303463 | 0.8 | 1f | |
| rs4891557 | 0.8 | 1f | |
| rs4891559 | 0.9 | 1f | |
| rs3829640 | 1.0 | 1f | |
| rs2278161 | 1.0 | 1f | |
| rs2278159 | 0.8 | 1f | |
| rs2278158 | 0.8 | 1f | |
| rs734559 | 1.0 | 1f | |
| rs3794950 | 0.8 | 1f |
#GWAS significant SNPs with regulatory potential are bolded, *RegulomeDB score are confined to the range 1a–f.
Distribution of RegulomeDB variants categories6.
| Category | Description | SNPs Number |
|---|---|---|
| Likely to affect binding and linked to expression of a gene target | ||
| 1a | eQTL + TF binding + matched TF motif + matched DNase footprint + DNase peak | 1 |
| 1b | eQTL + TF binding + any motif + DNase footprint + DNase peak | 5 |
| 1c | eQTL + TF binding + matched TF motif + DNase peak | 0 |
| 1d | eQTL + TF binding + any motif + DNase peak | 5 |
| 1e | eQTL + TF binding + matched TF motif | 0 |
| 1f | eQTL + TF binding/DNase peak | 25 |
| Likely to affect binding | ||
| 2a | TF binding + matched TF motif + matched DNase footprint + DNase peak | 1 |
| 2b | TF binding + any motif + DNase footprint + DNase peak | 24 |
| 2c | TF binding + matched TF motif + DNase peak | 0 |
| Less likely to affect binding | ||
| 3a | TF binding + any motif + DNase peak | 48 |
| 3b | TF binding + matched TF motif | 1 |
| Minimal binding evidence | ||
| 4 | TF binding + DNase peak | 85 |
| 5 | TF binding or DNase peak | 241 |
| 6 | Motif hit | 226 |
| No data[ | 206 | |
eQTL = expression quantitative trait loci, TF = transcription factor, DNase = deoxyribonuclease, #No data is available in RegulomeDB scoring scheme.