| Literature DB >> 31666072 |
Kerry E Poppenberg1,2, Kaiyu Jiang3, Michael K Tso1,4, Kenneth V Snyder1,4,5, Adnan H Siddiqui1,4,5, John Kolega1,6, James N Jarvis3,7, Hui Meng1,2,4,8, Vincent M Tutino9,10,11.
Abstract
BACKGROUND: Genetics play an important role in intracranial aneurysm (IA) pathophysiology. Genome-wide association studies have identified several single nucleotide polymorphisms (SNPs) that are linked to IA but how they affect disease pathobiology remains poorly understood. We used Encyclopedia of DNA Elements (ENCODE) data to investigate the epigenetic landscapes surrounding genetic risk loci to determine if IA-associated SNPs affect functional elements that regulate gene expression and if those SNPs are most likely to impact a specific type of cells.Entities:
Keywords: Cell type; Cerebral aneurysm; Epigenetics; Genetics; Risk
Mesh:
Substances:
Year: 2019 PMID: 31666072 PMCID: PMC6821037 DOI: 10.1186/s12920-019-0591-7
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Positional information for 16 IA-risk single nucleotide polymorphisms and the associated linkage disequilibrium blocksa
| SNP | LD Block | SNP Location | Nearest Gene in LD Block |
|---|---|---|---|
| rs4934 | chr14:95078677–95,080,803 | Exonic |
|
| rs42524 | chr7:94043239–94,049,356 | Exonic |
|
| rs1132274 | chr20:17594030–17,600,114 | Exonic |
|
| rs1800255 | chr2:189841613–189,867,882 | Exonic |
|
| rs1429412 | chr2:198148191–198,223,121 | Intergenic |
|
| rs6841581 | chr4:148365339–148,414,651 | Intergenic |
|
| rs9298506 | chr8:55421614–55,462,324 | Intergenic | – |
| rs10757278 | chr9:22077085–22,125,503 | Intergenic |
|
| rs10958409 | chr8:55309731–55,328,116 | Intergenic | – |
| rs251124 | chr5:82805424–82,826,254 | Intronic |
|
| rs700651 | chr2:198541398–198,631,714 | Intronic |
|
| rs1800796 | chr7:22766246–22,771,738 | Intronic |
|
| rs2891168 | chr9:22072264–22,125,503 | Intronic |
|
| rs3767137 | chr1:22160723–22,168,310 | Intronic |
|
| rs4628172 | chr7:15493884–15,506,529 | Intronic |
|
| rs6538595 | chr12:95489131–95,516,843 | Intronic |
|
aSNPs reported by Alg et al. [25] were mapped to LD blocks using SNAP and SNiPA tools. The UCSC Genome Browser was used to visually determine SNP location and nearest gene
Abbreviations: chr Chromosome, LD Linkage disequilibrium, rs Reference SNP cluster ID, SNP Single nucleotide polymorphism, “–” = no gene)
Histone marks present in IA-associated linkage disequilibrium blocksa
| LD Block | SNP | HUVEC | M0 | PNL | PBMC | |||
|---|---|---|---|---|---|---|---|---|
| H3K4me1b | H3K27ac b | H3K4me1 | H3K27ac | H3K4me1 | H3K27ac | H3K4me1 | ||
| chr1:22160723–22,168,310 | rs3767137 | Y | N | N | N | N | N | N |
| chr2:189841613–189,867,882 | rs1800255 | Y | Y | N | N | N | N | N |
| chr2:198148191–198,223,121 | rs1429412 | Y | Y | Y | Y | Y | Y | Y |
| chr2:198541398–198,631,714 | rs700651 | Y | Y | Y | Y | Y | Y | Y |
| chr4:148365339–148,414,651 | rs6841581 | Y | N | N | N | N | N | N |
| chr5:82805424–82,826,254 | rs251124 | Y | Y | Y | Y | N | N | Y |
| chr7:15493884–15,506,529 | rs4628172 | Y | N | N | N | N | N | N |
| chr7:22766246–22,771,738 | rs1800796 | Y | Y | Y | Y | Y | N | Y |
| chr7:94043239–94,049,356 | rs42524 | Y | N | N | N | N | N | N |
| chr8:55309731–55,328,116 | rs10958409 | Y | Y | N | N | N | N | N |
| chr8:55421614–55,462,324 | rs9298506 | Y | Y | N | N | N | N | N |
| chr9:22072264–22,125,503 | rs2891168 | Y | Y | Y | N | N | N | Y |
| chr9:22077085–22,125,503 | rs10757278 | Y | Y | Y | N | N | N | Y |
| chr12:95489131–95,516,843 | rs6538595 | Y | Y | Y | Y | Y | N | Y |
| chr14:95078677–95,080,803 | rs4934 | N | N | N | N | N | N | N |
| chr20:17594030–17,600,114 | rs1132274 | Y | N | Y | Y | Y | Y | Y |
a“Y” indicates histone mark for given cell type was present within that LD block. Marks that occur at statistically greater than expected frequency are denoted with b. Abbreviations: LD Linkage disequilibrium, chr Chromosome, rs Reference SNP cluster ID, SNP Single nucleotide polymorphism, Y Yes, N No, HUVEC Human umbilical vein endothelial cell, M0 Monocyte, PNL Polymorphonuclear leukocyte - neutrophil, PBMC Peripheral blood mononuclear cell
Fig. 1a UCSC Genome Browser screenshot of epigenetic landscape around SNP rs1800255 within the exon of COL3A1. Black horizontal bar at top represents the LD block of the associated IA-risk SNP. The next tracks in descending order within panel A are the H3K4me1 and H3K27ac peak regions generated from ChIP-Seq data for monocytes (CD14+), PBMCs, and HUVECs. Gray bars in the lower section represent transcription factor ChIP-Seq of 8 factors from ENCODE with data available for HUVECs. Notice that COL3A1 is expressed in this LD block. Histone marked regions, represented by the gray bars in the middle of this panel, are only present for HUVECs and encompass multiple TFBSs. b Tracks for neutrophil H3K4me1 and H3K27ac peak regions generated from ChIP-Seq data. Key: UCSC=University of California Santa Cruz, SNP = single nucleotide polymorphism, COL3A1 = collagen type III alpha chain a, LD = linkage disequilibrium, IA = intracranial aneurysm, PBMC = peripheral blood mononuclear cell, HUVEC = human umbilical vein endothelial cell, ChIP-Seq = chromatin immunoprecipitation-sequencing, TFBS = transcription factor binding site; ENCODE = Encyclopedia of DNA [deoxyribonucleic acid] Elements
Fig. 2a UCSC Genome Browser screenshot of epigenetic landscape around intergenic SNP rs10958409. Black horizontal bar at the top represents the LD block of the associated IA-risk SNP. The next tracks are for monocyte (CD14+), PBMC, and HUVEC H3K4me1 and H3K27ac peak regions generated from ChIP-Seq data. Gray bars in the lower section of panel A represent transcription factor ChIP-Seq of 8 factors from ENCODE with data available for HUVECs. Histone marked regions, represented by the gray bars in the middle of panel A, are only present for HUVECs and encompass multiple TFBS. b Tracks for neutrophil H3K4me1 and H3K27ac peak regions generated from ChIP-Seq data. Key: UCSC=University of California Santa Cruz, SNP = single nucleotide polymorphism, LD = linkage disequilibrium, IA = intracranial aneurysm, PBMC = peripheral blood mononuclear cell, HUVEC = human umbilical vein endothelial cell, ChIP-Seq = chromatin immunoprecipitation-sequencing, TFBS = transcription factor binding site). ENCODE = Encyclopedia of DNA [deoxyribonucleic acid] Elements
CTCF binding sites present in IA-associated linkage disequilibrium blocks for endothelial cells and monocytesa
| LD Block | SNP | HUVEC | M0 |
|---|---|---|---|
| chr1:22160723–22,168,310 | rs3767137 | Y | Y |
| chr2:189841613–189,867,882 | rs1800255 | N | N |
| chr2:198148191–198,223,121 | rs1429412 | Y | Y |
| chr2:198541398–198,631,714 | rs700651 | Y | Y |
| chr4:148365339–148,414,651 | rs6841581 | N | N |
| chr5:82805424–82,826,254 | rs251124 | Y | Y |
| chr7:15493884–15,506,529 | rs4628172 | N | N |
| chr7:22766246–22,771,738 | rs1800796 | Y | Y |
| chr7:94043239–94,049,356 | rs42524 | N | N |
| chr8:55309731–55,328,116 | rs10958409 | Y | N |
| chr8:55421614–55,462,324 | rs9298506 | Y | N |
| chr9:22072264–22,125,503 | rs2891168 | N | N |
| chr9:22077085–22,125,503 | rs10757278 | Y | Y |
| chr12:95489131–95,516,843 | rs6538595 | N | N |
| chr14:95078677–95,080,803 | rs4934 | Y | N |
| chr20:17594030–17,600,114 | rs1132274 | Y | Y |
a “Y” indicates CTCF binding site for given cell type was present within that LD block. Data were unavailable for neutrophils and peripheral blood mononuclear cells. Abbreviations: CTCF CCCTC binding factor site, LD Linkage disequilibrium, chr Chromosome, rs Reference SNP cluster ID, SNP Single nucleotide polymorphism, Y Yes, N No, HUVEC Human umbilical vein endothelial cell, M0 Monocyte
TF binding sites present within histone-marked regions of IA-associated LD blocks for endothelial cellsa
| LD Block | SNP | Distal TF | Promoter TF |
|---|---|---|---|
| chr1:22160723–22,168,310 | rs3767137 | – | POLR2A |
| chr2:189841613–189,867,882 | rs1800255 | FOS, GATA2, JUN | – |
| chr2:198148191–198,223,121 | rs1429412 | FOS, GATA2, JUN | – |
| chr2:198541398–198,631,714 | rs700651 | FOS | MAXb, MYCb, POLR2Ab |
| chr4:148365339–148,414,651 | rs6841581 | – | EZH2 |
| chr5:82805424–82,826,254 | rs251124 | JUNb | – |
| chr7:15493884–15,506,529 | rs4628172 | FOS | – |
| chr7:22766246–22,771,738 | rs1800796 | FOSb, GATA2b, JUNb, POLR2Ab | POLR2Ab |
| chr7:94043239–94,049,356 | rs42524 | FOS, JUN | – |
| chr8:55309731–55,328,116 | rs10958409 | FOS, GATA2, JUN | – |
| chr8:55421614–55,462,324 | rs9298506 | – | – |
| chr9:22072264–22,125,503 | rs2891168 | FOSb, GATA2b, JUNb, POLR2Ab | – |
| chr9:22077085–22,125,503 | rs10757278 | FOSb, GATA2b, JUNb, POLR2Ab | – |
| chr12:95489131–95,516,843 | rs6538595 | FOSb, GATA2b, JUNb, POLR2Ab | – |
| chr14:95078677–95,080,803 | rs4934 | – | – |
| chr20:17594030–17,600,114 | rs1132274 | – | – |
a TF binding site identification only performed for HUVEC. All TFs were present within H3K4me1-marked regions of IA-associated LD blocks. TFs denoted with b were present in regions of IA-associated LD blocks marked by both H3K4me1 and H3K27ac histone modifications. (IA Intracranial aneurysm, LD Linkage disequilibrium, chr Chromosome, SNP Single nucleotide polymorphism, HUVEC Human umbilical vein endothelial cell, “–” = no TF)
DAVID Functional annotation resultsa
| Term | Source | No. of genes | Q-Value | |
|---|---|---|---|---|
| Serpin domain | INTERPRO | 7 | 4.30E-10 | 4.90E-08 |
| Serpin family | INTERPRO | 7 | 4.30E-10 | 4.90E-08 |
| SERPIN | SMART | 7 | 9.00E-10 | 3.30E-08 |
| Serine protease inhibitor | UP_KEYWORDS | 7 | 4.20E-08 | 5.50E-06 |
| Serine-type endopeptidase inhibitor activity | GOTERM_MF_DIRECT | 7 | 1.80E-07 | 1.80E-05 |
| Protease inhibitor | UP_KEYWORDS | 7 | 3.70E-07 | 2.50E-05 |
| Negative regulation of endopeptidase activity | GOTERM_BP_DIRECT | 6 | 1.20E-05 | 4.80E-03 |
| Extracellular matrix structural constituent | GOTERM_MF_DIRECT | 5 | 2.70E-05 | 1.40E-03 |
| Protease inhibitor I4, serpin, conserved site | INTERPRO | 4 | 7.40E-05 | 4.20E-03 |
| Extracellular space | GOTERM_CC_DIRECT | 13 | 9.30E-05 | 1.00E-02 |
| Extracellular matrix | GOTERM_CC_DIRECT | 7 | 9.50E-05 | 5.20E-03 |
| Extracellular matrix organization | GOTERM_BP_DIRECT | 6 | 1.20E-04 | 2.40E-02 |
| Propeptide: C-terminal propeptide | UP_SEQ_FEATURE | 3 | 1.40E-04 | 3.80E-02 |
| Domain: Fibrillar collagen NC1 | UP_SEQ_FEATURE | 3 | 2.80E-04 | 3.70E-02 |
| Extracellular matrix | UP_KEYWORDS | 6 | 3.00E-04 | 1.30E-02 |
| Fibrillar collagen, C-terminal | INTERPRO | 3 | 3.30E-04 | 1.20E-02 |
| Secreted | UP_KEYWORDS | 14 | 3.30E-04 | 1.10E-02 |
| Skeletal system development | GOTERM_BP_DIRECT | 5 | 3.70E-04 | 4.70E-02 |
| Ehlers-Danlos syndrome | UP_KEYWORDS | 3 | 4.60E-04 | 1.20E-02 |
| COLFI | SMART | 3 | 4.70E-04 | 8.60E-03 |
aFunctional annotation results returned from DAVID using genes within ±200 kb of intracranial aneurysm-associated linkage disequilibrium blocks. Abbreviations: DAVID Database for Annotation, Visualization and Integrated Discovery; No. Number
Fig. 3Gene Ontology term finder molecular function results for genes within extended IA-associated LD blocks. Blocks in blue represent significant ontologies
Differentially methylated regions present within IA-associated linkage disequilibrium blocksa
| LD Block | SNP | Differentially Methylated Region |
|---|---|---|
| chr2:198148191–198,223,121 | rs1429412 | chr2:198173194–198,173,317 |
| chr4:148365339–148,414,651 | rs6841581 | chr4:148413287–148,413,410 |
| chr20:17594030–17,600,114 | rs1132274 | chr20:17595355–17,595,478 |
| chr20:17595448–17,595,571 |
aDifferentially methylated regions identified in tissue study by Yu et al. (32)
Abbreviations: chr Chromosome, LD Linkage disequilibrium, rs Reference SNP cluster ID, SNP Single nucleotide polymorphism