| Literature DB >> 31500627 |
Masataka Kikuchi1, Norikazu Hara2, Mai Hasegawa3, Akinori Miyashita2, Ryozo Kuwano2,4, Takeshi Ikeuchi2, Akihiro Nakaya5.
Abstract
BACKGROUND: Genome-wide association studies (GWASs) have identified single-nucleotide polymorphisms (SNPs) that may be genetic factors underlying Alzheimer's disease (AD). However, how these AD-associated SNPs (AD SNPs) contribute to the pathogenesis of this disease is poorly understood because most of them are located in non-coding regions, such as introns and intergenic regions. Previous studies reported that some disease-associated SNPs affect regulatory elements including enhancers. We hypothesized that non-coding AD SNPs are located in enhancers and affect gene expression levels via chromatin loops.Entities:
Keywords: Alzheimer’s disease; Chromatin higher-order structure; Genome-wide association study; Non-coding variants
Mesh:
Substances:
Year: 2019 PMID: 31500627 PMCID: PMC6734281 DOI: 10.1186/s12920-019-0574-8
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Fig. 1Flowchart of the present study
Fig. 2Nearly 30% of non-coding AD SNPs are located in enhancers. a Circle chart showing the genomic region location of AD SNPs from the GWAS catalog database (p-value < 1.00 × 10− 6). b Circle chart showing the proportions of non-coding AD SNPs located in non-enhancer regions and in enhancers identified in one or more tissues or cell types. “Brain Enhancer” indicates non-coding AD SNPs located in enhancers identified in one or more brain tissues. “Non-Brain Enhancer” indicates non-coding AD SNPs located in enhancers that were not identified in brain tissues but were identified in the other tissues or cell types. All tissue and cell type names are described in Additional file 1: Table S2
Gene functional enrichment analysis
| GO | Category | Description | Count | % | Log10(P) | Log10(q) |
|---|---|---|---|---|---|---|
| GO:1902430 | GO Biological Processes | Negative regulation of amyloid-beta formation | 3 | 2.38 | -5.03 | -0.94 |
| GO:0007271 | GO Biological Processes | Synaptic transmission, cholinergic | 4 | 3.17 | -4.47 | -0.91 |
| R-HSA-1834949 | Reactome Gene Sets | Cytosolic sensors of pathogen-associated DNA | 5 | 3.97 | -4.34 | -0.89 |
| GO:0002768 | GO Biological Processes | Immune response-regulating cell surface receptor signaling pathway | 11 | 8.73 | -4.29 | -0.89 |
| GO:0072665 | GO Biological Processes | Protein localization to vacuole | 4 | 3.17 | -3.86 | -0.8 |
| GO:0006353 | GO Biological Processes | DNA-templated transcription, termination | 5 | 3.97 | -3.45 | -0.59 |
| GO:0016032 | GO Biological Processes | Viral process | 13 | 10.32 | -3.35 | -0.53 |
| GO:0007169 | GO Biological Processes | Transmembrane receptor protein tyrosine kinase signaling pathway | 11 | 8.73 | -2.74 | -0.26 |
| GO:0071466 | GO Biological Processes | Cellular response to xenobiotic stimulus | 5 | 3.97 | -2.62 | -0.2 |
| GO:0007127 | GO Biological Processes | Meiosis I | 4 | 3.17 | -2.36 | -0.02 |
| GO:0031329 | GO Biological Processes | Regulation of cellular catabolic process | 11 | 8.73 | -2.33 | -0.02 |
| R-HSA-5653656 | Reactome Gene Sets | Vesicle-mediated transport | 10 | 7.94 | -2.31 | -0.01 |
| M254 | Canonical Pathways | PID MYC REPRESS PATHWAY | 3 | 2.38 | -2.27 | 0 |
| GO:0043547 | GO Biological Processes | Positive regulation of GTPase activity | 7 | 5.56 | -2.09 | 0 |
| GO:0002274 | GO Biological Processes | Myeloid leukocyte activation | 9 | 7.14 | -2.06 | 0 |
Terms with p-value < 0.01, minimum count 3, and enrichment factor > 1.5 (enrichment factor is the ratio between observed count and the count expected by chance) are collected and grouped into clusters based on their membership similarities
Statistical test for DEG enrichment in eQTL genes
“Brain region” includes the following regions; TC temporal cortex, CBE cerebellum, PFC prefrontal cortex, VC visual cortex, EC entorhinal cortex, HIP hippocampus, MTG medial temporal gyrus, SFG superior frontal gyrus, PC posterior cingulate. “Samples” indicates the number of samples in each dataset. “Significant DEG Count” indicates the number of significantly differentially expressed genes in eQTL genes dataset. “FER” indicates the number of the observed genes to the expected value. “P-value” were calculated from hypergeometic distribution test
List of 19 SNPs that were located at protein-binding sites and that have eQTLgene(s)
“eQTL” indicates whether the closest gene is eQTL gene or not. “Enhancer Tissue Count” indicates the number of tissues or cell lines showed enhancer activity. “GWAS p-value” indicates p-value from GWAS Catalog database
Fig. 3AD SNPs and their eQTL genes co-localize in topologically associating domains (TADs). Heatmap showing the frequency of chromatin interactions based on tethered conformation capture (TCC) experiments in the astrocytoma cell line U-251MG (100-kb bins). Diagonal darker blocks indicate TAD. AD SNP could contact the distal eQTL genes via chromatin interactions
eQTL genes in TAD
Fig. 4AD SNPs with eQTL effects are often located at protein-binding sites. a, b Cumulative bar graph of the chromatin state across 127 tissues or cell types (upper panel) and ChIP-seq tracks (bottom panel) around rs1476679 (a) and rs7364180 (b). Three representative chromatin state groups of the cumulative bar graph are depicted according to the color legend. The chromatin state names are shown in parentheses (see details in Methods). Details of all 25 chromatin state names are described in Additional file 1: TableS3. Grey bars in ChIP-seq tracks represent peak clusters of transcription factor (TF) occupancy. The color intensity of the bars is proportional to the level of TF occupancy. Green bars represent motif sites for the corresponding TFs. These ChIP-seq tracks were generated from the UCSC genome browser (https://genome.ucsc.edu/). c A schematic representation of a chromatin loop based on CTCF binding
Fig. 5rs1476679 spatially contacts many eQTL genes via CTCF-mediated chromatin loops and affects their expression levels. a Chromatin interactions of the rs1476679 locus as determined by TCC experiments. Red and blue lines represent significant chromatin interactions from rs1476679 in SK-N-SH and U-251 MG cells, respectively. b Zoom-in region of the rs1476679 locus. The upper panel indicates chromatin interactions of the s1476679 locus. The orange and green bands indicate gene bodies on the positive and negative strands, respectively. Gene symbols in red indicate the eQTL genes of rs1476679. Asterisks indicate chromatin interactions of the rs1476679 locus with the GATS, PILRB, and PILRA genes. In the bottom panel, the color plot indicates the peak scores from ChIP-seq data for CTCF or RNA polymerase II (RNAPII) and DNase-seq data showing DNase I-hypersensitive sites (DHSs). Each row in the colored plot represents different brain tissues or neuronal cell lines (18 experiments (rows) including nine tissues or cell lines based on CTCF ChIP-seq, 21 experiments including 10 cell lines based on RNAPII ChIP-seq, and 82 experiments including 31 cell lines based on DNase-seq; Additional file 1: Table S11). c GATS, PILRB, and PILRA expression levels in the hippocampus and entorhinal cortex from GSE5281. Boxes represent the interquartile range between the first and third quartiles and median (internal line). Whiskers denote the lowest and highest values within 1.5 times the range of the 1 first and third quartiles, respectively; dotsrepresent GATS, PILRB, and PILRA expression levels in each sample