| Literature DB >> 28283040 |
Warren A Cheung1,2, Xiaojian Shao1,2, Andréanne Morin1,2, Valérie Siroux3, Tony Kwan1,2, Bing Ge1,2, Dylan Aïssi3, Lu Chen4,5, Louella Vasquez4, Fiona Allum1,2, Frédéric Guénard6, Emmanuelle Bouzigon7, Marie-Michelle Simon2, Elodie Boulier2, Adriana Redensek2, Stephen Watt4, Avik Datta8, Laura Clarke8, Paul Flicek8, Daniel Mead4, Dirk S Paul9,10, Stephan Beck9, Guillaume Bourque1,2, Mark Lathrop1,2, André Tchernof11, Marie-Claude Vohl6, Florence Demenais7, Isabelle Pin3,12, Kate Downes5,13, Hendrick G Stunnenberg14, Nicole Soranzo4,5,15,16, Tomi Pastinen17,18, Elin Grundberg19,20.
Abstract
BACKGROUND: The functional impact of genetic variation has been extensively surveyed, revealing that genetic changes correlated to phenotypes lie mostly in non-coding genomic regions. Studies have linked allele-specific genetic changes to gene expression, DNA methylation, and histone marks but these investigations have only been carried out in a limited set of samples.Entities:
Mesh:
Year: 2017 PMID: 28283040 PMCID: PMC5346261 DOI: 10.1186/s13059-017-1173-7
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1Samples having multiple layer of epigenetics profiles were used in this project. DNA methylation profiles were assessed for all samples using either whole genome bisulfite sequencing (WGBS; green) or targeted bisulfite sequencing (MCC-Seq; red). Listed in this figure are the number of samples used for analyses focusing on: a methylation sequencing alone (Methyl-Seq), b methylation with matched RNA-Seq from the same sample, or c methylation sequencing with matched ChIP-Seq (using six different histone marks) and matched RNA-Seq from the same sample
Summary of targeted bisulfite-sequencing methylome capture panel design
| Total CpGs | Total regions | Total size (bp) | |
|---|---|---|---|
| Regulatory regions in immune cells (DNaseI hypersensitive/active chromatin) | 1,837,099 | 315,043 | 48,810,975 |
| Hypomethylated footprints from immune cells (MethylSeekR) | 3,539,071 | 477,795 | 86,414,084 |
| Illumina 450 K methylation assay included regions | 1,934,175 | 328,405 | 40,853,151 |
| Autoimmune SNPs from GWAS catalog | 14,572 | 7273 | 678,026 |
| Unique (non-overlapping content) in custom MCC-Seq capture panel | 4,609,564 | 822,884 | 119,089,296 |
Fig. 2a CpGs showing significant (q < 0.01) imbalanced methylation, significant (q < 0.1) allelic methylation, and significant (q < 0.1) non-allelic methylation. Percentages indicate the proportion of the total significant CpGs found in any of these three sets (n = 962,557 of the 2,233,846 CpGs tested in all three tests). b Counts of CpGs showing allele-specific (AS)-genetic (ASM q < 0.1), non-allele-specific (NAS)-genetic (ASM q ≥ 0.1 and mQTL < 0.1), epigenetic (GIT q < 0.01, ASM and mQTL ≥ 0.1), and no (GIT q > 0.01, ASM and mQTL q ≥ 0.1) associations. Frequencies are plotted for all the CpGs, and also for CpGs in each of the ChromHMM regions. TSS transcription start site. c Fold change of the rate of putatively epigenetic (GIT q < 0.01, ASM and mQTL q ≥ 0.1) versus genetic (ASM or mQTL q < 0.1). ***Ratios with Fisher p < 0.0001; all other p values were >0.05. d Distribution of allelic ratios at significant GIT and ASM CpGs, for H3K27ac and H3K4me1 in normal T cells
Summary of CpG phasing SNP overlaps between ASM, GIT, and mQTL
| Test A | Significant (test A) | Test B | Significant (test B) | Significant (test A and B) | Total (test A and B) |
|---|---|---|---|---|---|
| ASM | 108,515 | mQTL | 164,671 | 70,700 | 2,319,084 |
| GIT | 818,954 | ASM | 158,444 | 146,012 | 4,462,724 |
| GIT | 603,033 | mQTL | 164,671 | 102,663 | 2,319,084 |
Tissue-specific number of CpGs tested and number of significant CpGs for the mQTL, ASM, and GIT analyses
| Cell type | Naïve T cells | Visceral adipose tissue | Whole blood | Common |
|---|---|---|---|---|
| All mQTLs tested | 3,140,791 | 1,959,622 | 4,261,030 | 1,602,686 |
| Significant mQTLs ( | 501,606 | 170,155 | 769,853 | 555,423 |
| All ASM tested | 3,109,121 | 1,713,482 | 1,636,884 | 1,079,807 |
| Significant ASM ( | 60,559 | 38,827 | 81,126 | 7,682 |
| All GIT tested | 2,944,290 | 1,714,250 | 1,076,251 | 622,721 |
| Significant GIT ( | 278,516 | 301,517 | 486,201 | 88,599 |
Fig. 3The proportion and number of sites of cell type-specific methylation in adipose tissue, naïve T cells (nTC), and whole blood (WB). The red segments at the top show the proportion of CpGs that are specific to the specific tissue, and the purple segments at the bottom show the proportion of CpGs that were found in all three cell types. The yellow, green and blue bars show CpGs that are shared between the specific cell type and adipose tissue, naïve T cells, and whole blood, respectively. Shown is the breakdown in the three cell types for a significant mQTL CpGs (q < 0.1), b significant ASM CpGs (q < 0.1), c putative epigenetic (filtered for ASM and mQTL q > 0.1) GIT CpGs (GIT q < 0.01). d Enrichment GWAS SNPs associated with significant ASM in three different tissues—naïve T cells (TC, blue), whole blood (WB, green) and visceral adipose tissue (VAT, red). We show enrichment for disease-associated loci from eight different traits (celiac disease, Crohn’s disease (CD), inflammatory bowel disease (IBD), ulcerative colitis (UC), multiple sclerosis (MS), rheumatoid arthritis (RA), type 1 diabetes (T1D), and type 2 diabetes (T2D)) and SNPs associated with ASM
Fig. 4Density of allelic and non-allelic methylation versus gene expression correlation (R) and cumulative distribution of the absolute allelic and non-allelic correlation (|R|) for each dataset. a Density plot of significant (p < 0.05) correlations detected among sites (NCpG = 241,687, Ntests = 441,931) tested for allelic and non-allelic correlation for CpGs measured via MCC-Seq. b Density plot of significant correlations for sites (NCpG = 40,315, Ntests = 58,106) with methylation estimated by WGBS. c Empirical cumulative density function (ECDF) plot of the absolute correlation for sites evaluated using MCC-Seq. d ECDF for sites evaluated by WGBS. e Smoothed color density scatter plot of sites comparing significant non-allelic (x-axis) and allelic (y-axis) correlation (p < 0.05 for both correlation tests) for MCC-Seq. Red indicates high density, blue indicates low density, and white indicates no data. f Smoothed color density scatter plot of significant non-allelic versus allelic correlations for WGBS
Fig. 5a Fold change difference in fraction of CpG regions (three or more consecutive significantly allelically differentially methylated CpGs) in ChromHMM-assigned state versus the fraction of single significantly allelically differentially methylated CpGs in the same ChromHMM state. The x-axis lists the eight ChromHMM states and each colored bar shows a different cell type/methylation interrogation technology. b Proportion of correlations tested in each ChromHMM state with CpG methylation sequenced by WGBS and MCC-Seq. c Fold-enrichment of positively correlated CpGs evaluated by MCC-Seq in each ChromHMM state. d Fold-enrichment of positive WGBS correlations. e Fold-enrichment of negatively correlated MCC-Seq CpGs per state. f Fold-enrichment of negatively correlated WGBS CpGs per state
Fig. 6a Proportion of sites showing differential allelic methylation, histone occupancy, and gene expression. All four combinations of high and low methylation rate, histone occupancy rate, and gene expression are compared, described from the perspective of the high methylation allele, and whether this allele is the one with high or low histone occupancy, and high or low gene expression (note that high methylation allele with low histone occupancy also refers to the low methylation allele with high histone occupancy). For the histone marks b H3K27ac, c H3K4me1, d H3K36me3, and e H3K27me3, we show the proportions of all the differential allelic methylation, histone occupancy, and gene expression tested (blue), and proportions of the tested sites that passed significance p < 0.05 (red). f The percentage of allelic differentially modified histone sites as identified by GSCI that have an ASM ratio at the SNP-associated CpG in the top 1% versus the percentage of histone sites not having an allelic effect detected by GSCI but having an ASM ratio at the SNP-associated CpG in the top 1%
Fig. 7a The distribution of the mean methylation of CpGs by ChromHMM state, for CpGs in ChromHMM states where we see high correlation between gene expression and H3K27ac (left) or H3K4me1 (right) histone peaks. Bar graphs show the log2 mean allelic ratio of the methylation on the high gene expression allele versus the low gene expression allele (green; indicating p < 0.05) b Positional distribution of CpGs by distance from the center of the ChromHMM state, for all the ChromHMM states having at least 100 CpGs c Summary of log2 mean allelic methylation ratio of CpGs at each distance from the center of the ChromHMM state bin, for the high gene expression allele versus low gene expression allele