| Literature DB >> 30918249 |
Yanni Zeng1, Carmen Amador1, Charley Xia1,2, Riccardo Marioni3,4, Duncan Sproul1,5, Rosie M Walker3,4, Stewart W Morris4, Andrew Bretherick1, Oriol Canela-Xandri1,2, Thibaud S Boutin1, David W Clark6, Archie Campbell4, Konrad Rawlik2, Caroline Hayward1, Reka Nagy1, Albert Tenesa1,2, David J Porteous3,4, James F Wilson1,6, Ian J Deary3,7, Kathryn L Evans3,4, Andrew M McIntosh3,8, Pau Navarro1, Chris S Haley9,10.
Abstract
Parent-of-origin effects (POE) exist when there is differential expression of alleles inherited from the two parents. A genome-wide scan for POE on DNA methylation at 639,238 CpGs in 5,101 individuals identifies 733 independent methylation CpGs potentially influenced by POE at a false discovery rate ≤ 0.05 of which 331 had not previously been identified. Cis and trans methylation quantitative trait loci (mQTL) regulate methylation variation through POE at 54% (399/733) of the identified POE-influenced CpGs. The combined results provide strong evidence for previously unidentified POE-influenced CpGs at 171 independent loci. Methylation variation at 14 of the POE-influenced CpGs is associated with multiple metabolic traits. A phenome-wide association analysis using the POE mQTL SNPs identifies a previously unidentified imprinted locus associated with waist circumference. These results provide a high resolution population-level map for POE on DNA methylation sites, their local and distant regulators and potential consequences for complex traits.Entities:
Mesh:
Year: 2019 PMID: 30918249 PMCID: PMC6437195 DOI: 10.1038/s41467-019-09301-y
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Study design
| ANALYSIS | AIM | MODEL |
|
|
|---|---|---|---|---|
| GKFSC VC | Understand sources of variation of methylation at CpG sites | CpG ~ G + K + F + S + C | 639,238 (CpGs) | G: 24,101 |
| POE-targeted VC | Find POE-influenced CpGs | Base: CpG ~ G + K | 639,238 (CpGs) | Complex: 606 |
| POE–mQTL | (a) Find POE-influenced CpGs | CpG ~ SNPADD + SNPDOM + SNPPOE | 7e9 (984 CpGs*7e6 SNP) | CpGs: 586 |
| POE–EWAS | Phenotypic consequence of POE-influenced CpGs | Trait~CpG | 26,568 (984 CpGs*27 independent traits) | CpGs: 14 |
| POE–PheWAS | Phenotypic consequence of POE–mQTL SNPs | Trait~SNPADD + SNPDOM + SNPPOE | 51,165 (1895 independent mQTLs*27 independent traits) | Traits: 1 |
The table shows an overview of the analyses performed (ANALYSIS), describing their aims (AIM) and the models used (MODEL), as well as the number of tests performed (Ntests) and the number of significant results obtained (N)
GKFSC VC variance component analyses to partition methylation level variation into its additive genetic (G: SNP associated, K: pedigree associated) and non-additive/environmental (F: family, S: sibling, C: couple) components, SNP single-nucleotide polymorphism
POE-targeted VC modified variance component analysis detects candidate methylation sites with parent-of-origin inheritance pattern (parent-of-origin effect, POE). Base: model without POE; complex: model including a complex POE component allowing for increased similarity between siblings; maternal: model including a POE component (SM) allowing for increased similarity between father and offspring and siblings; paternal: model including a POE component (Sp) allowing for increased similarity between mother and offspring and siblings;
POE–mQTL parent-of-origin effect methylation quantitative trait loci analyses, ADD Additive effect, DOM dominance effect,
POE–EWAS, complex trait association with methylation levels of POE CpGs,
POE–PheWAS, phenotype-wide association study accounting for parent-of-origin effects for parent-of-origin methylation level associated loci (POE–mQTL)
DNA methylation variation decomposed into genetic and environmental components
| Source | Mean PV | Maximum PV | First quartile PV | Third quartile PV | Nominal Sig. sites | Genome-wide Sig. sites |
|---|---|---|---|---|---|---|
|
| 9.5% | 99.20% | 0.67% | 12.98% | 162,800 | 24,101 |
|
| 7.2% | 97.10% | 0.00% | 11.05% | 59,117 | 1531 |
|
| 1.2% | 19.10% | 0.00% | 1.84% | 1946 | 0 |
|
| 1.4% | 46.30% | 0.00% | 2.24% | 23,600 | 78 |
|
| 2.1% | 33.50% | 0.00% | 3.18% | 14,514 | 0 |
Proportion of variation in methylation levels at the 639,238 studied CpG sites explained (PV) by (common SNP-associated additive genetic component), (pedigree-associated additive genetic component), (shared environmental effects between nuclear family members), (non-additive genetic or shared environmental effects between full siblings) and (shared environmental effects between members of a couple). The number of CpG sites that were significant in the component of interest, both at nominal and genome-wide level (Sig. sites at nominal and genome-wide levels) is also shown for each of the five components fitted
Fig. 1The expected phenotypic covariance structures between nuclear family members introduced by different POE patterns. The bar charts show putative levels of methylation associated with the four possible genotypes at a SNP controlling imprinting (paternal allele in blue, maternal allele in red). The family pedigrees show as shaded the family members between which similarity in methylation is increased due to these patterns of imprinting
Fig. 2Example of a novel CpG site (cg05875302) influenced by maternal imprinting POE. Upper panel: bars represent estimated variance explained by each component in the selected model for the site displaying significant maternal imprinting (cg05875302, red arrow) and the sites within 20 kb on either side of the selected site. Bottom left panel: regional plot of –loge (p-value from LRT) of the POE in the selected (red ringed black dot) and surrounding CpG sites with matrix of pairwise correlations of methylation level between these sites in the heatmap below. Bottom right panel: pairwise correlation between methylation M values (corrected for technical and biological covariates) between different pairs of nuclear family members
Fig. 3Genomic annotations significantly enriched in (red) or depleted of (blue) POE-influenced methylation CpGs. Error bars: 95% confidence interval
Classification of the identified 984 CpGs potentially influenced by POEs
| Strength of Evidence | POE–mQTL | Other studies | NCpG | VC POE model | NCpG per POE model |
|---|---|---|---|---|---|
| Strong |
| Replication (<2 kb) | 223 | Maternal/paternal | 104 |
| Strong |
| Overlap (2 kb−2 Mb) | 172 | Maternal/paternal | 79 |
| Strong | √ | Not identified (>2 Mb) | 191 | Maternal/paternal | 70 |
| Total strong | 586 | 586 | |||
| Moderate |
| Replication (<2 kb) | 11 | Maternal/paternal | 2 |
| Moderate |
| Overlap (2 kb–2 Mb) | 190 | Maternal/paternal | 71 |
| Moderate |
| Not identified (>2 Mb) | 197 | Maternal/paternal | 52 |
| Total moderate | 398 | 398 | |||
| Total | 984 | 984 |
The table classifies the 984 candidate CpG sites identified with the targeted POE variance component (VC) analysis into groups representing the support for the detected POE (strength of evidence) based on having or not an identified POE–mQTL (POE–mQTL), and if their position overlaps with previously published studies (other studies: replication: the CpG is in a region within 2 kb of a known imprinted region; overlap: the CpG is in a region between 2 kb and 2 Mb of a known imprinted region; not identified: the CpG is more than 2 Mb away from a known imprinted region). NCpGs is the number of CpG sites in each category. VC POE model indicates the selected VC model (maternal/paternal or complex imprinting) and NCpGs per POE model is the number of CpG sites in each subgroup
Fig. 4Methylation CpGs regulated by SNP rs231356. SNP rs231356 acts both as a cis-POE–mQTL and a trans-POE–mQTL. Red arrows: location of the CpG in the chromosome. Boxplots show the allelic effects of rs231356 on methylation of three CpG sites (cg09518720, in cis, and cg05884032 and cg23776532, in trans). Boxplots: centre line, median; box limits, upper and lower quartiles; whiskers, 1.5× interquartile range; points, outliers
Associations between POE CpGs and traits significant at the multi-trait level
| CpG | Evidence | Chr | Position (bp) | VC | Genic region | Gene name | Trait* | Est | SE | |
|---|---|---|---|---|---|---|---|---|---|---|
| cg11078090 | Strong | 1 | 23878540 | C | Upstream; downstream | E2F2; ID3; LOC101928163 | BMI | 2.22 × 10−7 | 0.033 | 0.006 |
| WC | 5.49 × 10−7 | 0.030 | 0.006 | |||||||
| cg08259905 | Strong | 3 | 62171428 | P | Intronic | PTPRG | Weight | 1.85 × 10−6 | −0.027 | 0.006 |
| cg00329615 | Strong | 3 | 118706648 | C | Intronic | IGSF11 | SBP | 2.48 × 10−7 | 0.024 | 0.005 |
| cg10755899 | Strong | 4 | 1772151 | C | Upstream; downstream | FGFR3; TACC3 | HDL | 1.75 × 10−8 | −0.029 | 0.005 |
| BMI | 2.46 × 10−8 | 0.020 | 0.004 | |||||||
| %Fat | 4.18 × 10−8 | 0.985 | 0.179 | |||||||
| cg01290904 | Moderate | 4 | 5708474 | P | Intronic | EVC2 | HDL | 2.36 × 10−7 | −0.046 | 0.009 |
| cg11064966 | Strong | 5 | 32506514 | C | Intergenic | None | Weight | 5.11 × 10−7 | 0.056 | 0.011 |
| cg12577411 | Strong | 6 | 15551489 | P | Intronic | DTNBP1 | %Fat | 9.60 × 10−9 | −2.27 | 0.394 |
| BMI | 1.03 × 10−8 | −0.045 | 0.008 | |||||||
| WC | 3.47 × 10−7 | −0.038 | 0.007 | |||||||
| Weight | 3.50 × 10−7 | −0.044 | 0.009 | |||||||
| cg15773890 | Strong | 6 | 17259549 | P | Upstream | RBM24 | Alcohol | 1.63 × 10−7 | −0.315 | 0.060 |
| cg05246100 | Strong | 7 | 55246275 | C | Intronic | EGFR | BMI | 1.02 × 10−6 | −0.037 | 0.008 |
| cg11613559 | Strong | 10 | 121577971 | C | Intronic | INPP5F | Alcohol | 2.96 × 10−7 | −0.132 | 0.026 |
| cg14391737 | Moderate | 11 | 86513429 | C | Intronic | PRSS23 | Hips | 1.85 × 10−7 | 0.014 | 0.003 |
| Weight | 6.86 × 10−7 | 0.025 | 0.005 | |||||||
| BMI | 8.50 × 10−7 | 0.023 | 0.005 | |||||||
| cg27272202 | Moderate | 12 | 5158794 | P | Downstream | KCNA5 | CREAT | 2.49 × 10−7 | 0.049 | 0.009 |
| cg08698721 | Strong | 14 | 101294147 | P | ncRNA intronic | MEG3 | Height | 1.89 × 10−7 | −1.08 | 0.206 |
| cg21740139 | Moderate | 17 | 60753158 | P | Exonic | MRC2 | CREAT | 1.83 × 10−6 | 0.045 | 0.009 |
The table shows the CpGs displaying POE (CpG), their location (Chr: chromosome and position in bp, location relative to the nearest gene (genic region) and name of the nearest gene(s) (gene name)), the pattern of imprinting detected in the variance component analysis (VC, C: complex, P: paternal), the strength of the evidence supporting the inference of POE (evidence) and the estimated correlation between methylation level and traits (Est, Trait), together with standard errors (SE) and an indication of significance (P-value of t test). *Further details on traits are given in Supplementary Table 4
Significant POE from cis-POE–mQTL rs6100212 on phenotypes and CpGs
| Type | Trait*/CpG | Est | SE | |
|---|---|---|---|---|
| Trait | WC | 8.57 × 10−7 | −0.005 | 0.001 |
| Trait | WHR | 1.22 × 10−5 | −0.004 | 0.001 |
| Trait | BMI | 1.42 × 10−5 | −0.005 | 0.001 |
| Trait | % Fat | 2.09 × 10−5 | −0.212 | 0.050 |
| Methylation | cg03837903 | 5.04 × 10−6 | 0.028 | 0.006 |
| Methylation | cg04677683 | 2.34 × 10−14 | −0.045 | 0.006 |
| Methylation | cg06200857 | 1.96 × 10−4 | −0.017 | 0.005 |
| Methylation | cg08091561 | 2.57 × 10−4 | −0.020 | 0.005 |
| Methylation | cg09437522 | 5.56 × 10−7 | −0.018 | 0.004 |
| Methylation | cg11480267 | 6.06 × 10−6 | 0.029 | 0.006 |
| Methylation | cg15160445 | 4.60 × 10−16 | −0.047 | 0.006 |
| Methylation | cg23249369 | 1.37 × 10−10 | −0.027 | 0.004 |
| Methylation | cg24203465 | 8.53 × 10−10 | −0.018 | 0.003 |
| Methylation | cg24617313 | 1.12 × 10−16 | −0.159 | 0.019 |
| Methylation | cg25326570 | 2.12 × 10−12 | −0.042 | 0.006 |
| Methylation | cg26102503 | 2.51 × 10−24 | −0.045 | 0.004 |
*Further details on traits are given in Supplementary Table 4
Fig. 5CpGs and complex traits regulated by SNP rs6100212. Upper: this SNP was located upstream of gene PIEZO1P2 and overlapped with H3K27ac and CTCF signals. The SNP was also significant for imbalanced methylation (GIT), but not significant in allelic-specific methylation (ASM) or mQTL (classical additive model) analysis as reported by an independent study. Left bottom: the SNP acted as a cis-POE–mQTL for methylation sites causing a complex imprinting pattern (cg26102503 as an example). Middle and right bottom: the SNP was also shown a regulatory role in waist (Phenome-wide significance), BMI, body fat and WHR (per-trait significance), introducing a similar complex imprinting pattern. Boxplots: centre line, median; box limits, upper and lower quartiles; whiskers, 1.5× interquartile range; points, outliers