| Literature DB >> 24709820 |
Ai Ling Teh1, Hong Pan2, Li Chen1, Mei-Lyn Ong1, Shaillay Dogra1, Johnny Wong1, Julia L MacIsaac3, Sarah M Mah3, Lisa M McEwen3, Seang-Mei Saw4, Keith M Godfrey5, Yap-Seng Chong6, Kenneth Kwek7, Chee-Keong Kwoh8, Shu-E Soh9, Mary F F Chong6, Sheila Barton5, Neerja Karnani1, Clara Y Cheong1, Jan Paul Buschdorf1, Walter Stünkel1, Michael S Kobor3, Michael J Meaney10, Peter D Gluckman11, Joanna D Holbrook1.
Abstract
Integrating the genotype with epigenetic marks holds the promise of better understanding the biology that underlies the complex interactions of inherited and environmental components that define the developmental origins of a range of disorders. The quality of the in utero environment significantly influences health over the lifecourse. Epigenetics, and in particular DNA methylation marks, have been postulated as a mechanism for the enduring effects of the prenatal environment. Accordingly, neonate methylomes contain molecular memory of the individual in utero experience. However, interindividual variation in methylation can also be a consequence of DNA sequence polymorphisms that result in methylation quantitative trait loci (methQTLs) and, potentially, the interaction between fixed genetic variation and environmental influences. We surveyed the genotypes and DNA methylomes of 237 neonates and found 1423 punctuate regions of the methylome that were highly variable across individuals, termed variably methylated regions (VMRs), against a backdrop of homogeneity. MethQTLs were readily detected in neonatal methylomes, and genotype alone best explained ∼25% of the VMRs. We found that the best explanation for 75% of VMRs was the interaction of genotype with different in utero environments, including maternal smoking, maternal depression, maternal BMI, infant birth weight, gestational age, and birth order. Our study sheds new light on the complex relationship between biological inheritance as represented by genotype and individual prenatal experience and suggests the importance of considering both fixed genetic variation and environmental factors in interpreting epigenetic variation.Entities:
Mesh:
Year: 2014 PMID: 24709820 PMCID: PMC4079963 DOI: 10.1101/gr.171439.113
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.043
Ethnicity, sex, and in utero environmental exposures of subjects
Figure 1.(A) Unfiltered genotypes organize subjects by ethnicity. Principal component 1 (x-axis) plotted against principal component 2 (y-axis) from principal component analysis of genotypes for all 708,365 heterologous SNPs across 237 subjects. Subjects are colored by self-reported ethnicity. (B) Unfiltered methylomes do not organize subjects by ethnicity. Principal component 1 (x-axis) plotted against principal component 2 (y-axis) for principal component analysis of methylation levels at all 301,468 variable CpGs across 237 subjects. Subjects are colored by self-reported ethnicity.
Figure 2.(A) The strength of association between genotype and methylation levels is a continuum, with most VMR-CpGs showing some association with genotype. Scatter plot of Pearson R2 (x-axis) for the VMR-CpG and best SNP match against log2 of the number of regression P-values in the first bin of 1000 equally distributed bins (y-axis); the red line represents an absolutely even 708 regression P-value in the first bin. (Black) Disrupting pairs, (dark gray) cis pairs, (light gray) trans pairs. (B) Some VMR-CpGs are minimally influenced by genotype. Manhattan plot of methylation at one VMR-CpG against all SNPs (x-axis) with −log10 P-value on the y-axis, as an example of VMR-CpG with low R2 and a low number of P-values in the first bin. (C) Scatter plot of genotype (x-axis) against methylation (y-axis) for the top pair from the same VMR-CpGs as in B. (D) Some VMR-CpGs are moderately influenced by genotype. Manhattan plot of methylation at one VMR-CpG against all SNPs (x-axis) with −log10 P-value on the y-axis, as an example of VMR-CpG with moderate R2 and a moderate number in the first bin. (E) Scatter plot of genotype (x-axis) against methylation (y-axis) for the top pair from the same VMR-CpGs as in D. (F) Manhattan plot of CpG against all SNPs (x-axis) with −log10 of the P-value (y-axis), as an example of VMR-CpG with high R2 and a high number in the first bin. (G) Scatter plot of genotype (x-axis) against methylation (y-axis) for the top pair from the same CpG as in F.
Figure 3.The strength of association between genotype and methylation is strongest for disrupting pairs and weakest for trans pairs. Box plot of –log10 of the P-value of the association between genotype and methylation levels at each VMR-CpG, for CpG–SNP pair categories disrupting (SNP is within CpG), cis (SNP is on same chromosome as CpG), and trans (SNP is on a different chromosome to the CpG).
Figure 4.Cis pairs tend toward short distances between the SNP and CpG. Bar chart of –log10 of the P-value (y-axis) against the chromosomal distance between the SNP and CpG (x-axis) for cis pairs within 5 kb.
Figure 5.(A) The majority of VMR-CpGs are best explained by G × E models. Pie chart showing the proportions of 1423 VMR-CpGs, which are best explained by the genotype (G), environment, or interaction between gene and environment (G × E) regression models. (B) Genotype tends to be a narrow winner. Stacked histogram of deltas between delta AICs for best and next-best model across 1423 VMR-CpGs. Each box is colored to denote the model that best explained methylation levels at the VMR-CpG. (C) The models explain the range of variation at VMR-CpGs. Stacked histograms of adjusted R2 of the winning model across all 1423 VMR-CpGs. Each box is colored by the winning model. (D) The proportion of VMR-CpGs explained by G × E is stable as model confidence increases. Pie chart showing the proportions of 210 VMR-CpGs that were best explained by the genotype, environment, or G × E regression models with no substantial support for the next-best model (Δ > 2) and adjusted R2 > 0.4.
Figure 6.Association of methylation with environment in one genotypic group. Examples of VMR-CpGs whose methylation levels are significantly associated with phenotype in only one genotypic group. Phenotypic values are shown on the x-axis, and methylation value in percentages on the y-axis. (Left) Data for all samples. Samples are colored by their genotypic group (red indicates AA; blue, AB; black, BB), and a straight fit line is fitted for each group. (Right) Genotypic subgroup with the highest R2.
Figure 7.VMR-CpGs with larger ranges of methylation values are more likely to be MethQTLs. Scatter plot showing the range of methylation values at each CpG across samples (x-axis) compared with the strength of association between the VMR-CpG methylation values and the genotype of the best SNP (y-axis).