| Literature DB >> 29202839 |
Xinyi Lin1,2, Ai Ling Teh1, Li Chen1, Ives Yubin Lim1, Pei Fang Tan1, Julia L MacIsaac3, Alexander M Morin3, Fabian Yap4, Kok Hian Tan4, Seang Mei Saw2,5,6, Yung Seng Lee1,7,8, Joanna D Holbrook1,9, Keith M Godfrey10, Michael J Meaney1,11, Michael S Kobor3, Yap Seng Chong1,12, Peter D Gluckman1,13, Neerja Karnani14,15.
Abstract
BACKGROUND: Epigenomes are tissue specific and thus the choice of surrogate tissue can play a critical role in interpreting neonatal epigenome-wide association studies (EWAS) and in their extrapolation to target tissue. To develop a better understanding of the link between tissue specificity and neonatal EWAS, and the contributions of genotype and prenatal factors, we compared genome-wide DNA methylation of cord tissue and cord blood, two of the most accessible surrogate tissues at birth.Entities:
Keywords: DNA methylation; Epigenome-wide association study; Genotype; Neonate; Prenatal factors; Tissue-specificity
Mesh:
Substances:
Year: 2017 PMID: 29202839 PMCID: PMC5715509 DOI: 10.1186/s12916-017-0970-x
Source DB: PubMed Journal: BMC Med ISSN: 1741-7015 Impact factor: 8.775
Fig. 1Infant cord tissue showed more inter-individual variation than infant cord blood: proportion of CpGs that showed inter-individual variation and interquartile range (IQR) in DNA methylation. a Pie chart shows the proportion of CpGs for four distinct categories: (1) CpGs which showed inter-individual variation in both tissues, (2) CpGs which showed inter-individual variation only in infant cord blood, (3) CpGs which showed inter-individual variation only in infant cord tissue, and (4) CpGs which did not show inter-individual variation in either tissue. A total of 239,560 CpGs passed quality control in both datasets. b Plot of proportion of CpGs (vertical axis) in each tissue (out of 239,560 CpGs) with DNA methylation IQR greater than or equal to the value specified on the horizontal axis. c Boxplots show the distribution of the DNA methylation IQR, for CpGs in infant cord tissue (bright orange) and infant cord blood (bright blue), respectively, for each of the four categories. Outliers are not shown in the boxplots. A CpG was defined to show inter-individual variation if the DNA methylation range (maximum–minimum, excluding outliers) was greater than 10% and DNA methylation 99th percentile–1st percentile was greater than 5%
Fig. 2Infant cord tissue is a better surrogate for primary tissues of mesenchymal stem cell (MSC)-derived mesodermic germinal origins, while infant cord blood is a better surrogate for primary tissues of hematopoietic stem cell (HSC)-derived mesodermic germinal origins: hierarchical clustering of GUSTO tissues (cord tissue, cord blood) with 25 primary tissues/cells profiled using reduced representation bisulfite sequencing in the Epigenome Roadmap project. Infant cord tissue clustered more closely with primary tissues of MSC-derived mesodermic germinal origins, while infant cord blood clustered more closely with primary tissues of HSC-derived mesodermic germinal origins. Left panel shows heatmap of DNA methylation values, with each row representing each tissue type and each column representing each CpG. Color changes from yellow to blue as DNA methylation changes from 0% to 100%. Right panel of plot shows dendrogram, with tissue types of ectodermic, endodermic, HSC-derived mesodermic, and MSC-derived mesodermic germinal origins represented in light pink, light purple, light turquoise, and light orange, respectively; GUSTO cord tissue and cord blood are represented in bright orange and bright blue, respectively. DNA methylation values from GUSTO tissues were generated using Infinium 450 K array (for each CpG and tissue type, the median value across all samples was used). For tissues/cells profiled by the Epigenome Roadmap project, only DNA methylation sites with a minimum reads coverage of 30X were retained and reads from both strands were combined. Hierarchical clustering was performed using only CpG sites that passed quality control filtering in GUSTO tissues, were non-missing in at least 10 out of the 25 Epigenome Roadmap samples, and had interquartile range greater than 10% across different Epigenome Roadmap tissues/cells
Fig. 3SNPs explained a greater proportion of inter-individual variation in DNA methylation in infant cord blood (CB) than in infant cord tissue (CT): SNP-associated CpGs detected in each infant tissue. a Pie charts show the percentage of CpGs in each infant tissue whose inter-individual variation could be explained by SNPs (out of all CpGs which showed inter-individual variation in the infant tissue). A CpG whose inter-individual variation could be explained by SNPs (SNP-associated) was defined to be one where the most significant association between the CpG and cis-SNPs (all SNPs on the same chromosome as CpG) attained a P value < 5 × 10–8, the commonly used Bonferroni threshold for genome-wide association studies (corresponding to testing for 106 independent SNPs across the genome at a family-wise Type 1 error rate of 0.05). b Overlap between SNP-associated, non-SNP-associated (but variable), and non-variable CpGs in the two tissues. Only CpGs which showed inter-individual variation in at least one tissue were included (N = 98,124). Examining each tissue separately, each of these 98,124 CpGs can either be SNP-associated, not SNP-associated, or not variable in each tissue. The number of CpGs in each of these three sets in each tissue is shown in the bottom left bar chart (for each tissue the number of CpGs from the three sets will sum to 98,124). Collectively, the 98,124 CpGs can be grouped into eight categories. The bottom right panel identifies each of these eight categories, with the solid black dots representing the sets being considered. For example, the extreme right column identifies the group of CpGs that are SNP-associated in both tissues. The top bar chart shows the number of CpGs in each of these eight categories. For example, 7822 CpGs were SNP-associated in both tissues
Fig. 4Prenatal factors (PFs) explained a similar proportion of inter-individual variation in infant cord blood (CB) and infant cord tissue (CT): CpGs where the inter-individual variation in DNA methylation were explained by PFs. a Heatmap shows the pairwise Spearman correlation (absolute value) between 45 PFs. Each row/column represents each PF. Color changes from white to blue as correlation changes from zero to one. b Pie charts show the percentage of CpGs in each infant tissue whose inter-individual variation could be explained by PFs (out of all CpGs, which showed inter-individual variation in the infant tissue). A CpG whose inter-individual variation could be explained by PFs was defined to be one where the most significant association between the CpG and all 45 PFs attained a P value < 1 × 10–3, the Bonferroni threshold for testing 45 PFs at a family-wise Type 1 error rate of 0.05. c Overlap between PF-associated, non-PF-associated (but variable), and non-variable CpGs in the two tissues. Only CpGs which showed inter-individual variation in at least one tissue were included (N = 98,124). Examining each tissue separately, each of these 98,124 CpGs can either be PF-associated, non-PF-associated, or not variable in each tissue. The number of CpGs in each of these three sets in each tissue is shown in the bottom left bar chart. Collectively, the 98,124 CpGs can be grouped into eight categories. The bottom right panel identifies each of these eight categories, with the solid black dots representing the sets being considered. The top bar chart shows the number of CpGs in each of these eight categories