| Literature DB >> 26936820 |
Amy Rowlatt1, Gustavo Hernández-Suárez2, María Carolina Sanabria-Salas3,4, Martha Serrano-López3,4, Konrad Rawlik1, Eva Hernandez-Illan5, Cristina Alenda6, Adela Castillejo5, Jose Luis Soto5, Chris S Haley1,7, Albert Tenesa8,7.
Abstract
DNA methylation (DNAm) has been linked to changes in chromatin structure, gene expression and disease. The DNAm level can be affected by genetic variation; although, how this differs by CpG dinucleotide density and genic location of the DNAm site is not well understood. Moreover, the effect of disease causing variants on the DNAm level in a tissue relevant to disease has yet to be fully elucidated. To this end, we investigated the phenotypic profiles, genetic effects and regional genomic heritability for 196080 DNAm sites in healthy colorectum tissue from 132 unrelated Colombian individuals. DNAm sites in regions of low-CpG density were more variable, on average more methylated and were more likely to be significantly heritable when compared with DNAm sites in regions of high-CpG density. DNAm sites located in intergenic regions had a higher mean DNAm level and were more likely to be heritable when compared with DNAm sites in the transcription start site (TSS) of a gene expressed in colon tissue. Within CpG-dense regions, the propensity of the DNAm level to be heritable was lower in the TSS of genes expressed in colon tissue than in the TSS of genes not expressed in colon tissue. In addition, regional genetic variation was associated with variation in local DNAm level no more frequently for DNAm sites within colorectal cancer risk regions than it was for DNAm sites outside such regions. Overall, DNAm sites located in different genomic contexts exhibited distinguishable profiles and may have a different biological function.Entities:
Mesh:
Year: 2016 PMID: 26936820 PMCID: PMC5181623 DOI: 10.1093/hmg/ddw072
Source DB: PubMed Journal: Hum Mol Genet ISSN: 0964-6906 Impact factor: 6.150
The number of DNAm sites within six regions defined by physical distance from islands
| Genomic context with relation to CpG density | Island | North shore | South shore | North shelf | South shelf | Sea | Total |
|---|---|---|---|---|---|---|---|
| Number of DNAm sites | 74274 | 27405 | 21158 | 8323 | 7503 | 57417 | 196080 |
As described in the body of the manuscript, north and south shores encompass up to 2 KB upstream and downstream of islands respectively. Regions 2–4 KB upstream and downstream of islands were defined, respectively, as north and south shelves (27). Sea is any DNAm site not annotated as being located within an island, shelf or shore in the 450K manifest file.
Figure 1.Distribution of mean site-specific DNAm level with respect to CpG density. Methylation levels were measured on the M-value scale where a DNAm level of 0 can be interpreted as a 50% methylation level, a DNAm level of <0 and a DNAm level of >0 indicate < and >50% methylation, respectively. The majority of DNAm sites in islands exhibited a low-average methylation level, which was in contrast to the majority of DNAm sites in low-density CG regions (sea) being on average highly methylated.
Figure 2.Mean site-specific DNAm level as a function of distance from the edge of the island. The 4000 BP region upstream (North) and downstream (South) of islands was divided into bins of 100 BP. The average of the mean site-specific DNAm levels for DNAm sites residing within each bin is shown as a circle enclosed by a line indicating ±2 standard error of the mean estimate. A shore is up to 2000 BP from an island and a shelf is between 2000 and 4000BP from an island.
The number of DNAm sites within each of the eight contextual groups
| Genomic context | Island | Sea |
|---|---|---|
| TSS expressed | 13838 | 2074 |
| TSS not expressed | 19052 | 7603 |
| Intragenic | 31356 | 30106 |
| Intergenic | 10028 | 17634 |
| Total | 74274 | 57417 |
Figure 3.Distribution of mean site-specific DNAm level for eight contextual groups.
Figure 4.Moments of the distributions of mean site-specific DNAm level for eight contextual groups.
Figure 5.Distribution of the variance of each DNAm site used in analysis (n = 196080).
Figure 6.Distribution of the estimated heritability for DNAm sites significantly associated with local genetic variation. Each bar represents a range of 0.05.
Figure 7.The number of SNPs ±1 MB surrounding a DNAm site.
Figure 8.Distribution of mean-site specific heritability for DNAm level in genes expressed in whole colorectal biopsies and colon epithelial cells. DNAm sites have a significant heritability if P < 0.05. Genes expressed in both the epithelial and WCB were removed from the WCB group for this analysis.
Proportion of heritable DNAm sites and the corresponding mean heritability
| Island | Sea | |||
|---|---|---|---|---|
| Proportion heritable | Mean, | Proportion heritable | Mean, | |
| TSS expressed | 0.066 | 0.275 | 0.117 | 0.288 |
| TSS not expressed | 0.084 | 0.281 | 0.117 | 0.293 |
| Intragenic | 0.089 | 0.283 | 0.100 | 0.290 |
| Intergenic | 0.129 | 0.308 | 0.123 | 0.303 |
| Total | 0.089 | 0.286 | 0.110 | 0.295 |
Overall there was a higher proportion of heritable DNAm sites located in the sea compared with islands. Additionally, there was a higher proportion of heritable DNAm sites located in intergenic regions compared with regions containing a TSS and intragenic regions. The average heritability estimates were similar across the contextual groups.
Figure 9.The mean genetic variance and the proportion of heritable DNAm sites for the eight contextual groups. The x-axis value represents the proportion of heritable DNAm sites within each contextual group and the average genetic variance for each contextual group.
Significant associations from SNP by SNP GWAS for 196080 DNAm sites
| Threshold | ||||||
|---|---|---|---|---|---|---|
| Count | Count expected | Count DNAm sites | Count | Count expected | Count DNAm sites | |
| 5 × 10−2 | 6 027 276 | 5 572 851 | 195 792 | |||
| 5 × 10−4 | 184 978 | 55 729 | 55 532 | |||
| 5 × 10−8 | 17 712 | 5.57 | 4050 | 1 519 534 | 6690 | 72544 |
| 5 × 10−12 | 9,495 | 5.57 × 10−4 | 1412 | 236 577 | 6.69 × 10−1 | 16474 |
| 5 × 10−20 | 1037 | 5.57 × 10−12 | 343 | 23 072 | 6.69 × 10−9 | 1786 |
| 5 × 10−40 | 18 | 5.57 × 10−32 | 8 | 706 | 6.69 × 10−29 | 47 |
| Max | 0.884 | 0.8717 | ||||
| Min | 1.40 × 10−62 | 1.96 × 10−58 | ||||
| Total SNPs Tested | 111558037 | 1.34 × 1011 | ||||
The total number of significant associations (count) and number of expected significant associations based on the specified threshold and the total number of SNPs tested (count expected) is reported. The number of DNAm sites with at least one significant association is given (count DNAm sites). SNPs were binned with respect to distance from the DNAm site (cis ≤ ±1 MB and trans > ±1 MB).
Figure 10.Proportion of the regional heritability that can be explained by the top SNP association. The variance explained by a SNP (R2) divided by the regional heritability estimate for DNAm sites with a regional heritability estimate significant at P < 0.05. Only the most significant SNP within the local region was considered. In 4.2% of cases the SNP explained more variance in the DNAm level than did the region.
Effects of rs4925386 and local genetic variation on cg15193198 and cg24112000
| SNP effect | SNP effect, SE | |||||||
|---|---|---|---|---|---|---|---|---|
| cg15193198 | 0.307 | 1.66 × 10−3 | 0.025 | 0.411 | 0.301 | 2.19 × 10−3 | −0.438 | 0.079 |
| cg24112000 | 0.625 | 5.51 × 10−12 | 0.281 | 1.44 × 10−2 | 0.629 | 8.47 × 10−12 | −0.576 | 0.090 |
The regional heritability estimate ( for cg15193198 and cg24112000 including rs4925386. The estimate for the full model ( full) and reduced model ( reduced) were calculated from all SNPs ≤ ±1 MB of the DNAm site excluding rs4925386. The full model included fitting the genotypes at rs4925386 as a fixed effect. The effect of rs4925386 on the DNAm level is reported as the addition of a single copy of the minor allele, adenine.