| Literature DB >> 28743252 |
James E Barrett1, Andrew Feber2, Javier Herrero2, Miljana Tanic2, Gareth A Wilson2,3, Charles Swanton2,3,4,5, Stephan Beck2.
Abstract
BACKGROUND: Epigenetic heterogeneity within a tumour can play an important role in tumour evolution and the emergence of resistance to treatment. It is increasingly recognised that the study of DNA methylation (DNAm) patterns along the genome - so-called 'epialleles' - offers greater insight into epigenetic dynamics than conventional analyses which examine DNAm marks individually.Entities:
Keywords: Epigenetics; Heterogeneity; Phylogenetics
Mesh:
Year: 2017 PMID: 28743252 PMCID: PMC5526259 DOI: 10.1186/s12859-017-1753-2
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1a An example of a genomic locus (chr1:1,145,478-1,145,614) in which each row corresponds to a sequencing read. Black and white circles represent methylated and unmethylated CpGs respectively. Note that some CpG measurements are missing. b The four epialleles that are inferred from the observed sequencing reads. c The Akaike Information Criterion score versus the total number of epialleles. The inferred number of epialleles corresponds to the minimum AIC score. d The proportion of observed reads attributed to each epiallele after marginalisation over the parameter (see main text for details)
Fig. 2Estimation of tumour sample purity for region 2 of the tumour. The parameter ξ was calculated at all eligible loci across the genome and the empirical distribution is plotted here. The sample purity is equal to the maximum value of ξ which is interpreted to occur at the rightmost maximum at ξ=0.53. The distribution of ξ is ‘smoothed’ due to the fact that at each locus ξ is estimated from a finite sample of sequencing reads
Fig. 3A genomic locus (chr1:2,603,277-2,603,489) composed of seven CpGs. The distribution of five epialleles – inferred using the Bayesian model – are plotted for seven tumour regions (R1 to R7) and one normal sample (N). In a the tumour samples have not been corrected for normal tissue contamination whereas in b they have been. The tumour samples are shifting towards an unmethylated profile in comparison to the normal tissue. The locus lies in a large intronic region in the gene TTC34
Fig. 4a Heatmap of the top 200 most variable epialleles across the seven tumour samples (labelled R1 to R7) and matched normal sample (labelled N). A proportion of 1.0 (dark blue) means that that epiallele accounted for all observed methylation patterns at the corresponding locus. These data have not been decontaminated of normal tissue. b The phylogenetic tree inferred before correction for contaminating normal tissue. In c and d are the same figures for the decontaminated epiallele profiles. In the top annotation track green denotes a CpG island, yellow a shore, and blue otherwise. In the bottom track dark purple denotes a gene promoter, otherwise light pink. A promoter was defined as between 2kb upstream and 50bp downstream from a transcription start site
In the middle column are estimates of tumour purity based on a comparison of epiallele distributions between normal tissue and tumour tissue. The third column contains estimates obtained from a separate study of exome data from the same tumour samples
| Tumour sample | Epiallele purity estimate | Exome purity estimate |
|---|---|---|
| R1 | 35% | 32% |
| R2 | 54% | 51% |
| R3 | 75% | 73% |
| R4 | 53% | 67% |
| R5 | 25% | 28% |
| R6 | <20% | 13% |
| R7 | 30% | 36% |
Fig. 5Box plots of the Shannon entropy of the epiallele distribution across normal tissue (N) and the seven tumour regions (R1–R7)