| Literature DB >> 28412973 |
Marie K Kirby1,2, Ryne C Ramaker1,3, Brian S Roberts1, Brittany N Lasseigne1, David S Gunther1,4, Todd C Burwell1,5, Nicholas S Davis1,6, Zulfiqar G Gulzar7,8, Devin M Absher1, Sara J Cooper1, James D Brooks9, Richard M Myers10.
Abstract
BACKGROUND: Current diagnostic tools for prostate cancer lack specificity and sensitivity for detecting very early lesions. DNA methylation is a stable genomic modification that is detectable in peripheral patient fluids such as urine and blood plasma that could serve as a non-invasive diagnostic biomarker for prostate cancer.Entities:
Keywords: Biomarker; DNA methylation; Diagnostic; EZH2; Prostate cancer
Mesh:
Substances:
Year: 2017 PMID: 28412973 PMCID: PMC5392915 DOI: 10.1186/s12885-017-3252-2
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Clinical data for patients used in this study
| Training cohort | Testing cohort (TCGA) | |
|---|---|---|
| Patients [n] | ||
| Tumor tissues | 73 | 213 |
| benign-adjacent tissues | 63 | 49 |
| patient-matched tissues | 52 | 49 |
| Age | ||
| Mean [years] | 59.9 | 60.4 |
| Median [years] | 61 | 61 |
| Range age [years] | 43–73 | 43–75 |
| Preoperative PSA [ng/mL] | ||
| Range | 0.94–42 | 1.6–87 |
| Mean | 6.8 | 10.9 |
| Median | 5.62 | 7.4 |
| < 4 [n] | 15 | 19 |
| 4–10 [n] | 44 | 100 |
| > 10 [n] | 9 | 55 |
| unknown | 5 | 39 |
| Gleason Grade [n] | ||
| (<7) | 16 | 15 |
| (3 + 4) | 40 | 84 |
| (4 + 3) | 10 | 50 |
| 8 | 4 | 25 |
| (>8) | 2 | 39 |
| unknown | 1 | 0 |
| T Category | ||
| T2 | 2 | NA |
| T2a | 3 | 8 |
| T2b | 50 | 2 |
| T2c | NA | 82 |
| T3a | 8 | 71 |
| T3b | 7 | 43 |
| T4 | 1 | 5 |
| unknown | 2 | 2 |
| Nodal Status | ||
| N0 | 66 | 160 |
| N1 | 5 | 22 |
| unknown | 2 | 31 |
Fig. 1a Histogram of differentially methylated CpGs (LME, FDR < 0.05). Blue represents CpGs that have significantly higher methylation in benign-adjacent prostate tissue when compared to prostate cancer tissues (73,912 CpGs), and red represents CpGs that have significantly higher methylation in prostate cancer tissues (152,324 CpGs). b Heatmap of the top 10,000 CpGs with the most statistically significant DNA methylation differences between unaffected prostate tissue and prostate cancer tissue based on LME p-value. Color bar represents beta score with 0.5 subtracted
Genomic regions of differentially methylated CpGs
| Genomic location | Sig. CpGs more methylated in tumor | Sig. CpGs less methylated in tumor | Fisher’s exact |
|---|---|---|---|
| A. Island versus non-island | |||
| CG Island | 93,536 | 35,639 | 3.44E-154 |
| Non-CG Island | 58,787 | 38,273 | |
| B. Regulatory region versus gene body (All significant CpGs) | |||
| Regulatory Region | 55,029 | 23,056 | 0.49 |
| Gene Body | 46,502 | 19,858 | |
| C. Regulatory Region Versus Gene Body (Top 10,000 most significant CpGs) | |||
| Regulatory Region | 3165 | 776 | 1.50E-02 |
| Gene Body | 1902 | 700 | |
Regulatory Region = promoter, first exon, first intron
Gene Body = other exon, other intron, 3′ proximal
CG Island = CG islands, CG shelves, and CG shores
A) Analysis of CpGs in islands versus non-islands. B) Analysis of CpGs in gene bodies versus gene regulatory regions for all significant CpGs. C) Analysis of CpGs in gene bodies versus gene regulatory regions for the top 10,000 most significant CpGs
Fig. 2Overlap of top 10,000 most significant (LME p-value) DNA methylation sites in gene regulatory regions and higher methylation in prostate cancer tissues with ENCODE transcription factor binding sites highlights the role EZH2 plays in prostate cancer. a Barplot showing the relative percent of ENCODE transcription factor binding sites containing significant methylation changes. Dashed red lines represent the upper and lower 95% confidence intervals generated from enrichment values of randomly selected methylation sites. b Pie charts demonstrating the directionality of significant DNA methylation sites and gene expression levels within 1 kb of EZH2 binding sites
Fig. 3Boxplots of CpGs in the top diagnostic models. Normal data is from benign-adjacent tissues and Tumor Data is from patient cancer tissues
Fig. 4ROC curve and waterfall plots for performance of the top 3 CpG diagnostic model in a training and b validation datasets. The value of the classifier is given by 6.52–17.04*cg00054525 + 24.18*cg16794576–13.82*cg24581650, where the intercept and coefficients have been regressed by a binomial generalized linear model. A threshold value of this classifier was chosen to yield maximal non-unity specificity in the training set. The red dot on the ROC curve corresponds to the sensitivity and specificity of the classifier at the chosen threshold. The dashed line on the waterfall plots is drawn at the chosen threshold value of the classifier