| Literature DB >> 31552329 |
Jason B Nikas1, Emily G Nikas2.
Abstract
Prostate cancer is the most prevalent and the second most lethal malignancy among males in the United States of America. Its diagnosis is almost entirely predicated upon histopathological analysis of the biopsied tissue, and it is associated with a substantial average error. Using genome-wide DNA methylation data derived from 469 prostatic tumor tissue samples and 50 normal prostatic tissue samples and interrogating over 485 000 CpG sites per sample (spanning across gene promoters, CpG islands, shores, shelves, gene bodies, and intergenic and other areas), we were able to develop a mathematical model that classified with a high accuracy (overall sensitivity = 95.31% and overall specificity = 94.00%) tumor tissue versus normal tissue. The methylation β values of five CpG sites, corresponding to the genes LINC01091, RPS15, SNORA10, and two unknown DNA areas in chromosome 1, provided the input to the model. The model was validated with unknown samples, as well as with a sixfold cross-validation and a leave-one-out cross-validation. This study presents a novel genomic model based on genome-wide DNA methylation analysis of biopsied prostatic tissue that could aid in the diagnosis of prostate cancer and help advance the transition to genomic medicine.Entities:
Year: 2019 PMID: 31552329 PMCID: PMC6751714 DOI: 10.1021/acsomega.9b01613
Source DB: PubMed Journal: ACS Omega ISSN: 2470-1343
DNA Methylation Analysis Results of the Five Input Variables of the Model Based on the Data of the Training Seta
| var. | gene symbol | gene ID | gen. coord. | SDT | SDN | FC | |||
|---|---|---|---|---|---|---|---|---|---|
| 6209 | 1440293 | 0.3830 | 0.1344 | 0.7145 | 0.0958 | –1.8654 | 2.41 × 10–19 | ||
| 574042 | 2012763 | 0.3970 | 0.1302 | 0.6984 | 0.0960 | –1.8046 | 1.28 × 10–19 | ||
| 285419 | 124694137 | 0.3255 | 0.1494 | 0.6525 | 0.0905 | –2.0048 | 4.74 × 10–19 | ||
| unknown | 220697615 | 0.2809 | 0.1618 | 0.6273 | 0.1007 | –2.2332 | 1.03 × 10–18 | ||
| unknown | 201509316 | 0.6082 | 0.2435 | 0.0794 | 0.1274 | 7.6578 | 9.10 × 10–19 |
The FC was calculated as follows: R = MT/MN. If R ≥ 1, FC = R; if R < 1, FC = −1/R.
Figure 1(A) ROC AUC curve of the model F in the training phase. (B) Combination graph (box plot, density plot, and dot plot) of the model F in the training phase. Red circles denote statistical outliers. 249 × 111 mm (300 × 300 DPI).
Figure 23D scatter plot illustrating the overall performance of the model F 223 × 183 mm (300 × 300 DPI).
Figure 3(A) ROC AUC curve of the model F (overall performance). (B) Combination graph (box plot, density plot, and dot plot) of the model F (overall performance). Red circles denote statistical outliers. 263 × 115 mm (300 × 300 DPI).
DNA Methylation Analysis Results of the Five Input Variables of the Model Based on the Combined Data of the Training and the Validation Setsa
| var. | gene symbol | gene ID | gen. coord. | SDT | SDN | FC | |||
|---|---|---|---|---|---|---|---|---|---|
| 6209 | 1440293 | 0.3865 | 0.1315 | 0.7077 | 0.1038 | –1.8308 | 2.79 × 10–26 | ||
| 574042 | 2012763 | 0.3928 | 0.1298 | 0.6908 | 0.1045 | –1.7586 | 7.15 × 10–26 | ||
| 285419 | 124694137 | 0.3293 | 0.1495 | 0.6534 | 0.1053 | –1.9843 | 7.54 × 10–26 | ||
| unknown | 220697615 | 0.2821 | 0.1588 | 0.6171 | 0.1218 | –2.1876 | 6.45 × 10–25 | ||
| unknown | 201509316 | 0.6055 | 0.2369 | 0.0766 | 0.1194 | 7.9034 | 6.26 × 10–27 |
The FC was calculated as follows: R = MT/MN. If R ≥ 1, FC = R; if R < 1, FC = −1/R.
Figure 4PCA graph (PC1 scores vs PC2 scores). 271 × 205 mm (300 × 300 DPI).