| Literature DB >> 32019269 |
Chunyu Wang1, Ning Zhao2, Linlin Yuan3, Xiaoyan Liu1.
Abstract
Breast cancer is the most common female malignancy. It has high mortality, primarily due to metastasis and recurrence. Patients with invasive and noninvasive breast cancer require different treatments, so there is an urgent need for predictive tools to guide clinical decision making and avoid overtreatment of noninvasive breast cancer and undertreatment of invasive cases. Here, we divided the sample set based on the genome-wide methylation distance to make full use of metastatic cancer data. Specifically, we implemented two differential methylation analysis methods to identify specific CpG sites. After effective dimensionality reduction, we constructed a methylation-based classifier using the Random Forest algorithm to categorize the primary breast cancer. We took advantage of breast cancer (BRCA) HM450 DNA methylation data and accompanying clinical data from The Cancer Genome Atlas (TCGA) database to validate the performance of the classifier. Overall, this study demonstrates DNA methylation as a potential biomarker to predict breast tumor invasiveness and as a possible parameter that could be included in the studies aiming to predict breast cancer aggressiveness. However, more comparative studies are needed to assess its usability in the clinic. Towards this, we developed a website based on these algorithms to facilitate its use in studies and predictions of breast cancer invasiveness.Entities:
Keywords: Breast cancer; DNA methylation; Invasiveness; Metastasis
Year: 2020 PMID: 32019269 PMCID: PMC7072524 DOI: 10.3390/cells9020326
Source DB: PubMed Journal: Cells ISSN: 2073-4409 Impact factor: 6.600
Figure 1Schematic diagram of the presented stepwise analysis. Step 1, differential methylation analysis: Two methods were carried out to select the differential methylation sites. Step 2, dimensionality reduction: using four methods to reduce the dimension. Step 3, four kinds of dimensionality reduction results were used to construct the classifier respectively. Step 4, enrichment analysis: Hypergeometric test evaluated and compared the performances of the classifiers.
The number of selected CpG sites and the performance of four classifiers for DNA methylation profiles.
| Normal | Invasiveness | ||
|---|---|---|---|
|
| Number of CpG | 14 | 134 |
| Training Accuracy | 97% | 93.6% | |
| Testing Accuracy | 96.9% | 549/217 | |
|
| Number of CpG | 12 | 5 |
| Training Accuracy | 99% | 100% | |
| Testing Accuracy | 96.9% | 611/165 | |
|
| Number of CpG | 8 | 3 |
| Training Accuracy | 99% | 95% | |
| Testing Accuracy | 91% | 454/312 | |
|
| Number of CpG | 80 | 60 |
| Training Accuracy | 99% | 93.3% | |
| Testing Accuracy | 94.8% | 664/102 |
Figure 2Four enrichment analyses results of clinical indicators in the two tumor clusters. Red indicates samples predicted by the classifier as invasive, and green indicates samples predicted to be non-invasive. The X axis represents clinical indicators; the Y axis represents the enrichment ratio of Table 0. ** is extremely significant ().
Figure 3Unsupervised clustering and heatmap of 20 samples based on the 134 differently methylated probes. Each column corresponds to a sample and each row corresponds to a CpG site. Color indicates methylation value. With color ranging from blue to red, methylation values range from small to large. Color key is to the right.
Known metastasis-associated genes and their descriptions in literatures.
| Gene | Description |
|---|---|
|
| |
|
| Functions as a suppressor of colorectal cancer metastasis by down-regulating the |
|
| “ |
|
| “This study for the first time identifies |
|
| “The role of |
|
| “In migrating cancer stem cells isolated from primary human colorectal cancers, |
|
| “Thyroid hormone receptors induce TRAIL expression, and TRAIL thus synthesized acts in concert with simultaneously synthesized Bcl-xL to promote metastasis” |
|
| “ |
|
| |
|
| “ |
|
| “ |
|
|
Gene is gene symbol; “…” This is a direct quote from the corresponding literature.