| Literature DB >> 28944917 |
Ling Guo1, Kun Zhang1, Zhitong Bing2.
Abstract
Breast cancer metastasis is a demanding problem in clinical treatment of patients with breast cancer. It is necessary to examine the mechanisms of metastasis for developing therapies. Classification of the aggressiveness of breast cancer is an important issue in biological study and for clinical decisions. Although aggressive and non‑aggressive breast cancer cells can be easily distinguished among different cell lines, it is very difficult to distinguish in clinical practice. The aim of the current study was to use the gene expression analysis from breast cancer cell lines to predict clinical outcomes of patients with breast cancer. Weighted gene co‑expression network analysis (WGCNA) is a powerful method to account for correlations between genes and extract co‑expressed modules of genes from large expression datasets. Therefore, WGCNA was applied to explore the differences in sub‑networks between aggressive and non‑aggressive breast cancer cell lines. The greatest difference topological overlap networks in both groups include potential information to understand the mechanisms of aggressiveness. The results show that the blue and red modules were significantly associated with the biological processes of aggressiveness. The sub‑network, which consisted of TMEM47, GJC1, ANXA3, TWIST1 and C19orf33 in the blue module, was associated with an aggressive phenotype. The sub‑network of LOC100653217, CXCL12, SULF1, DOK5 and DKK3 in the red module was associated with a non‑aggressive phenotype. In order to validate the hazard ratio of these genes, the prognostic index was constructed to integrate them and examined using data from the Cancer Genomic Atlas (TCGA) and Gene Expression Omnibus (GEO) databases. Patients with breast cancer from TCGA in the high‑risk group had a significantly shorter overall survival time compared with patients in the low‑risk group (hazard ratio=1.231, 95% confidence interval=1.058‑1.433, P=0.0071, by the Wald test). A similar result was produced from the GEO database. The findings may provide a novel strategy for measuring cancer aggressiveness in patients with breast cancer.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28944917 PMCID: PMC5779881 DOI: 10.3892/mmr.2017.7608
Source DB: PubMed Journal: Mol Med Rep ISSN: 1791-2997 Impact factor: 2.952
All samples of aggressive and non-aggressive breast cancer cell lines.
| A, Non-aggressive | |
|---|---|
| GEO no. | Cell line name |
| GSM1067677 | MCF7 |
| GSM1230317 | MCF7 |
| GSM1230347 | BT474 |
| GSM1273928 | MCF7 |
| GSM1273929 | MCF7 |
| GSM1298685 | MCF7 |
| GSM1298686 | MCF7 |
| GSM1298687 | MCF7 |
| GSM1374661 | MDA-MB-453 |
| GSM156771 | MCF10A |
| GSM212661 | MCF7 |
| GSM286756 | MCF7 |
| GSM286757 | MCF7 |
| GSM286758 | MCF7 |
| GSM286762 | MCF7 |
| GSM286763 | MCF7 |
| GSM286764 | MCF7 |
| GSM286768 | MCF7 |
| GSM286769 | MCF7 |
| GSM286770 | MCF7 |
| GSM297803 | MCF7 |
| GSM436499 | MCF7 |
| GSM436500 | MCF7 |
| GSM436501 | MCF7 |
| GSM678802 | MCF7 |
| GSM678803 | MCF7 |
| GSM678804 | MCF7 |
| GSM699776 | MCF7 |
| GSM699777 | MCF7 |
| GSM803623 | MCF7 |
| GSM803682 | MCF7 |
| GSM803741 | MCF7 |
| GSM820808 | HMEC |
| GSM820809 | HMEC |
| GSM820810 | HMEC |
| GSM984494 | BT474 |
| GSM984498 | MCF7 |
| GSM984499 | SKBR3 |
| B, Aggressive | |
| GEO no. | Cell line name |
| GSM1374510 | HCC202 |
| GSM1374550 | Hs578T |
| GSM573291 | MDA-MB-231 |
| GSM573292 | MDA-MB-231 |
| GSM573293 | MDA-MB-231 |
| GSM596523 | MDA-MB-231 |
| GSM596524 | MDA-MB-231 |
| GSM596525 | MDA-MB-231 |
| GSM803625 | MDA-MB-231 |
| GSM803626 | MDA-MB-435 |
| GSM803684 | MDA-MB-231 |
| GSM803685 | MDA-MB-435 |
| GSM803744 | MDA-MB-435 |
| GSM820814 | MDA-MB-231 |
| GSM820815 | MDA-MB-231 |
| GSM820816 | MDA-MB-231 |
| GSM839353 | MDA-MB-231 |
| GSM839354 | MDA-MB-231 |
| GSM839355 | MDA-MB-231 |
| GSM843477 | BT549 |
| GSM843478 | BT549 |
| GSM843479 | BT549 |
| GSM870207 | MDA-MB-231 |
| GSM870208 | MDA-MB-231 |
| GSM870209 | MDA-MB-231 |
| GSM870210 | MDA-MB-231 |
| GSM984501 | Hs578T |
GEO, gene expression omnibus.
Figure 1.Network analysis of gene expression in aggressive and non-aggressive. Bistinct modules of co-expressed genes in (A) aggressive and (B) non-aggressive breast cancer cell lines. Dendrograms produced by average linkage hierarchical clustering of 1,811 genes bases on topological overlap. Modules were assigned colors as an indicator in the horizontal bar beneath the aggressive dendrogram. The bottom color bar represents the module after merging modules. Classical multidimensional scaling plots in three dimensions depict the relative size and cohesion of modules in aggressive and non-aggressive group.
Figure 2.Connectivity of aggressive and non-aggressive breast cancer comparison. (A) The slope of the curve changes greatly at 12 in aggressive breast cancer cells. Power=12 led to the aggressive network satisfying scale-free topology. (B) The slope of the curve changes greatly at 6 in non-aggressive breast cancer cells. Power=6 led to non-aggressive network satisfying scale-free topology. (C) Power=12 was used for the aggressive group and depicted the scale-free topology and (D) power=6 was used for the non-aggressive group and depicted the scale-free topology. The black curve corresponds to scale-free topology and the red curve corresponds to truncated scale-free topology. (E) Spearman's rank correlation was used for comparing network connectivity between aggressive and non-aggressive. The value of Spearman's rank correlation (rho) is 0.16 and P=5.133×10−12.
List of the top GO terms in the most significant the Database for Annotation, Visualization and Integrated Discovery functional clusters for each network module.
| A, Aggressive breast cancer cell lines | |||
|---|---|---|---|
| Top five terms | No. of genes in ME | P-value | FDR |
| Blue module | 363 | ||
| GO:0001501:skeletal system development | 1.05×10−8 | 1.83×10−5 | |
| GO:0007155:cell adhesion | 7.15×10−8 | 1.25×10−4 | |
| GO:0022610:biological adhesion | 7.39×10−7 | 1.29×10−3 | |
| GO:0001568:blood vessel development | 3.27×10−6 | 5.70×10−3 | |
| GO:0001944:vasculature development | 3.73×10−6 | 6.50×10−3 | |
| Brown module | 359 | ||
| GO:0048545:response to steroid hormone stimulus | 5.91×10−9 | 9.53×10−6 | |
| GO:0008285:negative regulation of cell proliferation | 6.08×10−9 | 9.82×10−6 | |
| GO:0009725:response to hormone stimulus | 1.27×10−6 | 2.04×10−3 | |
| GO:0042127:regulation of cell proliferation | 6.36×10−6 | 1.02×10−2 | |
| GO:0009719:response to endogenous stimulus | 8.54×10−5 | 1.39×10−1 | |
| Green module | 183 | ||
| GO:0007167:enzyme linked receptor protein signaling pathway | 4.76×10−4 | 7.57×10−1 | |
| GO:0001525:angiogenesis | 6.42×10−4 | 1.02 | |
| GO:0009611:response to wounding | 7.10×10−4 | 1.13 | |
| GO:0048514:blood vessel morphogenesis | 1.04×10−3 | 1.64 | |
| GO:0001568:blood vessel development | 1.8×10−3 | 2.85 | |
| Red module | 74 | ||
| GO:0030030:cell projection organization | 4.04×10−5 | 6.34×10−2 | |
| GO:0034329:cell junction assembly | 1.37×10−4 | 2.14×10−1 | |
| GO:0006928:cell motion | 2.24×10−4 | 3.50×10−1 | |
| GO:0034330:cell junction organization | 2.30×10−3 | 3.54 | |
| GO:0000904:cell morphogenesis involved in differentiation | 2.49×10−3 | 3.83 | |
| Turquoise module | 196 | ||
| GO:0046907:intracellular transport | 2.89×10−5 | 5.04×10−2 | |
| GO:0016192:vesicle-mediated transport | 1.72×10−4 | 2.99×10−1 | |
| GO:0051270:regulation of cell motion | 3.15×10−4 | 5.46×10−1 | |
| GO:0001701:in utero embryonic development | 4.69×10−4 | 8.14×10−1 | |
| GO:0010033:response to organic substance | 4.69×10−4 | 8.14×10−1 | |
| Yellow module | 191 | ||
| GO:0048732:gland development | 1.61×10−5 | 2.61×10−2 | |
| GO:0042981:regulation of apoptosis | 1.83×10−5 | 2.97×10−2 | |
| GO:0043067:regulation of programmed cell death | 1.92×10−5 | 3.11×10−2 | |
| GO:0010941:regulation of cell death | 2.61×10−4 | 4.21×10−1 | |
| GO:0009611:response to wounding | 5.23×10−4 | 8.43×10−1 | |
| B, Non-aggressive breast cancer cell lines | |||
| Top five terms | No. of genes in ME | P-value | FDR |
| Blue module | 374 | ||
| GO:0006796:phosphate metabolic process | 2.79×10−4 | 4.80×10−1 | |
| GO:0006793:phosphorus metabolic process | 2.79×10−4 | 4.80×10−1 | |
| GO:0000075:cell cycle checkpoint | 2.91×10−4 | 4.99×10−1 | |
| GO:0010033:response to organic substance | 4.34×10−4 | 7.43×10−1 | |
| GO:0046907:intracellular transport | 5.63×10−4 | 9.65×10−1 | |
| Brown module | 90 | ||
| GO:0007178:transmembrane receptor protein serine/threonine kinase signaling pathway | 6.48×10−3 | 9.26 | |
| GO:0051789:response to protein stimulus | 7.02×10−3 | 10.24 | |
| GO:0009615:response to virus | 7.58×10−3 | 10.75 | |
| GO:0030509:BMP signaling pathway | 1.11×10−2 | 15.84 | |
| GO:0006955:immune response | 1.29×10−2 | 17.61 | |
| Turquoise module | 1,345 | ||
| GO:0007155:cell adhesion | 8.22×10−12 | 1.49×10−8 | |
| GO:0022610:biological adhesion | 8.85×10−12 | 1.61×10−8 | |
| GO:0009611:response to wounding | 1.83×10−11 | 3.32×10−8 | |
| GO:0048732:gland development | 3.85×10−11 | 6.98×10−8 | |
| GO:0001568:blood vessel development | 4.20×10−11 | 7.63×10−8 | |
GO, gene ontology; FDR, false discovery rate; ME, module eigengene.
Figure 3.Visualization of specific network of blue and red modules in metastasis and non-metastasis breast cancer. (A) The light blue nodes represent specific nodes and linkage in blue module network of aggressive breast cancer. The dark blue nodes represent the overlap between metastasis and non-aggressive breast cancer. (B) The light blue nodes represent specific nodes in non-aggressive breast cancer. The dark blue nodes represent the overlap between aggressive and non-aggressive breast cancer. (C) The light red nodes represent the specific nodes and linkage in red module network of aggressive breast cancer. The dark red nodes represent the overlap between aggressive and non-aggressive breast cancer. (D) The light red nodes represent the specific nodes in non-metastasis breast cancer. And the dark red nodes represent the overlap between aggressive and non-aggressive breast cancer.
List of top five genes with high k as hub genes in blue and red modules.
| A, Blue module of aggressive group | |||
|---|---|---|---|
| Gene symbol | Accession of uniprot | Gene name | |
| Q9BQJ4 | Transmembrane protein 47 | 1.000 | |
| P36383 | Gap junction γ-1 protein | 0.929 | |
| P12429 | Annexin A3 | 0.925 | |
| Q15672 | Twist-related protein 1 | 0.917 | |
| Q9GZP8 | Immortalization upregulated protein | 0.905 | |
| B, Red module of non-aggressive group | |||
| Gene symbol | Accession of uniprot | Gene name | |
| Neurotrimin-like | 1.000 | ||
| P48061 | Stromal cell-derived factor 1 | 0.958 | |
| Q8IWU6 | Sulfatase 1 | 0.936 | |
| Q9P104 | Docking protein 5 | 0.819 | |
| Q9UBP4 | Dickkopf-related protein 3 | 0.782 | |
Nine hub genes predictive of survival in patients with breast in the Cancer Genome Atlas database.
| Gene symbol | Gene name | Hazard ratio | Cox P-value | Confidence interval (95%) |
|---|---|---|---|---|
| Transmembrane protein 47 | 1.161 | 0.004 | 1.049–1.286 | |
| Gap junction γ-1 protein | 1.192 | 0.025 | 1.022–1.390 | |
| Annexin A3 | 1.114 | 0.016 | 1.021–1.214 | |
| Twist-related protein 1 | 1.145 | 0.019 | 1.022–1.283 | |
| Immortalization upregulated protein | 0.956 | 0.118 | 0.903–1.012 | |
| Stromal cell-derived factor 1 | 1.203 | 0.001 | 1.076–1.345 | |
| Sulfatase 1 | 0.950 | 0.375 | 0.848–1.064 | |
| Docking protein 5 | 1.045 | 0.395 | 0.944–1.158 | |
| Dickkopf-related protein 3 | 1.194 | 0.004 | 1.059–1.344 |
Figure 4.Kaplan-Meier survival curves for testing hub genes in blue and red module. (A) The classification of low-risk and high-risk by WPI of hub genes in overall survival (days). (B) Kaplan-Meier curve obtained from WPI classification by hub genes expression in breast cancer patients (P=0.0248). WPI, weighted prognostic index; TCGA, The Cancer Genome Atlas.
Figure 5.Kaplan-Meier survival curves and ROC curves for testing hub genes in blue and red module in GSE3439 dataset. (A) Kaplan-Meier curve obtained from the weighted prognostic index classification by hub gene expression in breast cancer patients (P=0.0241). (B) ROC curve had an area under the curve of 0.697 in validation data set. GEO, Gene Expression Omnibus; ROC, receiver operating characteristic.