| Literature DB >> 24586640 |
Melike Marsan1, Gert Van den Eynden2, Ridha Limame3, Patrick Neven4, Jan Hauspy2, Peter A Van Dam5, Ignace Vergote4, Luc Y Dirix2, Peter B Vermeulen2, Steven J Van Laere1.
Abstract
INTRODUCTION: Metastases remain the primary cause of cancer-related death. The acquisition of invasive tumour cell behaviour is thought to be a cornerstone of the metastatic cascade. Therefore, gene signatures related to invasiveness could aid in stratifying patients according to their prognostic profile. In the present study we aimed at identifying an invasiveness gene signature and investigated its biological relevance in breast cancer. METHODS &Entities:
Mesh:
Year: 2014 PMID: 24586640 PMCID: PMC3931724 DOI: 10.1371/journal.pone.0089262
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Collection of gene signatures used for overrepresentation analysis.
| Description Gene Signature | # Genes | % Original | Reference | |
| 1. | Differentially expressed genes by TGFβ in p53-depleted MDA-MB-231 cells | 105 | 5 | Adorno et al. 2009 |
| 2. | Coculture of mesenchymal stromal cells with CD133+ hematopoietic stem cells | 21 | 2 | Alakel et al. 2009 |
| 3. | TGFβ-induced EMT in HMECs predisposed to ionizing radiation | 32 | 3 | Andarawewa et al. 2007 |
| 4. | Trophoblast invasion-related genes | 648 | 7 | Bilban et al. 2009 |
| 5. | Overexpression of Integrin α6β4 in MDA-MB-435 cells | 263 | 10 | Chen et al. 2009 |
| 6. | Ezrin knockdown in SW480 | 26 | 2 | GSE11297 |
| 7. | Comparison of MDA-MB-231 cells with wild-type SNAIL and dominant negative SNAIL | 50 | 2 | Fabre-Guillevin et al. 2008 |
| 8. | Comparison of MDA-MB-435 cells with wild-type NM23-H1 and mutant NM23-H1 | 44 | 3 | Horak et al. 2007 |
| 9. | Functional implications of non-lens βγ-Cristallin and Refoil Factor Complex | 55 | 6 | Liu et al. 2008 |
| 10. | Gene expression profiling of central and peripheral zones of pancreatic carcinoma | 756 | 12 | Nakamura et al. 2007 |
| 11. | Normal HMECs vs. HMECs transfected with constitutively active RhoA | 135 | 2 | GSE12917 |
| 12. | Overexpression of classIIb HLH factors E2-2A and E2-2B in MDCK cells | 147 | 5 | Sobrado et al. 2009 |
| 13. | Genes epigenetically regulated in poorly metastatic MDA-MB-468 cells vs the highly metastatic MDA-MB-468LN variant | 136 | 2 | Rodenhiser et al. 2008 |
| 14. | Expression profiling of migratory cells in the Drosophila ovary | 33 | 3 | Wang et al. 2006 |
| 15. | Transfection of MIR-520C in MCF-7 cells | 113 | 4 | Huang et al. 2008 [17 |
| 16. | Transfection of MIR-373 in MCF-7 cells | 128 | 7 | Huang et al. 2008 [17 |
| 17. | Comparison of mesenchymal and epithelial cells | 186 | 3 | Choi et al. 2010 |
| 18. | Genes differentially expressed in mesenchymal stem cells induced by CCL25 | 105 | 3 | Binger et al. 2009 |
| 19. | Genes differentially expressed across a collection of 10 migratory glioma cell lines | 89 | 1 | Demuth et al. 2009 |
| 20. | Genes differentially expressed in MDA-MB-231 cells after CD146 downmodulation | 45 | 3 | Zabouo et al. 2009 |
| 21. | Keratinocyts treated with TGFβ to suppress proliferation but not migration | 92 | 4 | Cheng et al. 2008 |
Gene expression data sets used throughout this study.
| Group | ID | Repository | Platform | N | DMFS | Remark | Reference |
|
| E-TABM-157 | Array express | HGU133A | 51 | NA | Breast cancer cell line collection | Neve et al. 2006 |
| GSE12777 | GEO | HGU133PLUS2 | 39 | NA | Breast cancer cell line collection | Hollestelle et al. 2009 | |
| GSE16795 | GEO | HGU133A | 51 | NA | Breast cancer cell line collection | Hoeflich et al. 2009 | |
| GSE11279 | GEO | HGU133PLUS2 | 4 | NA | Ezrin knockdown SW480 | – | |
| GSE12917 | GEO | HGU133PLUS2 | 6 | NA | Normal and RhoA-transfectedHMECs | – | |
| GSE12548 | GEO | HGU133PLUS2 | 20 | NA | EMT time series in ARPE19 | Takahashi et al. 2010 | |
| GSE17708 | GEO | HGU133PLUS2 | 26 | NA | Time course of A549 treatedwith TGFβ | Sartor et al. 2010 | |
| GSE24202 | GEO | HGU133A | 21 | NA | HMLEs transfectedwith EMT-inducers | Taube et al. 2010 | |
| GSE11259 | GEO | HGU133v2 | 9 | NA | [Non]Metastatic 4T1 clones | Lou et al. 2008 | |
|
| GSE1456 | GEO | HGU133A | 159 | – | None | Pawitan et al. 2005 |
| GSE2034 | GEO | HGU133A | 286 | 286 | Lymph node negative cohort | Wang et al. 2005 | |
| GSE2603 | GEO | HGU133A | 99 | 82 | None | Minn et al. 2005 | |
| GSE2990 | GEO | HGU133A | 189 | 125 | Cohort used for generation of GGI | Sotiriou et al. 2006 | |
| GSE4922 | GEO | HGU133A | 289 | – | None | Ivshina et al. 2006 | |
| GSE5327 | GEO | HGU133A | 58 | 58 | ER negative cohort | Minn et al. 2007 | |
| GSE7390 | GEO | HGU133A | 198 | 198 | Lymph node negative cohort | Desmedt et al. 2007 | |
| GSE11121 | GEO | HGU133A | 200 | 200 | Lymph node negative cohort | Schmidt et al. 2008 | |
| GSE12093 | GEO | HGU133A | 136 | – | ER+, tamoxifen treated | Zhang et al. 2009 | |
| GSE17705 | GEO | HGU133A | 298 | 298 | ER+, tamoxifen treated | Symmans et al. 2010 | |
| GSE25055 | GEO | HGU133A | 310 | 66 | ErbB2-, Anthracyclin/Taxane-treated | Hatzis et al. 2011 | |
| GSE25065 | GEO | HGU133A | 198 | 198 | ErbB2-, Anthracyclin/Taxane-treated | Hatzis et al. 2011 |
Figure 1Core invasiveness genes/gene signature collection Interaction matrix.
Interaction matrix representing the core invasiveness genes in the X-axis and the gene signature collection used for the overrepresentation analysis in the Y-axis. A blue cell indicates membership of the associated CIG in the corresponding gene signature. Most of the signatures count at least 2 CIGs in their gene lists except for the gene signature indentified in migratory glioma cells.
Figure 2Validation of the CIG signature using breast cancer cell line gene expression data.
The top row (A–C) shows the scatter plot representations of the PCAs performed on the distinct breast cancer cell lines data sets (GSE12777, GSE16795 and E-TABM-157) using the CIG signature. The X-axis represents the first principal component; the Y-axis represents the second principal component. A blue dot indicates a non-invasive breast cancer cell line and a red dot indicates an invasive breast cancer cells line. The black dots represent the centroids of the invasive and non-invasive cell lines. The middle row (D–F) shows the distributions of the Euclidean distances between the centroids of the invasive and non-invasive cell lines after class label permutation. The dashed vertical line indicates the true Euclidean distance between the centroids of the invasive and non-invasive cell lines. The lower row (G–I) represents the distributions of the Euclidean distances between the centroids of the invasive and non-invasive breast cancer cell lines based upon random selections of 16 genes from the group of 2636 genes obtained from the collection signatures associated with cell motility or invasion. The dashed vertical line indicates the Euclidean distance between the centroids of the invasive and non-invasive cells based on the 16 CIGs. These data demonstrate that the CIG signature is able to segregate invasive and non-invasive breast cancer cell lines and performs better then random selections of genes, which validates our gene selection strategy.
Figure 3PCA for the CIGs on the breast cancer cell line data sets.
Regression coefficients for the first principal components obtained by performing PCA for the CIGs on the breast cancer cell line data sets. The X-axis represents the 16 CIGs, the Y-axis represents the regression coefficients. The black, red and green dots are indicative for respectively GSE12777, GSE16795 and E-TABM-157. Positive and negative regression coefficients indicate respectively pro-invasive and contra-invasive genes. The magnitude of the regression coefficient reflects the importance of the corresponding gene in determining the CIG expression. The horizontal dashed line indicates a regression coefficient of zero. Some genes are represented less than 3 times due to the fact that not all CIG were amongst the informative gene list in every cell line data set. Most of the CIGs, except for S100A8, STC1 and LTBP1 show consistent regression coefficients indicating that they have a similar behaviour with respect to the prediction of the invasiveness phenotype of breast cancer cells in all 3 data sets.
Figure 4Boxplots showing the relation between CIG expression and EMT.
The top row (A–B) represents a time series of different cell lines treated with EMT-inducing factors. These data demonstrate that CIG expression increases by incubation time. The lower left boxplot (C) indicates that CIG expression is induced by all of the known EMT-inducing factors, but most strongly downstream of GSC. The lower right boxplot (D) indicates that CIG expression does not necessarily correlate with metastatic capability as the cell line with the highest metastatic capability has the lowest CIG expression.
Figure 5Association between published gene signatures and the CIG signature in human breast cancer.
Heatmap showing the association between the expressions of several published gene signatures and the CIG signature in a set of approximately 2.500 breast tumour samples. The rows and columns represent the set of analysed gene expression signatures organized into groups related to prognosis, EMT, pathway activation, stem cell biology, breast tumour heterogeneity and stromal involvement. The cells at the intersection between the rows and the columns are colour-coded with red indicating a positive correlation between the respective gene signatures and white indicating a negative correlation. Colour saturation is associated the magnitude the correlation coefficient. The dendrogram is divided in 3 groups (red, blue and green) of strongly associated gene signatures. Underneath the heatmap the Spearman correlation coefficients between the CIG signature and the remaining signatures is represented as well as the ten signatures most strongly associated with the CIG signature.
Survival analysis.
| UNIVARIATE | MODEL 1 | MODEL 2 | |||||||||||||
| Parameter | N | Cluster | β value | CI-low | CI-high | P-value | β value | CI- low | CI-high | P-value | β value | CI-low | CI-high | P-value | |
|
| Basal like | 1508 | Luminal Progenitor | 1.996 | 1.621 | 2.459 | P<0.001 | – | – | – | – | – | – | – | – |
| ErbB2+ | 1508 | Luminal Progenitor | 2.915 | 2.198 | 3.864 | P<0.001 | – | – | – | – | – | – | – | – | |
| Luminal A | 1508 | Luminal Mature Cell | 0.398 | 0.326 | 0.486 | P<0.001 | 0.525 | 0.391 | 0.707 | P<0.001 | – | – | – | – | |
| Luminal B | 1508 | Luminal Progenitor | 1.431 | 1.117 | 1.832 | P = 0.004 | 0.422 | 0.173 | 1.032 | – | 0.771 | 1.325 | 2.367 | P<0.001 | |
| Normal-like | 1508 | Mammary Stem Cell | 0.486 | 0.379 | 0.623 | P<0.001 | 0.310 | 0.117 | 0.822 | – | – | – | – | – | |
| Claudin-Low | 1508 | Mammary Stem Cell | 0.945 | 0.633 | 1.411 | P = 0.782 | – | – | – | – | – | – | – | – | |
|
| Proliferation (SSP) | 1508 | Luminal Progenitor | 1.826 | 1.570 | 2.124 | P<0.001 | 1.483 | 1.191 | 1.848 | P<0.001 | – | – | – | – |
| 70 Gene Prognostic Index | 1508 | Luminal Progenitor | 2.758 | 2.124 | 3.581 | P<0.001 | 1.661 | 1.132 | 2.438 | – | – | – | – | – | |
| Invasiveness Gene Signature | 1508 | Luminal Progenitor | 3.738 | 2.561 | 5.455 | P<0.001 | – | – | – | – | – | – | – | – | |
|
| Mammosphere Signature | 1508 | Mammary Stem Cell | 0.507 | 0.427 | 0.603 | P<0.001 | 0.540 | 0.442 | 0.662 | P<0.001 | 0.597 | 0.488 | 0.731 | P<0.001 |
| CD44+ Signature | 1508 | Mammary Stem Cell | 2.060 | 1.610 | 2.637 | P<0.001 | 2.236 | 1.757 | 2.486 | P<0.001 | 2.009 | 1.567 | 2.576 | P<0.001 | |
| Luminal Mature Cell (DPM) | 1508 | Luminal Mature Cell | 0.523 | 0.362 | 0.756 | P<0.001 | – | – | – | – | – | – | – | – | |
| Luminal Progenitor Cell (DPM) | 1508 | Luminal Progenitor | 3.903 | 2.598 | 5.863 | P<0.001 | 1.724 | 1.258 | 2.808 | – | – | – | – | – | |
| Mammary Stem Cell (DPM) | 1508 | Mammary Stem Cell | 1.253 | 0.757 | 2.076 | P = 0.381 | – | – | – | – | – | – | – | – | |
|
| Quiescent WHR | 1508 | Luminal Mature Cell | 0.246 | 0.140 | 0.430 | P<0.001 | 0.291 | 0.164 | 0.516 | P<0.001 | – | – | – | – |
| Activated WHR | 1508 | Luminal Progenitor | 3.559 | 2.133 | 5.939 | P<0.001 | – | – | – | – | – | – | – | – | |
| Stromal Metagene | 1508 | Mammary Stem Cell | 0.766 | 0.681 | 0.862 | P<0.001 | 0.800 | 0.710 | 0.903 | P<0.001 | – | – | – | – | |
|
| AKT | 1508 | Luminal Progenitor | 0.727 | 0.611 | 0.865 | P<0.001 | 0.638 | 0.492 | 0.827 | P<0.001 | – | – | – | – |
|
| CTNβ | 1508 | Luminal Progenitor | 1.191 | 1.099 | 1.291 | P<0.001 | – | – | – | – | – | – | – | – |
| E2F1 | 1508 | Luminal Progenitor | 0.916 | 0.846 | 0.992 | P = 0.030 | – | – | – | – | – | – | – | – | |
| EGFR | 1508 | Mammary Stem Cell | 0.540 | 0.439 | 0.664 | P<0.001 | – | – | – | – | – | – | – | – | |
| ER | 1508 | Luminal Mature Cell | 0.782 | 0.714 | 0.857 | P<0.001 | – | – | – | – | – | – | – | – | |
| ERBB2 | 1508 | Mammary Stem Cell | 0.998 | 0.904 | 1.102 | P = 0.965 | – | – | – | – | – | – | – | – | |
| INFα | 1508 | Luminal Progenitor | 1.017 | 0.986 | 1.049 | P = 0.278 | – | – | – | – | – | – | – | – | |
| INFγ | 1508 | Luminal Progenitor | 1.001 | 0.970 | 1.033 | P = 0.949 | – | – | – | – | – | – | – | – | |
| MYC | 1508 | Luminal Progenitor | 1.236 | 1.030 | 1.483 | P = 0.022 | 0.746 | 0.559 | 0.996 | – | 0.531 | 0.423 | 0.666 | P<0.001 | |
| P53 | 1508 | Luminal Mature Cell | 0.725 | 0.673 | 0.781 | P<0.001 | 0.597 | 0.497 | 0.716 | P<0.001 | 0.682 | 0.614 | 0.758 | P<0.001 | |
| PI3K | 1508 | Luminal Progenitor | 0.882 | 0.809 | 0.963 | P = 0.005 | 0.854 | 0.746 | 0.958 | – | – | – | – | – | |
| PR | 1508 | Luminal Mature Cell | 0.885 | 0.853 | 0.918 | P<0.001 | 1.139 | 1.054 | 1.231 | – | – | – | – | – | |
| RAS | 1508 | Luminal Progenitor | 1.534 | 1.223 | 1.924 | P<0.001 | 2.019 | 1.281 | 3.184 | – | – | – | – | – | |
| SRC | 1508 | Luminal Progenitor | 1.112 | 1.057 | 1.170 | P<0.001 | – | – | – | – | – | – | – | – | |
| STAT3 | 1508 | Mammary Stem Cell | 1.067 | 0.833 | 1.368 | P = 0.607 | – | – | – | – | – | – | – | – | |
| TNFα | 1508 | Luminal Progenitor | 1.151 | 0.971 | 1.363 | P = 0.105 | – | – | – | – | – | – | – | – | |
| TGFβ | 1508 | Mammary Stem Cell | 0.728 | 0.626 | 0.847 | P<0.001 | – | – | – | – | – | – | – | – | |
| VEGF | 1508 | Luminal Progenitor | 1.189 | 1.032 | 1.369 | P = 0.017 | – | – | – | – | – | – | – | – | |
| P63 | 1508 | Luminal Progenitor | 0.943 | 0.859 | 1.035 | P = 0.213 | – | – | – | – | – | – | – | – | |
|
| EMT | 1508 | Mammary Stem Cell | 0.940 | 0.922 | 0.959 | P<0.001 | 0.971 | 0.948 | 0.995 | – | – | – | – | – |
| SNAIL | 1508 | Mammary Stem Cell | 0.745 | 0.654 | 0.851 | P<0.001 | 0.855 | 0.746 | 0.981 | – | 0.863 | 0.746 | 1.000 | P = 0.049 | |
| TWIST | 1508 | Luminal Progenitor | 0.901 | 0.834 | 0.974 | P = 0.008 | 0.811 | 0.742 | 0.885 | P<0.001 | 0.893 | 0.774 | 0.907 | P<0.001 | |
| GSC | 1508 | Luminal Mature Cell | 0.884 | 0.847 | 0.924 | P<0.001 | 0.875 | 0.827 | 0.927 | P<0.001 | – | – | – | – | |
| E-CADHERIN | 1508 | Luminal Progenitor | 1.025 | 0.989 | 1.063 | P = 0.168 | – | – | – | – | – | – | – | – | |
| Core Invasiveness Gene Signature | 1508 | Mammary Stem Cell | 0.956 | 0.896 | 1.019 | P = 0.168 | – | – | – | – | – | – | – | – | |