| Literature DB >> 26285647 |
Tommaso De Marchi1, Ning Qing Liu2, Cristoph Stingl3, Mieke A Timmermans4, Marcel Smid5, Maxime P Look6, Mila Tjoa7, Rene B H Braakman8, Mark Opdam9, Sabine C Linn10, Fred C G J Sweep11, Paul N Span12, Mike Kliffen13, Theo M Luider14, John A Foekens15, John W M Martens16, Arzu Umar17.
Abstract
Estrogen receptor (ER) positive tumors represent the majority of breast malignancies, and are effectively treated with hormonal therapies, such as tamoxifen. However, in the recurrent disease resistance to tamoxifen therapy is common and a major cause of death. In recent years, in-depth proteome analyses have enabled identification of clinically useful biomarkers, particularly, when heterogeneity in complex tumor tissue was reduced using laser capture microdissection (LCM). In the current study, we performed high resolution proteomic analysis on two cohorts of ER positive breast tumors derived from patients who either manifested good or poor outcome to tamoxifen treatment upon recurrence. A total of 112 fresh frozen tumors were collected from multiple medical centers and divided into two sets: an in-house training and a multi-center test set. Epithelial tumor cells were enriched with LCM and analyzed by nano-LC Orbitrap mass spectrometry (MS), which yielded >3000 and >4000 quantified proteins in the training and test sets, respectively. Raw data are available via ProteomeXchange with identifiers PXD000484 and PXD000485. Statistical analysis showed differential abundance of 99 proteins, of which a subset of 4 proteins was selected through a multivariate step-down to develop a predictor for tamoxifen treatment outcome. The 4-protein signature significantly predicted poor outcome patients in the test set, independent of predictive histopathological characteristics (hazard ratio [HR] = 2.17; 95% confidence interval [CI] = 1.15 to 4.17; multivariate Cox regression p value = 0.017). Immunohistochemical (IHC) staining of PDCD4, one of the signature proteins, on an independent set of formalin-fixed paraffin-embedded tumor tissues provided and independent technical validation (HR = 0.72; 95% CI = 0.57 to 0.92; multivariate Cox regression p value = 0.009). We hereby report the first validated protein predictor for tamoxifen treatment outcome in recurrent ER-positive breast cancer. IHC further showed that PDCD4 is an independent marker.Entities:
Keywords: Biomarker; Breast cancer; Mass spectrometry; Proteomics; Tamoxifen resistance
Mesh:
Substances:
Year: 2015 PMID: 26285647 PMCID: PMC5528925 DOI: 10.1016/j.molonc.2015.07.004
Source DB: PubMed Journal: Mol Oncol ISSN: 1574-7891 Impact factor: 6.603
Figure 1Data analysis flow‐chart and development of predictor for tamoxifen treatment outcome. Patients were divided into two independent cohorts and separately measured by LC‐MS. Proteomic data from training and test sets were analyzed separately in MaxQuant. Identified proteins were filtered for reversed sequences and for Posterior Error Probability score (PEP < 0.05), intensities of commonly expressed proteins were normalized using ComBat algorithm to minimize batch effects, and filtered for missing data (10 minimum observations for global proteomic analysis and allowing 30% and 0% missing data in training and test set respectively for predictor generation). Student t test (p value < 0.05) was then used to assess differences in protein expression levels between good and poor outcome patients. A multivariate regression model was used to obtain an optimal list of 4 proteins to be tested as a predictor of tamoxifen treatment outcome: CGN, G3BP2, PDCD4 and OCIAD1. The 4‐protein signature was confirmed in an external test set. Acronyms: EMC = Erasmus MC, University Medical Center; NKI‐AVL=Netherlands Cancer Institute‐ Antoni van Leeuwenhoek hospital; RadboudUMC = Radboud University Medical Center.
Patient and tumor characteristics.
| Traininga | Testa | ||
|---|---|---|---|
| EMC | NKI‐AVL | RadboudUMC | |
| All patients | 56 (100) | 41 (100) | 15 (100) |
| Age | |||
| ≤ 55 years | 15 (27) | 12 (29) | 1 (7) |
| > 55 years | 41 (73) | 29 (71) | 14 (93) |
| Menopausal status | |||
| Premenopausal | 10 (18) | 11 (27) | 0 (0) |
| Postmenopausal | 46 (82) | 30 (63) | 15 (100) |
| Tumor size | |||
| T1 (≤2 cm) | 12 (21) | 20 (49) | 5 (33) |
| T2 (2–5 cm) + Tx | 40 (72) | 19 (46) | 9 (60) |
| T3 (>5 cm) + T4 | 4 (7) | 2 (5) | 1 (7) |
| Tumor differentiationb | |||
| Good/Moderate | 13 (59) | 29 (71) | 8 (53) |
| Poor | 33 (23) | 12 (29) | 4 (27) |
| Unknown | 10 (18) | 0 (0) | 3 (20) |
| Disease free interval | |||
| ≤ 12 months | 24 (43) | 4 (10) | 5 (33) |
| > 12 months | 32 (57) | 37 (90) | 10 (67) |
| PgRc | |||
| Negative | 9 (16) | 17 (41) | 11 (73) |
| Positive | 44 (79) | 24 (59) | 4 (27) |
| Involved lymph nodes | |||
| 0 | 31 (55) | 24 (58) | 6 (40) |
| ≥ 1 | 20 (36) | 16 (39) | 7 (47) |
| unknown | 5 (9) | 1 (3) | 2 (13) |
| Dominant site of relapse | |||
| Loco‐regional | 8 (14) | 4 (10) | 0 (0) |
| Bone | 26 (46) | 12 (29) | 6 (40) |
| Visceral | 13 (24) | 6 (15) | 9 (60) |
| Bone and other | 9 (16) | 14 (34) | 0 (0) |
| Unknown | 0 (0) | 5 (12) | 0 (0) |
Acronym: PgR: progesterone receptor.
Data are reported as number (percentage).
Histopathological characteristics were evaluated by local pathologists, according to standard clinical practice at time of sample collection.
Missing data not reported.
Figure 2Protein abundance levels in 112 ER positive breast cancer samples. The waterfall plot shows mean protein abundance distribution of 1.960 commonly expressed proteins. The mean abundance of each quantified protein was calculated and plotted. The 30 least (blue) and most (red) abundant proteins are boxed in panel (A) and enlarged in panel (B) and (C), respectively.
Figure 3Protein compartmentalization and abundance correlation analysis. Panel shows quantified protein abundance range per subcellular compartment in the LCM enriched 112 ER positive tumors (A) and in WTL control replicates (B). Number of proteins per compartment and percentages are displayed above the dot plot.
Figure 4Hierarchical clustering and differential protein abundance of 4‐protein predictor. Samples in the training set (n = 56) were hierarchically clustered based on 99 differentially abundant proteins (t test p value < 0.05). Log10 intensities of differentially abundant proteins constituting the predictor for tamoxifen treatment outcome are shown in scatter dot plots. Eight poor and four good outcome patients were misclassified (A). Three out of four proteins, CGN (Uniprot accession number: Q9P2M7; p value = 0.006), OCIAD1 (Uniprot accession number: Q9NX40; p value < 0.001) and PDCD4 (Uniprot accession number: Q53EL6; p value < 0.001), had higher abundance in patients with good outcome, whereas G3BP2 (Uniprot accession number Q9UN86; p value < 0.001) was found more highly expressed in the poor outcome patient group (B).
LFQ based identification of 4 proteins in discovery and validation sets.
| Protein ID | Gene name | Molecular weight (kDa) | Peptides/Unique peptidesa | Sequence coverage/Unique sequence coverage (%/%)b | PEP scorec | |||
|---|---|---|---|---|---|---|---|---|
| Training | Test | Training | Test | Training | Test | |||
| Q9P2M7 | CGN | 136.380 | 22/22 | 40/40 | 22.8/22.8 | 39.5/39.5 | 3.63E‐133 | 7.88E‐268 |
| Q9UN86 | G3BP2 | 54.120 | 5/5 | 8/7 | 16.4/16.4 | 22.0/19.5 | 1.61E‐34 | 5.85E‐84 |
| Q9NX40 | OCIAD1 | 27.626 | 8/8 | 9/9 | 32.2/32.2 | 32.7/32.7 | 5.35E‐146 | 2.59E‐247 |
| Q53EL6 | PDCD4 | 51.735 | 19/19 | 18/18 | 49.3/49.3 | 47.8/47.8 | 1.24E‐225 | 3.23E‐281 |
Ratio between peptides and unique peptides associated to each predictor protein.
Peptides/Unique peptides sequence coverage of each protein sequence.
PEP: represents an estimation of a false identification.
Information on the 4 proteins constituting the predictor for tamoxifen therapy outcome.
| Gene name | GO cellular component | Protein name | Student t | p value |
|---|---|---|---|---|
| CGN | Cell junction | Cingulin | 3.13 | 0.006 |
| G3BP2 | Cytoplasm | Ras GTPase‐activating protein‐binding protein 2 | 3.50 | <0.001 |
| OCIAD1 | Endosome, Mitochondrion | Ovarian carcinoma immunogenic antigen domain‐containing protein 1 | 4.15 | <0.001 |
| PDCD4 | Cytoplasm, Nucleus | Programmed cell death protein 4 | 3.99 | <0.001 |
Figure 5ROC curve of the training set and Kaplan–Meier curves for TTP as a function of predicted outcome in patients in the test set. Patient outcome scores from the training set were calculated based on abundance levels of the 4 predictor proteins and protein weights (i.e. Student t value). The ROC curve was generated and Youden maximum (J = 0.740) was chosen as the best discriminatory cutoff (A). Patient scores were subsequently calculated for patients in the test set, survival curves were generated for the predicted groups and differences were assessed with the Log‐rank test (B). Acronym: AUC: area under the curve; HR: hazard ratio; CI: confidence interval.
Univariate and multivariate Cox regression analysis for time to progression.
| Factors | Hazard ratio | Univariate 95% CI | p value | Hazard ratio | Multivariate 95% CI | p value |
|---|---|---|---|---|---|---|
| 4 protein predictor score | ||||||
| High | 1.00 | 1.00 | ||||
| Low | 2.44 | 1.30–4.54 | 0.006 | 2.17 | 1.15–4.17 | 0.017 |
| Age | ||||||
| ≤55 years | 1.00 | 1.00 | ||||
| >55 years | 0.44 | 0.23–0.86 | 0.017 | 0.55 | 0.28–1.08 | 0.083 |
| Disease free interval | ||||||
| ≤12 months | 1.00 | |||||
| >12 months | 0.63 | 0.30–1.31 | 0.213 | |||
| Dominant site of relapse | Overall p | |||||
| Loco‐regional | 1.00 | 0.270 | ||||
| Bone | 0.89 | 0.30–2.68 | ||||
| Visceral | 0.68 | 0.22–2.09 | ||||
| Bone and other | 0.45 | 0.14–1.40 | ||||
| PgR | ||||||
| Negative | 1.00 | |||||
| Positive | 0.57 | 0.32–1.00 | 0.052 | |||
Acronym: PgR: progesterone receptor.
Univariate and multivariate Cox regression analysis for time to progression.
| Factors | Hazard ratio | Univariate 95% CI | p value | Hazard ratio | Multivariate 95% CI | p value |
|---|---|---|---|---|---|---|
| PDCD4 | ||||||
| Low | 1.00 | 1.00 | ||||
| High | 0.75 | 0.59–0.96 | 0.020 | 0.72 | 0.57–0.92 | 0.009 |
| Age | ||||||
| ≤55 | 1.00 | 1.00 | ||||
| >55 | 0.58 | 0.45–0.70 | <0.001 | 0.52 | 0.40–0.67 | <0.001 |
| Disease free interval | ||||||
| ≤12 months | 1.00 | 1.00 | ||||
| >12 months | 0.73 | 0.54–0.99 | 0.042 | 0.63 | 0.46–0.87 | 0.004 |
| Dominant site of relapse | Overall p | |||||
| Loco‐regional | 1.00 | 0.310 | ||||
| Bone | 1.39 | 0.91–2.10 | ||||
| Visceral | 1.15 | 0.73–1.81 | ||||
| Bone and other | 1.38 | 0.89–2.13 | ||||
| PgR | ||||||
| Negative | 1.00 | |||||
| Positive | 0.77 | 0.59–1.01 | 0.062 | |||
Acronym: PgR: progesterone receptor.
Figure 6PDCD4 immunohistochemical staining of tissue micro‐array. Tissue cores showed two different staining patterns that have been evaluated by histo‐score (i.e. Histo‐score < 30 and ≥ 30), representing low and high PDCD4 protein expression (A). Patients were categorized according to histo‐score cutoff and TTP was plotted as a Kaplan–Meier curve. The Log‐rank test was used to test for differences in TTP between the two survival curves (B). Acronym: HR: hazard ratio; CI: confidence interval.