| Literature DB >> 32370309 |
Masanori Oshi1,2, Eriko Katsuta1, Li Yan3, John M L Ebos4, Omar M Rashid5,6, Ryusei Matsuyama2, Itaru Endo2, Kazuaki Takabe1,2,7,8,9,10.
Abstract
We generated a 4-gene score with genes upregulated in LM2-4, a metastatic variant of MDA-MB-231 (DOK 4, HCCS, PGF, and SHCBP1) that was strongly associated with disease-free survival (DFS) in TCGA cohort (hazard ratio [HR]>1.2, p < 0.02). The 4-gene score correlated with overall survival of TCGA (HR = 1 .44, p < 0 .001), which was validated with DFS and disease-specific survival of METABRIC cohort. The 4-gene score was able to predict worse survival or clinically aggressive tumors, such as high Nottingham pathological grade and advanced cancer staging. High score was associated with worse survival in the hormonal receptor (HR)-positive/Her2-negative subtype. High score enriched cell proliferation-related gene sets in GSEA. The score was high in primary tumors that originated, in and metastasized to, brain and lung, and it predicted worse progression-free survival for metastatic tumors. Good tumor response to neoadjuvant chemotherapy or hormonal therapy was accompanied by score reduction. High scores were also predictive of response to neoadjuvant chemotherapy for HR-positive/Her2-negative subtype. High score tumors had increased expression of T cell exhaustion marker genes, suggesting that the score may also be a biomarker for immunotherapy response. Our novel 4-gene score with both prognostic and predictive values may, therefore, be clinically useful particularly in HR-positive breast cancer.Entities:
Keywords: biomarker; breast cancer; gene expression; metastasis; prognosis; treatment response
Year: 2020 PMID: 32370309 PMCID: PMC7281399 DOI: 10.3390/cancers12051148
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Figure 1Patients with a high 4-gene score had poor survival and aggressive clinical parameters in two breast cancer cohorts. (A) Disease-free survival (DFS), and overall (OS) or disease-specific (DSS) survival in TCGA and METABRIC cohorts along with hazard ratios (HR) and their 95% confidence intervals and p-value are shown for the score and individually for each of its constituent genes. (B) Boxplots of 4-gene scores of tumors of different Nottingham pathological grades and American Joint Committee on Cancer (AJCC) stages. All plots are of Tukey type and boxes depict medians and inter-quartile ranges. p values were calculated with one-way ANOVA test. (C,D) Kaplan–Meier plots with logrank test p values are shown for association between the 4-gene score with DFS and OS or DSS for different grades and stages. Within-cohort one-third percentile value of the 4-gene score or expression of individual genes was used to classify patients into low and high groups.
Survival analyses of 4-gene score and other factors in TCGA and METABRIC cohorts.
| Factors | Univariate | Multivariate | ||
|---|---|---|---|---|
| HR (95% CI) |
| HR (95% CI) |
| |
| TCGA (DFS) | ||||
| Age (≥ 50 vs. < 50 y) | 0.91 (0.67–1.33) | 0.609 | ||
| Subtype (HR+/HER2− vs. other) | 0.67 (0.44–1.01) | 0.054 | ||
| AJCC stage (III/IV vs. I/II) | 2.86 (1.93–4.21) | <0.001 | 2.89 (1.96–4.27) | <0.001 |
| Race (white vs. black) | 0.66 (0.42–1.02) | 0.059 | ||
| Histology (ILC vs. IDC) | 1.15 (0.71–1.84) | 0.577 | ||
| Score (high vs. low) | 1.74 (1.20–2.53) | 0.004 | 1.91 (1.28–2.83) | 0.001 |
| TCGA (OS) | ||||
| Age (≥ 50 vs. < 50 y) | 1.53 (1.06–2.22) | 0.024 | 1.81 (1.16–2.83) | 0.009 |
| Subtype (HR+/HER2− vs. Other) | 0.611 (0.42–0.89) | 0.010 | 0.68 (0.46–1.00) | 0.052 |
| AJCC stage (III/IV vs. I/II) | 2.64 (1.89–3.69) | <0.001 | 3.18 (2.18–4.65) | <0.001 |
| Race (white vs. black) | 0.82 (0.55–1.23) | 0.347 | ||
| Histology (ILC vs. IDC) | 0.90 (0.58–1.40) | 0.637 | ||
| Score (High vs. Low) | 1.87 (1.35–2.58) | <0.001 | 2.18 (1.48–3.23) | <0.001 |
| METABRIC (DFS) | ||||
| Age (≥ 50 vs. < 50 y) | 1.12 (0.94–1.33) | 0.205 | ||
| Subtype (HR+/HER2- vs. Other) | 0.64 (0.55–0.75) | <0.001 | 0.75 (0.62–0.91) | 0.004 |
| AJCC stage (III/IV vs. I/II) | 3.03 (2.37–3.88) | <0.001 | 2.84 (2.21–3.64) | <0.001 |
| Cellularity (high vs. other) | 1.02 (0.87–1.18) | 0.840 | ||
| Histology (ILC vs. IDC) | 0.97 (0.73–1.30) | 0.862 | ||
| Grade (3 vs. 1/2) | 1.52 (1.30–1.77) | <0.001 | 1.55 (1.04–2.32) | 0.031 |
| Score (High vs. Low) | 1.44 (1.23–1.68) | <0.001 | 1.34 (1.12–1.61) | 0.002 |
| METABRIC (DSS) | ||||
| Age (≥ 50 vs. < 50 y) | 1.17 (0.98–1.39) | 0.070 | ||
| Subtype (HR+/HER2− vs. Other) | 0.67 (0.57–0.78) | <0.001 | 0.78 (0.65–0.95) | 0.014 |
| AJCC stage (III/IV vs. I/II) | 2.94 (2.30–3.76) | <0.001 | 2.77 (2.15–3.55) | <0.001 |
| Cellularity (high vs. other) | 1.01 (0.87–1.18) | 0.868 | ||
| Histology (ILC vs. IDC) | 0.98 (0.74–1.31) | 0.900 | ||
| Grade (3 vs.1/2) | 1.49 (1.27–1.74) | <0.001 | 1.57 (1.05–2.34) | 0.027 |
| Score (High vs. Low) | 1.46 (1.25–1.71) | <0.001 | 1.39 (1.15–1.67) | <0.001 |
AJCC, American Joint Committee on Cancer; CI, confidence interval; DSS, disease-specific survival; DFS, disease-free survival; HR, hazard rate; HR+/HER2-, hormonal receptor positive/human epidermal growth factor receptor 2 negative; ILC, infiltrating lobular carcinoma; IDC, infiltrating ductal carcinoma; OS, overall survival; TNBC, triple negative breast cancer.
Figure 2The 4-gene score is prognostic for disease-free survival (DFS) especially for hormonal receptor (HR)+/ human epidermal growth factor receptor 2 (HER2) cancer. At the top, boxplots show the association between the 4-gene score and cancer subtype. All boxplots are of Tukey type and boxes depict medians and inter-quartile ranges. p values were calculated with one-way ANOVA test. Below are Kaplan–Meier plots with logrank test p values for association between tumor 4-gene score and DFS and overall (OS) or disease-specific (DSS) survival for cancer of HR+/HER2−, HER2+, and triple-negative (TNBC) subtypes. Within-group one-third percentile value of the 4-gene score was used to divide patients into low and high classes.
Figure 3Gene expression enrichment in tumors of high or low 4-gene scores in TCGA (n = 1093) and METABRIC (n = 1903) cohorts. Enrichment plots along with normalized enrichment scores (NES) and false discovery rate (FDR) values are shown for the four and one gene sets whose expression was respectively enriched among high- (A) and low- (B) score tumors. (C) Tukey boxplots of tumor gene expression for KI67 gene (log2 transcripts per million). Boxes depict medians and inter-quartile ranges. p values were calculated with one-way ANOVA test. Within-group one-third percentile value of the 4-gene score was used to divide patients into low and high classes.
Figure 4Association of the 4-gene score with metastasis in breast cancer. The examined cohorts are indicated. (A) Boxplots show 4-gene scores of primary tumors and metastases of patients who later developed metastasis to only bone, or to sites that included brain or lung, or did not have any metastasis. All boxplots are of Tukey type and the boxes depict medians and inter-quartile ranges. The depicted p values were calculated with standard t tests. (B) The Kaplan–Meier survival plots depict metastasis-free survival for metastasis to bone, brain, or lung based on the pre-metastasis primary tumor’s 4-gene score for three cohorts. For each plot, within-cohort one-third percentile value of the score was used to classify tumors into low and high groups.
Figure 5Relationships between response to therapy and the gene score. The examined cohorts are indicated. (A) Boxplots show the scores of pre- and post-neoadjuvant chemotherapy tumors of GSE28844 cohorts, and of pre- and post-therapy tumors of GSE33658 cohort in which neoadjuvant hormonal therapy was used. For GSE28844, response categories are as per Miller–Payne criteria. RECIST criteria are used for GSE33658 (PD, partial disease; SD, stable disease; PR, partial response; CR, complete response). Standard t tests were used to determine p values. All boxplots are of Tukey type and the boxes depict medians and inter-quartile ranges. (B) Barplots show the pathologic complete response (pCR) rates in high- and low-score groups of GSE25066 (n = 508) and GSE23988 (n = 61). Fisher’s exact test was used to determine p values. The one-third percentile value of the 4-gene score was used to classify tumors into low and high groups. (C) Comparisons of gene expression (log2 transcripts per million) of immune response genes in tumors of high or low 4-gene scores in both TCGA and METABRIC cohorts. Boxplots of expression of immune response genes in tumors with high and low 4-gene scores. The one-third percentile value of the 4-gene score was used to classify tumors into low and high groups. All boxplots are of Tukey type and the boxes depict medians and inter-quartile ranges. The depicted p values were calculated with standard t tests. CTLA4, cytotoxic T-lymphocyte-associated protein 4; IDO1/2, indoleamine dioxygenase 1/2; LAG3, lymphocyte activation gene 3; PD-1, programmed death-1; PD-L1, (programmed death ligand 1).