| Literature DB >> 30733547 |
Guillermo Prado-Vázquez1,2, Angelo Gámez-Pozo1,2, Lucía Trilla-Fuertes2, Jorge M Arevalillo3, Andrea Zapater-Moros1,2, María Ferrer-Gómez1, Mariana Díaz-Almirón4, Rocío López-Vacas1, Hilario Navarro3, Paloma Maín5, Jaime Feliú6,7, Pilar Zamora6, Enrique Espinosa6,7, Juan Ángel Fresno Vara8,9.
Abstract
Triple-negative breast cancer is a heterogeneous disease characterized by a lack of hormonal receptors and HER2 overexpression. It is the only breast cancer subgroup that does not benefit from targeted therapies, and its prognosis is poor. Several studies have developed specific molecular classifications for triple-negative breast cancer. However, these molecular subtypes have had little impact in the clinical setting. Gene expression data and clinical information from 494 triple-negative breast tumors were obtained from public databases. First, a probabilistic graphical model approach to associate gene expression profiles was performed. Then, sparse k-means was used to establish a new molecular classification. Results were then verified in a second database including 153 triple-negative breast tumors treated with neoadjuvant chemotherapy. Clinical and gene expression data from 494 triple-negative breast tumors were analyzed. Tumors in the dataset were divided into four subgroups (luminal-androgen receptor expressing, basal, claudin-low and claudin-high), using the cancer stem cell hypothesis as reference. These four subgroups were defined and characterized through hierarchical clustering and probabilistic graphical models and compared with previously defined classifications. In addition, two subgroups related to immune activity were defined. This immune activity showed prognostic value in the whole cohort and in the luminal subgroup. The claudin-high subgroup showed poor response to neoadjuvant chemotherapy. Through a novel analytical approach we proved that there are at least two independent sources of biological information: cellular and immune. Thus, we developed two different and overlapping triple-negative breast cancer classifications and showed that the luminal immune-positive subgroup had better prognoses than the luminal immune-negative. Finally, this work paves the way for using the defined classifications as predictive features in the neoadjuvant scenario.Entities:
Mesh:
Year: 2019 PMID: 30733547 PMCID: PMC6367406 DOI: 10.1038/s41598-018-38364-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Clinical features of the main and neoadjuvant datasets.
| Main Dataset | Neoadjuvant dataset | p-value | |
|---|---|---|---|
| Number of patients | 494 | 153 | |
|
| |||
| T1 | 99 (20%) | 9 (6%) | <0.0001 |
| >T1 | 276 (56%) | 144 (94%) | |
| NA | 119 (24%) | ||
|
| |||
| G1&2 | 103 (21%) | 16 (10%) | 0.0001 |
| G3 | 280 (57%) | 124 (81%) | |
| NA | 111 (22%) | ||
|
| |||
| N0 | 251 (51%) | 37 (24%) | <0.0001 |
| N1 | 68 (14%) | 116 (76%) | |
| NA | 175 (35%) | ||
|
| |||
| No | 257 (52%) | ||
| Yes | 71 (14%) | ||
| NA | 166 (34%) | ||
|
| |||
| RD | 95 (62%) | ||
| pCR | 53 (34%) | ||
Size data is divided into T1 (<2 cm) and >T1 (>2 cm) tumors; grade is classified as G1&2 (well or moderately differentiated tumors) or G3 (poorly differentiated tumors); lymph node status represents lymph node invasion (N0: no invasion; N1: invasion or metastasis); and the adjuvant chemotherapy column comprises patients who had been treated with adjuvant chemotherapy or not. The pathological response column stands for the response to neoadjuvant treatment (RD: residual disease; pCR: pathological complete response). The chi-squared test confirmed that both cohorts are different regarding clinical parameters and treatment.
Figure 1PGM resulting network; each functional node is encoded from 0 to 26. Each box (node) represents one gene, and lines (edges) connect genes with related expression. Functional nodes are represented by the same color, and metanodes are presented the same color palette, with basal nodes in red, luminal nodes in blue and immune nodes in green.
Figure 2PGM represents the resulting network in which each functional node is encoded from 0 to 26, each box (node) represents one gene and lines (edges) connect genes with related expression. Genes from Rody’s metagenes are represented by different colors.
Figure 3Workflow from the sparse k-means groups in each metanode to the final cellular classification.
Number of tumors classified in each metanode sparse k-means group and in the cellular classification.
| Luminal | N | Basal | N | CLDN | Tumors | % of total | Cellular | N |
|---|---|---|---|---|---|---|---|---|
| — | 403 (82%) | — | 93 (23%) | High | 40 (43%) | 8% | CLDN-High | 40 (8%) |
| Low | 53 (57%) | 11% | CLDN-Low | 53 (11%) | ||||
| + | 310 (77%) | High | 245 (79%) | 50% | Basal | 310 (63%) | ||
| Low | 65 (21%) | 13% | ||||||
| + | 91 (18%) | — | 84 (92%) | High | 79 (94%) | 16% | LAR | 91 (18%) |
| Low | 5 (6%) | 1% | ||||||
| + | 7 (8%) | High | 7 (100%) | 1% | ||||
| Low | 0 | 0% |
Number of tumors with clinical characteristics.
| Cellular Classification | Tumor size | Grade | Nodal | ||||||
|---|---|---|---|---|---|---|---|---|---|
| T1 | >T1 | p-value | G1 or G2 | G3 | p-value | N0 | N1 | p-value | |
| Basal | 76 (32%) | 163 (68%) | 0.169 | 45 (18%) | 199 (82%) | 0.015 | 168 (83%) | 35 (17%) | 0.262 |
| CLDN-High | 2 (7%) | 27 (93%) | 0.023 | 5 (14%) | 31 (86%) | 0.110 | 19 (83%) | 4 (17%) | 0.795 |
| CLDN-Low | 10 (24%) | 32 (76%) | 0.853 | 20 (49%) | 21 (51%) | 0.005 | 28 (72%) | 11 (28%) | 0.313 |
| LAR | 11 (17%) | 54 (83%) | 0.121 | 33 (53%) | 29 (47%) | <0.001 | 36 (67%) | 18 (33%) | 0.056 |
| Total | 99 (26%) | 276 (74%) | — | 103 (27%) | 280 (73%) | — | 251 (79%) | 68 (21%) | — |
T1: tumor smaller than 2 cm; >T1: tumor larger than 2 cm; G3: grade 3; G1 or G2: grade 1 or grade 2; Nodal (N0): no node infiltration; N1: node infiltration. % is calculated using the total amount of a row for each clinical characteristic. Fisher exact test were performed between each group of the cellular classification and the total population (significant p-value = 0.05).
Immune characteristic interaction with cellular classification. According to the chi-squared test, IM characteristics and cellular classification are dependent.
| IM negative | IM positive | ||||
|---|---|---|---|---|---|
| Cellular Classification | Tumors | % | Cellular Classification | Tumors | % |
| Basal | 159 | 68% | Basal | 151 | 58% |
| CLDN-Low | 23 | 10% | CLDN-Low | 30 | 12% |
| LAR | 42 | 18% | LAR | 49 | 19% |
| CLDN-High | 11 | 5% | CLDN-High | 29 | 11% |
Figure 4Kaplan-Meier survival curves represent the survival rate of immune-positive and immune-negative tumors in the whole cohort (A) and in the four cellular subgroups (B).
Figure 5Various molecular classifications compared with the cellular classification. From top to bottom, cellular, PAM50 + CLDN-low, Lehmann 2016 TNBC4 type, immune and Burstein’s classifications are presented.
Shows comparisons between Cellular classification and PAM50, Lehmann’s and Burstein’s classifications.
| Basal | CLDN-High | CLDN-Low | LAR | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Burstein | N | % | Burstein | N | % | Burstein | N | % | Burstein | N | % |
| BLIA | 104 | 34% | BLIA | 23 | 58% | BLIA | 11 | 21% | BLIA | 2 | 2% |
| BLIS | 149 | 48% | BLIS | 3 | 1% | BLIS | 3 | 6% | BLIS | 1 | 1% |
| LAR | 4 | 1% | LAR | 4 | 1% | LAR | 3 | 6% | LAR | 76 | 84% |
| MES | 53 | 17% | MES | 10 | 25% | MES | 36 | 68% | MES | 12 | 13% |
|
|
|
|
|
|
|
|
|
|
|
|
|
| Basal | 125 | 40% | Basal | 13 | 33% | Basal | 5 | 9% | Basal | 0 | 0% |
| CLDN-Low | 76 | 25% | CLDN-Low | 9 | 23% | CLDN-Low | 44 | 83% | CLDN-Low | 13 | 14% |
| Her2 | 23 | 7% | Her2 | 6 | 15% | Her2 | 1 | 2% | Her2 | 8 | 9% |
| LumA | 25 | 8% | LumA | 7 | 18% | LumA | 1 | 2% | LumA | 52 | 57% |
| LumB | 27 | 9% | LumB | 4 | 10% | LumB | 4 | 4% | LumB | 16 | 18% |
| Normal | 34 | 11% | Normal | 1 | 3% | Normal | 0 | 0% | Normal | 2 | 2% |
|
|
|
|
|
|
|
|
|
|
|
|
|
| BL1 | 57 | 18% | BL1 | 8 | 20% | BL1 | 1 | 2% | BL1 | 0 | 0% |
| BL2 | 81 | 26% | BL2 | 29 | 73% | BL2 | 47 | 89% | BL2 | 28 | 31% |
| LAR | 3 | 1% | LAR | 0 | 0% | LAR | 1 | 2% | LAR | 52 | 57% |
| M | 169 | 55% | M | 3 | 8% | M | 4 | 8% | M | 11 | 12% |
Figure 6Kaplan-Meier survival curves represent the survival rate of immune-positive and immune-negative tumors in the TNBC4-type subgroups.
Shows immune characteristics in the PAM50+CLDN-low subgroups.
| PAM50 + CLND-low | IM− | IM+ |
|---|---|---|
| Basal | 69 (48%) | 74 (52%) |
| CLDN-low | 62 (44%) | 80 (56%) |
| Her2 | 10 (26%) | 28 (74%) |
| LumA | 43 (51%) | 42 (49%) |
| LumB | 27 (55%) | 22 (57%) |
| Normal | 24 (65%) | 13 (35%) |
Figure 7Kaplan-Meier survival curves represent the survival rate of immune-positive and immune-negative tumors in the PAM50 + CLDN-low subgroups.
Figure 8Kaplan-Meier survival curves represent the survival rate of immune-positive and immune-negative tumors in the Burstein’s subgroups.
Shows the cellular classification and the immune characteristic in the neoadjuvant dataset.
| Cellular Classification | Number | IM Characteristic | Number | %Intragroup |
|---|---|---|---|---|
| Basal | 79 (52%) | IM− | 41 | 52% |
| IM+ | 38 | 48% | ||
| CLDN-High | 8 (5%) | IM− | 2 | 25% |
| IM+ | 6 | 75% | ||
| CLDN-Low | 19 (12%) | IM− | 12 | 63% |
| IM+ | 7 | 37% | ||
| LAR | 47 (31%) | IM− | 25 | 53% |
| IM+ | 22 | 47% |
Figure 9Kaplan–Meier survival curves represent the distant relapse-free survival rate of the cellular and the TNBC4-type subgroups in the GSE25066 series.