| Literature DB >> 29713003 |
Zhengyan Kan1, Ying Ding2, Jinho Kim3, Hae Hyun Jung4, Woosung Chung3, Samir Lal2, Soonweng Cho2, Julio Fernandez-Banet2, Se Kyung Lee5, Seok Won Kim5, Jeong Eon Lee5, Yoon-La Choi6, Shibing Deng2, Ji-Yeon Kim7, Jin Seok Ahn7, Ying Sha2, Xinmeng Jasmine Mu2, Jae-Yong Nam3, Young-Hyuck Im4,7, Soohyeon Lee8, Woong-Yang Park3, Seok Jin Nam9, Yeon Hee Park10,11.
Abstract
Breast cancer (BC) in the Asia Pacific regions is enriched in younger patients and rapidly rising in incidence yet its molecular bases remain poorly characterized. Here we analyze the whole exomes and transcriptomes of 187 primary tumors from a Korean BC cohort (SMC) enriched in pre-menopausal patients and perform systematic comparison with a primarily Caucasian and post-menopausal BC cohort (TCGA). SMC harbors higher proportions of HER2+ and Luminal B subtypes, lower proportion of Luminal A with decreased ESR1 expression compared to TCGA. We also observe increased mutation prevalence affecting BRCA1, BRCA2, and TP53 in SMC with an enrichment of a mutation signature linked to homologous recombination repair deficiency in TNBC. Finally, virtual microdissection and multivariate analyses reveal that Korean BC status is independently associated with increased TIL and decreased TGF-β signaling expression signatures, suggesting that younger Asian BCs harbor more immune-active microenvironment than western BCs.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29713003 PMCID: PMC5928087 DOI: 10.1038/s41467-018-04129-4
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Age and molecular subtype distribution in SMC and TCGA. a Bar charts comparing the distribution of patient age between SMC and TCGA colored by menopausal status. Stacked bar charts comparing molecular subtype distribution across SMC and TCGA age groups based on three classifications: Consensus (b), IHC (c), and PAM50 (d)
Clinical data summary for SMC and TCGA
| SMC | TCGA | Statistical significance | |
|---|---|---|---|
| Subjects ( | 187 | 1,116 | |
| Patient age (yr) | 39.3 ± 8.5 | 58.3 ± 13.2 | |
| Tumor purity (%) | 71 ± 17.8 | 77.7 ± 10.7 | |
| Menopausal status ( | |||
| Pre-menopausal | 165 (88.2%) | 232 (20.8%) | |
| Post-menopausal | 19 (10.2%) | 715 (64.1%) | |
| Peri-menopausal | 41 (3.7%) | ||
| N/A | 128 (11.5%) | ||
| Clinical subtype ( | |||
| ER+ | 103 (55.1%) | 480 (43.0%) | |
| ER+/HER2+ | 27 (14.4%) | 116 (10.4%) | |
| HER2+ | 15 (8%) | 32 (2.9%) | |
| TNBC | 37 (19.8%) | 121 (10.8%) | |
| N/A | 5 (2.7%) | 367 (32.9%) | |
| TNM stage (n(%)) | |||
| I | 27 (14.4%) | 177 (15.9%) | |
| II | 101 (54%) | 605 (54.2%) | |
| III | 58 (31%) | 237 (21.2%) | |
| V | 1 (0.5%) | 18 (1.6%) | |
| N/A | 79 (7.1%) | ||
| Histology subtype ( | |||
| Lobular carcinoma | 7 (3.7%) | 193 (17.3%) | |
| Ductal Carcinoma | 172 (92.0%) | 830 (74.4%) | |
| Others | 8 (4.3%) | 37 (3.3%) | |
| N/A | 56 (5.0%) | ||
| Race ( | |||
| Asian | 187 (100%) | 57 (5.1%) | |
| Black | 158 (14.2%) | ||
| White | 745 (66.8%) | ||
| N/A | 156 (14%) |
For SMC vs. TCGA comparisons of continuous and categorical variables, p values were calculated using Student’s t test and Fisher’s exact test respectively
Significantly mutated genes in SMC
| SMC | TCGA | SMC vs. TCGA | |||||
|---|---|---|---|---|---|---|---|
| Gene symbol | Gene description | Mut Freq ( | Mut Freq ( | ||||
|
| tumor protein p53 | 89 (47.9%) | 0 | 0 | 320 (32.0%) | 0 | 2.89e-04 |
|
| phosphatidylinositol-4,5-bisphosphate 3-kinase, catalytic subunit alpha | 53 (28.5%) | 0 | 0 | 320 (32.0%) | 2.09e-12 | 0.81 |
|
| GATA binding protein 3 | 23 (12.4%) | 0 | 0 | 91 (9.1%) | 0 | 0.49 |
|
| core-binding factor, beta subunit | 5 (2.7%) | 0 | 0 | 21 (2.1%) | 7.85e-07 | 0.83 |
|
| phosphatase and tensin homolog | 6 (3.2%) | 6.40e-06 | 0.015 | 44 (4.4%) | 0 | 0.83 |
|
| Cadherin-1 | 4 (2.2%) | 3.64e-05 | 0.076 | 115 (11.5%) | 8.64e-13 | 2.46e-04 |
SMC has 186 WES samples and 167 tumor samples with both WES and RNA-Seq data. TCGA has 1002 WES samples. Mut Freq: number of mutated samples (% of mutated samples). p value, q value: mutation significance reported by MutSigCV. SMC vs. TCGA: Mutation prevalence were compared between SMC and TCGA by Fisher’s exact test to determine the p value
Fig. 2Landscape of somatic alterations in SMC and TCGA. a Heatmap showing the integrated statuses of protein-altering somatic mutations and copy number alterations affecting the most frequently altered genes (rows) and samples (columns) from SMC and TCGA. Cell colors represent different types of genomic alterations—Mut (mutation), Amp (amplification), Del (deletion), and Multi (multiple alterations). Genes are sorted by the prevalence of alterations (≥3 samples) in descending order. The bar chart above represents the sample-level count of protein-altering somatic mutations. The unstacked bar chart to the right compares the prevalence of somatic alterations in SMC and TCGA for individual genes. Column color labels represent age-based group (Cohort), intrinsic molecular subtype classifications (Consensus), and menopausal statuses for all samples. b Comparison of TP53 mutation frequencies between SMC and TCGA overall and within individual subtypes. Lobular carcinoma cases were excluded from frequency calculations. c Distribution of TP53 mutation and ERBB2 amplification statuses across PAM50 subtypes in SMC and TCGA. TP53: TP53 mutated, ERBB2: ERBB2 amplified (CN ≥ 6), ERBB2 & TP53: TP53 mutated and ERBB2 amplified, WT: wild-type TP53 and ERBB2
Fig. 3Landscape of mutation signatures in SMC and TCGA. a Distribution of major mutation signatures across subtypes in SMC and TCGA. b Heatmap showing scores of major mutation signatures in SMC samples grouped by molecular subtypes. Sample-level mutation counts, age group, and menopausal status are illustrated by colored column labels. c Violin plots comparing signature 3 scores in SMC and TCGA samples binned by BRCA1/BRCA2 mutation and TP53 mutation statuses. Horizontal line indicates the median value. Violin plots of APOBEC signatures S2 (d) and S13 (e) vs. ERBB2 amplification status (amp: CN ≥ 6; WT: CN < 6) in SMC and TCGA; f Violin plots of HRD signature S3 vs. consensus molecular subtypes of SMC and TCGA. Student’s t test: ***p < 0.001; **p < 0.01; *p < 0.05
Fig. 4Tumor virtual microdissection identified distinct tumor intrinsic and microenvironment factors. a Distribution of NMF factors representing TME or tumor intrinsic compartment across subtypes of SMC and TCGA. b Heatmap showing the mean sample factor weight for 14 NMF factors, z-normalized by rows, in different cohorts. Samples of normal tissue origins were clustered to the left and samples of tumor origins were clustered to the right. c Plot of CYT score vs. F9 factor weight with color gradient representing CD8A gene expression. d Heatmap of Pearson correlation coefficients between Bindea immune expression signature scores (GSVA) and NMF factors
Fig. 5Distribution and multivariate analyses of distinctive molecular features. Boxplots showing the distributions of TIL factor F9 weights (a), PD-L1 gene expression (b), TGF-β expression signature score (c) and TGFB1 gene expression (d) across molecular subtypes of SMC and TCGA. TPM transcript per million. pvalue was determined by Student’s t test: ***p < 0.001; **p < 0.01; *p < 0.05; NS: p ≥ 0.05. The box is bounded by the first and third quartile with a horizontal line at the median and whiskers extend to 1.5 times IQR. e Heatmap showing the statistical significance of associations (−log10p) between distinctive molecular features (rows) and key clinical features (columns) in the combined cohort. Molecular features were ranked by variable usage frequencies in descending order. Clinical features include two continuous variables—Patient age and Tumor purity and five discrete variables—Cohort status: SMC or TCGA; Subtype: ER positive or ER negative; Tumor stage: early (1−2) or late (3−4); Menopausal status: pre or post; Histology subtype: Lobular carcinoma or Invasive ductal carcinoma. Molecular features include S3: HRD-related mutation signature 3; F7: NMF factor 7 associated with ER+ subtype; F9: NMF factor 9 associated with TILs; ESR1-Exp: ESR1 gene expression; HER2-Amp: ERBB2 amplification status; BRCA-Mut: BRCA1/BRCA2 germline pathogenic mutation status; TP53-Mut: TP53 somatic mutation status; TGFB-Sig: TGF-β pathway expression signature