| Literature DB >> 35047861 |
Bin Zhu1,3, Lijin Joo1, Tongwu Zhang1, Hela Koka1, DongHyuk Lee1, Jianxin Shi1, Priscilla Lee2, Difei Wang1,3, Feng Wang2, Wing-Cheong Chan4, Sze Hong Law4,5, Yee-Kei Tsoi4, Gary M Tse6, Shui Wun Lai7, Cherry Wu7, Shuyuan Yang2, Emily Ying Yang Chan2, Samuel Yeung Shan Wong2, Mingyi Wang1,3, Lei Song1,3, Kristine Jones1,3, Bin Zhu1,3, Amy Hutchinson1,3, Belynda Hicks1,3, Ludmila Prokunina-Olsson1, Montserrat Garcia-Closas1, Stephen Chanock1, Lap Ah Tse2, Xiaohong R Yang1.
Abstract
Recent genomic studies suggest that Asian breast cancer (BC) may have distinct somatic features; however, most comparisons of BC genomic features across populations did not account for differences in age, subtype, and sequencing methods. In this study, we analyzed whole-exome sequencing (WES) data to characterize somatic copy number alterations (SCNAs) and mutation profiles in 98 Hong Kong BC (HKBC) patients and compared with those from The Cancer Genome Atlas of European ancestry (TCGA-EA, N = 686), which had similar distributions of age at diagnosis and PAM50 subtypes as in HKBC. We developed a two-sample Poisson model to compare driver gene selection pressure, which reflects the effect sizes of cancer driver genes, while accounting for differences in sample size, sequencing platforms, depths, and mutation calling methods. We found that somatic mutation and SCNA profiles were overall very similar between HKBC and TCGA-EA. The selection pressure for small insertions and deletions (indels) in GATA3 (false discovery rate (FDR) corrected p < 0.01) and single-nucleotide variants (SNVs) in TP53 (nominal p = 0.02, FDR corrected p = 0.28) was lower in HKBC than in TCGA-EA. Among the 13 signatures of single-base substitutions (SBS) that are common in BC, we found a suggestively higher contribution of SBS18 and a lower contribution of SBS1 in HKBC than in TCGA-EA, while the two APOBEC-induced signatures showed similar prevalence. Our results suggest that the genomic landscape of BC was largely very similar between HKBC and TCGA-EA, despite suggestive differences in some driver genes and mutational signatures that warrant future investigations in large and diverse Asian populations.Entities:
Keywords: Asian; breast cancer; cancer genomics; driver gene selection pressure; mutational signatures
Year: 2021 PMID: 35047861 PMCID: PMC8756551 DOI: 10.1016/j.xhgg.2021.100076
Source DB: PubMed Journal: HGG Adv ISSN: 2666-2477
Figure 1BC driver gene landscape in the HKBC study
(A) Genomic driver landscape of BC in HKBC. Top bar graph shows the number of NS mutations per tumor. The middle gene panel reports NS mutations in 23 known BC driver genes that were reported previously for the TCGA BC (BRCA) study. Bottom panels show age groups and PAM50 subtypes.
(B) Frequencies of mutations in 23 known BC driver genes in HKBC and TCGA-EA (European ancestry) studies.
Figure 2Comparisons of selection pressure of BC driver genes between HKBC and TCGA-EA studies
(A) Selection pressure of SNVs; (B) selection pressure of small indels. Genes without mutations detected in either HKBC or TCGA-EA samples are not shown. The bar represents the 95% confidence interval of the estimate of selection pressure.
Selection pressure for 23 BC driver genes in HKBC and TCGA-EA studies
| TP53 | 2 | 14 | 8 | 94.8 (56, 106) | 642.5 (321, 1,285) | 0 | 175 | 41 | 167.0 (144, 194) | 864.6 (651, 1,201) | 0.03∗ | 0.28 | 0.43 | 0.70 |
| PIK3CA | 0 | 36 | 1 | 100.9 (73, 140) | 59.1 (8, 419) | 4 | 256 | 12 | 97.9 (87, 111) | 54.2 (31, 96) | 0.87 | 0.93 | 0.94 | 0.98 |
| CDH1 | 0 | 0 | 4 | 0 (NA) | 138.6 (52, 369) | 3 | 41 | 52 | 18.4 (14, 25) | 340.5 (259, 447) | <0.01∗ | 0.03∗ | 0.05∗ | 0.37 |
| MAP3K1 | 0 | 1 | 3 | 1.9 (0.3, 132) | 104.7 (34, 325) | 3 | 29 | 51 | 7.5 (5, 11) | 165.0 (125, 217) | 0.08 | 0.35 | 0.41 | 0.70 |
| PTEN | 0 | 5 | 4 | 36.5 (15.88) | 413.0 (155, 1,100) | 0 | 18 | 22 | 18.1 (11, 29) | 260.8 (172, 396) | 0.19 | 0.45 | 0.42 | 0.70 |
| MAP2K4 | 0 | 0 | 1 | 0 (NA) | 101.0 (14, 717) | 1 | 17 | 13 | 16.7 (10, 27) | 146.7 (87, 257) | 0.04∗ | 0.28 | 0.70 | 0.96 |
| FOXA1 | 0 | 2 | 2 | 10.8 (3, 43) | 160.0 (40, 640) | 1 | 18 | 7 | 13.8 (9, 33) | 139.3 (87, 257) | 0.73 | 0.93 | 0.86 | 0.96 |
| KMT2C | 1 | 7 | 6 | 4.1 (2, 9) | 28.4 (13, 63) | 0 | 26 | 21 | 2.1 (2, 3) | 24.5 (16, 38) | 0.14 | 0.44 | 0.75 | 0.96 |
| GATA3 | 0 | 1 | 4 | 6.3 (0.9, 45) | 157.0 (59, 418) | 0 | 9 | 75 | 8.0 (4, 16) | 998.3 (796, 1,252) | 0.81 | 0.93 | <0.01∗ | <0.01∗ |
| RUNX1 | 0 | 1 | 1 | 5.2 (0.7, 37) | 76.7 (11, 544) | 1 | 7 | 16 | 5.2 (3, 11) | 188.2 (115, 307) | 0.99 | 0.99 | 0.33 | 0.68 |
| TBX3 | 0 | 0 | 0 | 0 (NA) | 0 (NA) | 0 | 7 | 16 | 3.5 (2, 7) | 151.2 (93, 247) | 0.17 | 0.44 | 0.02∗ | 0.22 |
| PIK3R1 | 0 | 0 | 1 | 0 (NA) | 45.2 (6, 321) | 0 | 6 | 14 | 3.2 (2, 7) | 83.8 (50, 142) | 0.21 | 0.45 | 0.51 | 0.78 |
| RB1 | 0 | 2 | 1 | 6.1 (2, 24) | 49.1 (7, 349) | 1 | 12 | 6 | 5.0 (3, 9) | 28.0 (11, 75) | 0.80 | 0.93 | 0.64 | 0.96 |
| CBFB | 0 | 2 | 0 | 25.2 (6, 101) | 0 (NA) | 1 | 12 | 4 | 20.8 (12, 37) | 123.5 (46, 329) | 0.81 | 0.93 | 0.32 | 0.68 |
| NCOR1 | 0 | 1 | 0 | 1.1 (0.2, 8) | 0 (NA) | 2 | 24 | 9 | 3.8 (3, 6) | 20.5 (11, 75) | 0.15 | 0.44 | 0.10 | 0.37 |
| AKT1 | 0 | 2 | 0 | 10.6 (4, 42) | 0 (NA) | 0 | 18 | 0 | 13.4 (8, 21) | 0 (NA) | 0.75 | 0.93 | 1.00 | 1.00 |
| GPS2 | 0 | 0 | 0 | 0 (NA) | 0 (NA) | 0 | 2 | 7 | 2.5 (0.6, 10) | 80.8 (39, 170) | 0.46 | 0.82 | 0.23 | 0.68 |
| ARID1A | 0 | 1 | 2 | 1.2 (0.2, 9) | 24.3 (6, 97) | 0 | 9 | 3 | 1.6 (0.8, 3) | 5.8 (2, 18) | 0.80 | 0.93 | 0.15 | 0.56 |
| CTCF | 0 | 1 | 0 | 3.8 (0.5, 27) | 0 (NA) | 2 | 10 | 4 | 5.2 (3, 10) | 35.8 (13, 95) | 0.75 | 0.93 | 0.32 | 0.68 |
| CDKN1B | 0 | 0 | 2 | 0 (NA) | 518.3 (130, 2,072) | 1 | 3 | 5 | 5.6 (2, 17) | 84.9 (35, 204) | 0.37 | 0.72 | 0.06 | 0.37 |
| NF1 | 0 | 2 | 1 | 2.2 (0.5, 9) | 10.6 (2, 76) | 1 | 13 | 5 | 1.9 (1, 3) | 8.9 (4, 21) | 0.89 | 0.93 | 0.87 | 0.96 |
| BRCA1 | 0 | 0 | 1 | 0 (NA) | 16.5 (2, 117) | 0 | 11 | 4 | 2.5 (1, 5) | 11.2 (4, 30) | 0.09 | 0.35 | 0.73 | 0.96 |
| ERBB2 | 0 | 0 | 1 | 0 (NA) | 29.2 (4, 208) | 0 | 11 | 1 | 3.6 (2, 7) | 6.1 (1, 43) | 0.09 | 0.35 | 0.28 | 0.68 |
The background mutation rates were estimated as , assuming the mutation counts following a Poisson distribution.
: the number of synonymous mutations; : the number of NS mutations; : the number of indels.
: selection pressure among indels in cancer genes (relative to indel in non-cancer genes); : selection pressure among NS mutations (relative to synonymous mutations); : selection pressure among indels in cancer genes (relative to indel in non-cancer genes).
95% confidence intervals for and in brackets.
p value from two sample Poisson likelihood ratio test for point mutations; p value from two sample Poisson likelihood ratio test for indels; q value for two sample Poisson likelihood ratio tests for point mutations; q value for two sample Poisson likelihood ratio tests for indels.
A star mark (∗) indicates a test is significant at level 0.05.
Figure 3SBS mutational spectrum and prevalence of SBS signatures in HKBC
(A and B) Mutational spectrum in HKBC (A) and TCGA-EA studies (B).
(C) The contributions of COSMIC SBS signatures for each patient in HKBC.
Figure 4Comparisons of SCNAs between HKBC and TCGA-EA studies
(A) Main clones (left: HKBC, right: TCGA-EA). Each panel shows the frequency of copy number gain, loss, and copy number neutral LOH across the samples in each study.
(B) Subclones (left: HKBC, right: TCGA-EA).