| Literature DB >> 30728399 |
Pushpinder Kaur1,2, Tania B Porras1,2, Alexander Ring1,2, John D Carpten2,3, Julie E Lang4,5.
Abstract
Whole exome sequencing (WES), targeted gene panel sequencing and single nucleotide polymorphism (SNP) arrays are increasingly used for the identification of actionable alterations that are critical to cancer care. Here, we compared The Cancer Genome Atlas (TCGA) and the Genomics Evidence Neoplasia Information Exchange (GENIE) breast cancer genomic datasets (array and next generation sequencing (NGS) data) in detecting genomic alterations in clinically relevant genes. We performed an in silico analysis to determine the concordance in the frequencies of actionable mutations and copy number alterations/aberrations (CNAs) in the two most common breast cancer histologies, invasive lobular and invasive ductal carcinoma. We found that targeted sequencing identified a larger number of mutational hotspots and clinically significant amplifications that would have been missed by WES and SNP arrays in many actionable genes such as PIK3CA, EGFR, AKT3, FGFR1, ERBB2, ERBB3 and ESR1. The striking differences between the number of mutational hotspots and CNAs generated from these platforms highlight a number of factors that should be considered in the interpretation of array and NGS-based genomic data for precision medicine. Targeted panel sequencing was preferable to WES to define the full spectrum of somatic mutations present in a tumor.Entities:
Mesh:
Year: 2019 PMID: 30728399 PMCID: PMC6365517 DOI: 10.1038/s41598-018-37574-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Clinicopathological features of the TCGA and GENIE cohorts.
| TCGA | GENIE | ILC (TCGA versus GENIE) p-value | IDC (TCGA versus GENIE) p-value | ILC (TCGA versus GENIE) q-value | IDC (TCGA versus GENIE) q-value | |||
|---|---|---|---|---|---|---|---|---|
| Histological type | ILC (n = 127), n(%) | IDC (n = 490), n(%)) | ILC (n = 248), n(%) | IDC (n = 1724), n(%) | ||||
|
| ||||||||
| Mean | 62.3 | 57.4 | 58.9 | 53 | 0.66 | 0.66 | 0.693 | 0.693 |
| 18–50 | 29 (22.8%) | 163 (33.3%) | 63 (25.4%) | 734 (42.6%) | 0.6141 | ****<0.0001 | 0.6448 | 0.0001 |
| 51–70 | 60 (47.2%) | 242 (49.4%) | 142 (57.3%) | 859 (49.8%) | 0.0798 | ****<0.0001 | 0.1815 | 0.0001 |
| 71–90 | 38 (29.9%) | 85 (17.3%) | 41 (16.5%) | 129 (7.5%) | **0.0033 | ****<0.0001 | 0.009 | 0.0001 |
| Not Available (NA) | 0 (0.0%) | 0 (0.0%) | 2 (0.8%) | 2 (0.1%) | 0.5509 | >0.9999 | 0.6266 | 0.4083 |
|
| ||||||||
| White | 107 (84.3%) | 344 (70.2%) | 212 (85.5%) | 1320 (76.6%) | 0.761 | ****<0.0001 | 0.7397 | 0.0001 |
| Black | 9 (7.1%) | 63 (12.9%) | 8 (3.2%) | 134 (7.8%) | 0.1147 | ****<0.0001 | 0.1957 | 0.0001 |
| Asian | 3 (2.4%) | 36 (7.3%) | 6 (2.4%) | 98 (5.7%) | >0.9999 | ****<0.0001 | 0.7583 | 0.0001 |
| Native American | 0 (0.0%) | 0 (0.0%) | 1 (0.4%) | 1 (0.1%) | >0.9999 | >0.9999 | 0.7583 | 0.4083 |
| Asian Indian or Alaska Native | 0 (0.0%) | 1 (0.2%) | 0 (0.0%) | 0 (0.0%) | >0.9999 | 0.2213 | 0.7583 | 0.1251 |
| NA | 7 (5.5%) | 46 (9.4%) | 0 (0.0%) | 0 (0.0%) | ***0.0005 | ****<0.0001 | 0.0023 | 0.0001 |
| Not Evaluated | 1 (0.8%) | 0 (0.0%) | 0 (0.0%) | 0 (0.0%) | 0.3387 | >0.9999 | 0.4203 | 0.4083 |
| Other | 0 (0.0%) | 0 (0.0%) | 7 (2.8%) | 48 (2.8%) | 0.1006 | 0.3594 | 0.1957 | 0.1887 |
| Unknown | 0 (0.0%) | 0 (0.0%) | 14 (5.6%) | 123 (7.1%) | **0.0033 | *0.0496 | 0.009 | 0.0304 |
|
| ||||||||
| Hispanic or Latino | 6 (4.7%) | 17 (3.5%) | 15 (6.0%) | 99 (5.7%) | 0.8129 | ***0.0001 | 0.7397 | 0.0001 |
| Not Hispanic or Latino | 106 (83.5%) | 393 (80.2%) | 191 (77.0%) | 1196 (69.4%) | 0.1787 | ****<0.0001 | 0.271 | 0.0001 |
| NA | 14 (11.0%) | 80 (16.3%) | 0 (0.0%) | 0 (0.0%) | ****<0.0001 | ****<0.0001 | 0.0007 | 0.0001 |
| Not Evaluated | 1 (0.8%) | 0 (0.0%) | 0 (0.0%) | 0 (0.0%) | 0.3387 | >0.9999 | 0.4203 | 0.4083 |
| Unknown | 0 (0.0%) | 0 (0.0%) | 42 (16.9%) | 429 (24.9%) | ****<0.0001 | ****<0.0001 | 0.0007 | 0.0001 |
|
| ||||||||
|
| ||||||||
| T1 | 21 (16.5%) | 135 (27.6%) | NA | NA | — | — | — | — |
| T2 | 59 (46.5%) | 300 (61.2%) | NA | NA | — | — | — | — |
| T3 | 46 (36.2%) | 30 (6.1%) | NA | NA | — | — | — | — |
| T4 | 1 (0.8%) | 24 (4.9%) | NA | NA | — | — | — | — |
| TX | 0 (0.0%) | 1 (0.2%) | NA | NA | — | — | — | — |
|
| ||||||||
| N0 | 54 (42.5%) | 234 (47.8%) | NA | NA | — | — | — | — |
| N1 | 38 (29.9%) | 170 (34.7%) | NA | NA | — | — | — | — |
| N2 | 13 (10.2%) | 53 (10.8%) | NA | NA | — | — | — | — |
| N3 | 21 (16.5%) | 24 (4.9%) | NA | NA | — | — | — | — |
| NX | 1 (0.8%) | 9 (1.8%) | NA | NA | — | — | — | — |
|
| ||||||||
| M0 | 98 (77.2%) | 440 (89.8%) | NA | NA | — | — | — | — |
| MX | 29 (22.8%) | 50 (10.2%) | NA | NA | — | — | — | — |
|
| ||||||||
|
| ||||||||
| ER-positive | 117 (92.1%) | 328 (66.9%) | NA | NA | — | — | — | — |
| ER-negative | 8 (6.3%) | 133 (27.1%) | NA | NA | — | — | — | — |
| Not Evaluated | 2 (1.6%) | 27 (5.5%) | NA | NA | — | — | — | — |
| Indeterminate | 0 (0.0%) | 2 (0.4%) | NA | NA | — | — | — | — |
| Equivocal | 0 (0.0%) | 0 (0.0%) | NA | NA | — | — | — | — |
|
| ||||||||
| PR-positive | 100 (78.7%) | 284 (58.0%) | NA | NA | — | — | — | — |
| PR-negative | 24 (18.9%) | 176 (35.9%) | NA | NA | — | — | — | — |
| Not Evaluated | 2 (1.6%) | 28 (5.7%) | NA | NA | — | — | — | — |
| Indeterminate | 1 (0.8%) | 2 (0.4%) | NA | NA | — | — | — | — |
| Equivocal | 0 (0.0%) | 0 (0.0%) | NA | NA | — | — | — | — |
|
| ||||||||
| HER2-positive | 9 (7.1%) | 82 (16.7%) | NA | NA | — | — | — | — |
| HER2-negative | 71 (55.9%) | 243 (49.6%) | NA | NA | — | — | — | — |
| Not Evaluated | 22 (17.9%) | 71 (14.5%) | NA | NA | — | — | — | — |
| Indeterminate | 1 (0.8%) | 5 (1.0%) | NA | NA | — | — | — | — |
| Equivocal | 24 (18.9%) | 83 (16.9%) | NA | NA | — | — | — | — |
| NA | 0 (0.0%) | 6 (1.2%) | NA | NA | — | — | — | — |
*Significant p-value.
Figure 1Overview of the genomic alterations in breast cancer patients in the TCGA and GENIE cohort (a) Bar graph maps depicting the percentage of cases with mutations obtained from WES (TCGA dataset) versus targeted gene panel (combined data of PCR and hybridization capture, GENIE dataset) approach in 40 actionable genes in ILC and IDC subtypes. (b) Bar graph maps depicting the percentage of cases having mutational hotspots obtained from WES (TCGA dataset) versus PCR and hybridization capture (GENIE dataset) in ILC and IDC subtypes (c) Bar graph maps depicting the percentage of cases with CNAs obtained from the SNP-based array (TCGA dataset) versus targeted gene panel (hybridization capture, GENIE dataset) approach in 40 actionable genes in ILC and IDC tumors. (d) Percentage of mutations in 40 actionable genes in TCGA and GENIE ILC patient samples analyzed by WES versus PCR and hybridization capture technique. PIK3CA dominated the mutational landscape in both data sets and missense mutations (i.e. nontruncating) were more prevalent than truncating and inframe mutations. The inset shows the variation in the percentages of missense, truncating and inframe mutations in the TCGA and GENIE cohort in ILC subtype. (e) Percentage of mutations in 40 actionable genes in TCGA and GENIE IDC patients. TP53 was the most commonly mutated gene in TCGA and GENIE IDC patients. The inset shows the variation in the percentages of missense, truncating and inframe mutations in the TCGA and GENIE cohort in IDC tumors. In both cohorts, missense mutations were more prevalent than truncating and inframe mutations in both ILC and IDC tumors (Kruskal-Wallis test, ****p < 0.0001).
Figure 2Differential pattern of CNAs in actionable genes in the TCGA and GENIE cohort across ILC and IDC subtypes (a) Bars depict the proportion of tumors with CNAs in potentially actionable genes altered in ILC samples. The percentage of tumors with hemizygous deletion (grey), homozygous deletion (yellow), low-level gain (blue) and high-level amplification (red) are shown. (b) Bars depict the proportion of tumors with CNAs in potentially actionable genes altered in IDC samples. The percentage of tumors with hemizygous deletion (grey), homozygous deletion (yellow), low-level gain (blue) and high-level amplification (red) are shown.
Figure 3Significant CNAs in ILC cohort in the TCGA and GENIE datasets. (a) GISTIC analysis of significant amplifications (red) determined by segmentation analysis from SNP-based array in TCGA ILC cohort. The statistical significance of the aberrations is displayed as FDR (q-values) and scores for each alteration are given at x-axis. The cut-off for significant threshold is 0.25 (green line). The y-axis indicates the chromosome positions and dotted lines indicate the centromeres. (b) GISTIC analysis of significant amplifications (red) determined by segmentation analysis from hybridization capture technique in GENIE ILC cohort. (c) The heat map represents significant amplified regions in ILC patients in the TCGA and GENIE datasets. The genes from our potential actionable gene list are given in parentheses. (d) GISTIC analysis of significant deletions (blue) determined by segmentation analysis from SNP-based array in TCGA ILC cohort. (e) GISTIC analysis of significant deletions (blue) determined by segmentation analysis from hybridization capture technique in GENIE ILC cohort. (f) The heat map represents significant deleted regions in ILC patients in the TCGA and GENIE datasets. The genes from our potential actionable gene list are given in parentheses.
Figure 4Significant CNAs in IDC cohort in the TCGA and GENIE datasets. (a) GISTIC analysis of significant amplifications (red) determined by segmentation analysis from SNP-based array in TCGA IDC cohort. The statistical significance of the aberrations is displayed as false-discovery rate (q-values) and scores for each alteration are given at x-axis. The cut-off for significant threshold is 0.25 (green line). The y-axis indicates the chromosome positions and dotted lines indicate the centromeres. (b) GISTIC analysis of significant amplifications (red) determined by segmentation analysis from hybridization capture technique in GENIE IDC cohort. (c) The heat map represents significant amplified regions in IDC patients in the TCGA and GENIE datasets. The genes from our potential actionable gene list are given in parentheses. (d) GISTIC analysis of significant deletions (blue) determined by segmentation analysis from SNP-based array in TCGA IDC cohort. (e) GISTIC analysis of significant deletions (blue) determined by segmentation analysis from hybridization capture technique in GENIE IDC cohort. (f) The heat map represents significant deleted regions in IDC patients in the TCGA and GENIE datasets. The genes from our potential actionable gene list are given in parentheses.
Figure 5Differential pattern of CNAs in actionable genes in the TCGA and GENIE cohort across NSCLC and colorectal cancer. (a) Bars depict the proportion of tumors with CNAs in potentially actionable genes altered in NSCLC samples. (b) Bars depict the proportion of tumors with CNAs in potentially actionable genes altered in colorectal cancer samples. The percentage of tumors with hemizygous deletion (grey), homozygous deletion (yellow), low-level gain (blue) and high-level amplification (red) are shown. The Fisher’s exact test was used to determine whether the frequencies of CNAs are different in potentially actionable genes between TCGA and GENIE datasets analyzed by the array and NGS-based technologies.
List of potentially breast cancer related genes.
| Genes | Foundation One | MSK-IMPACT | OncoKB | Clinical Trials | Candidate Drugs |
|---|---|---|---|---|---|
| PIK3CA | P | P | P | NCT02465060, NCT03337724, NCT01513356, NCT01337765, NCT01928459, NCT03243331 | Buparlisib, Alpelisib + Fulvestrant, Serabelisib, Copanlisib, GDC-0077, Alpelisib |
| AKT3 | P | P | — | NCT01964924, NCT02162719, NCT01226316, NCT02077569, NCT01277757, NCT02423603, NCT01980277, NCT01964924, NCT01992952 | Taselisib + Fulvestrant, Buparlisib + Fulvestrant, Taselisib, GDC-0941 |
| NF1 | P | P | P | NCT02465060 | Ipatasertib, BKM120, BEZ235, BGJ398 with BYL719, Gedatolisib |
| PIK3CB | P | P | — | NCT02465060, NCT03337724, NCT01513356, NCT01337765, NCT01928459, NCT03243331 | — |
| RPTOR | — | — | — | NCT02456857, NCT01674140, NCT00107016, NCT02465060, NCT02583542, NCT01390818, NCT01337765 | Ipatasertib, AZD5363, PF-04691502, Triciribine, CCT128930 |
| AKT1 | P | P | P | NCT01964924, NCT02162719, NCT01226316, NCT02077569, NCT01277757, NCT02423603, NCT01980277, NCT01964924, NCT01992952 | Honokiol, AT13148, TIC10 (ONC201), MK2206 |
| FBXW7 | P | P | — | — | LY2780301, GSK2141795 |
| IGF1 | P | P | — | NCT00984490, NCT02278965, NCT01479179, NCT00984490, NCT00759785, NCT01372618, NCT00897884 | — |
| GRB7 | — | — | — | NCT00513292, NCT00004067 | LTT462, Binimetinib, BVD523, Trametinib, |
| KRAS | P | P | P | NCT00894504, NCT02259114, NCT01520389, NCT01337765 | MAPK/PI3K/mTOR inhibitors, e.g., MSC1936369B |
| BRAF | — | P | P | NCT02401347, NCT03065387, NCT01363232, NCT01337765 | Everolimus, Temsirolimus |
| EGFR | P | P | P | NCT02465060, NCT01582191, NCT01934335, NCT01732276, NCT00739063, NCT02720185, NCT00820924, NCT00894504 | — |
| MAP2K1 | P | P | P | NCT02322814, NCT01160718, NCT02685657, NCT00147550, NCT01467310, NCT01337765 | Buparlisib, Alpelisib + Fulvestrant, Serabelisib, Copanlisib, GDC-0077 |
| JAK2 | P | P | P | NCT02041429, NCT02637375, NCT01929941 | Alpelisib, Taselisib + Fulvestrant, Buparlisib + Fulvestrant, Taselisib |
| ERBB2 | P | P | P | NCT02465060, NCT03065387, NCT00878709, NCT01953926, NCT00875979 | GDC-0941, Ipatasertib, BKM120, BEZ235, BGJ398 with BYL719, Gedatolisib |
| ERBB3 | — | P | — | NCT03065387, NCT00073528, NCT02980341, NCT02297698, NCT01918254, NCT03321981, NCT00073528 | — |
| CCND1 | P | P | — | NCT02936206, NCT03304080, NCT01740427, NCT02187783, NCT01037790 | Everolimus, AZD8055, Becacizumab, Voxtalisib, PP242 |
| CDKN2A | P | P | P | NCT01740427 | OSI-027, Apitolisib, Gedatolisib (PKI-587), Sapanisertib |
| CDKN2B | P | P | — | NCT01740427 | AZD6244, SAR245409, BEZ235 |
| CCND3 | P | P | — | NCT02187783 | — |
| CCND2 | P | P | — | NCT01037790, NCT00334542, NCT02187783 | Ipatasertib, AZD5363, PF-04691502, Triciribine, CCT128930 |
| CCNE1 | P | P | — | NCT03184090 | Honokiol, AT13148, TIC10 (ONC201), MK2206, LY2780301 |
| CDK6 | P | P | — | NCT03184090 | GSK2141795 |
| CDK4 | P | P | P | NCT03184090 | — |
| TP53 | P | P | — | NCT00044993, NCT00004038, NCT01386502, NCT00496860 | — |
| RB1 | P | P | — | NCT02599363, NCT03130439, NCT03007979 | Tivozanib, AMG 479, Metformin, MK-0646, Pasireotide, Ganitumab |
| NOTCH4 | — | — | — | NCT00645333, NCT01372579 | G7–18NATE, NVP-AEW541, BMS-536924, BMS-536924, Dovitinib |
| NOTCH1 | P | P | — | NCT02299635, NCT01208441, NCT00645333, NCT01372579, NCT00106145, NCT01151449, NCT01071564 | Cobimetinib, Trametinib, AZD6244, MSC1936369B |
| ALDH1A1 | — | — | — | NCT01190345, NCT01424865, NCT00949013, NCT01688609, NCT02001974, NCT01372579 | Selumetinib, PD-325901, GSK1120212, MEK162 |
| MET | P | P | P | NCT02465060, NCT03316586, NCT01837602, NCT01575522, NCT01138384 | — |
| FGFR1 | P | P | — | NCT01283945 | Cobimetinib, Vemurafenib, Dabrafenib,Trametinib |
| FGFR2 | P | P | — | NCT01283945 | BKM120 Plus MEK162, BEZ235 Plus MEK162 |
| WNT1 | — | — | — | NCT03243331, NCT01351103 | ″ |
| ATM | P | P | P | NCT02401347, NCT03344965 | Afatinib, Erlotinib, Gefitinib, Osimertinib, Vandetanib, Dasatinib, Lapatinib, Panitumumab |
| PALB2 | P | P | — | NCT02401347, NCT03344965 | Cobimetinib, Trametinib, AZD6244, MSC1936369B |
| BRCA1 | P | P | P | NCT02163694, NCT01506609, NCT02032823, NCT03205761, NCT02681562, NCT03150576, NCT02826512, NCT01905592 | Selumetinib, PD-325901, GSK1120212, MEK162 |
| BRCA2 | P | P | P | NCT02163694, NCT01506609, NCT02032823, NCT03205761, NCT02681562, NCT03150576, | — |
| BARD1 | P | P | — | NCT02826512, NCT01905592 | Ruxolitinib, Ganetespib, INCB047986 |
| GATA3 | P | P | — | NCT00897065 | Ado-trastuzumab emtansine, Lapatinib, Trastuzumab, Pertuzumab, Neratinib |
| IL4 | — | — | — | NCT00039052 | Neratinib, GW572016, U3–1402, HER2 vaccine nelipepimut-S |
| TGFB1 | — | — | — | NCT00821964, NCT02538471 | Lumretuzumab, MCLA-128, NCT02912949 |
| IL6 | — | — | — | NCT03135171, NCT02041429 | — |
| IL15 | — | — | — | NCT03175666, NCT03127098 | Ribociclib, Palbociclib, Abemaciclib, PD 0332991 |
| CD274 | P | P | — | NCT03206203, NCT02447003, NCT02999477, NCT02685059, NCT03430466, NCT02489448, NCT02530489, NCT03430518, NCT03414684, NCT03175666, NCT01042379 | Ribociclib, Palbociclib, Abemaciclib |
| CXCL9 | — | — | — | NCT03112590 | Ribociclib, Palbociclib, Abemaciclib |
| ESR1 | — | P | P | NCT00849030, NCT03455270, NCT02650817, NCT02734615 | Ribociclib, Palbociclib, Abemaciclib |
| AR | P | P | — | NCT01889238, NCT01918306, NCT02457910, NCT01151046, NCT03207529, NCT02130700, NCT01990209 | Ribociclib, Palbociclib, Abemaciclib, PD 0332991 |
| PGR | — | — | — | NCT00849030, NCT01151046, NCT01421472, NCT03241810 | Ribociclib, Palbociclib, Abemaciclib |
| ESR2 | — | — | — | NCT00580112, NCT00050427, NCT020898547, NCT02067741 | Ribociclib, Palbociclib, Abemaciclib, PD 0332991 |
P = Present in the gene panel, —= Not present in the gene panel, not present in the clinical trials, not present in the candidate drugs.