| Literature DB >> 32650480 |
Pushpinder Kaur1,2, Daniel Campo3, Tania B Porras4, Alexander Ring5, Janice Lu2,6, Yvonne Chairez7, Yunyun Su1,2, Irene Kang2,6, Julie E Lang1,2.
Abstract
The comparison of the landscape of somatic alterations in circulating tumor cells (CTCs) versus metastases is challenging. Here, we comprehensively characterized the somatic landscape in bulk (amplified and non-amplified), spike-in breast cancer cells, CTCs, and metastases from breast cancer patients using whole-exome sequencing (WES). We determined the level of genomic concordance for somatic nucleotide variants (SNVs), copy number alterations (CNAs), and structural variants (SVs). The variant allele fractions (VAFs) of somatic variants were remarkably similar between amplified and non-amplified cell line samples as technical replicates. In clinical samples, a significant fraction of somatic variants had low VAFs in CTCs compared to metastases. The most frequently recurrent gene mutations in clinical samples were associated with an elevated C > T mutational signature. We found complex rearrangement patterns including intra- and inter-chromosomal rearrangements, singleton, and recurrent gene fusions, and tandem duplications. We observed high molecular discordance for somatic alterations between paired samples consistent with marked heterogeneity of the somatic landscape. The most prevalent copy number calls were focal deletion events in CTCs and metastases. Our results demonstrate the feasibility of an integrated workflow for the identification of a complete repertoire of somatic alterations and highlight the intrapatient genomic differences that occur between CTCs and metastases.Entities:
Keywords: breast cancer; circulating tumor cells; clinically actionable; metastasis; whole exome sequencing; whole genome amplification
Mesh:
Year: 2020 PMID: 32650480 PMCID: PMC7402350 DOI: 10.3390/ijms21144826
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Summary of sequencing and alignment statistics across all experimental samples.
| Sample ID | Total Reads Alignment (%) | On-Target Reads (%) | On-Target Bases that are Covered at Least 1x (%) | On-Target Bases that are Covered at Least 20x (%) | Coverage | Quality Score | GC (%) |
|---|---|---|---|---|---|---|---|
| MDA1 (Non-amplified gDNA) Replicate 1 | 99.9 | 74.1 | 99.6 | 98.9 | 105 | 33.4 | 47.9 |
| MDA1 (Non-amplified gDNA) Replicate 2 | 99.9 | 71.2 | 99.6 | 98.8 | 102 | 33.6 | 47.6 |
| P1 (Positive control- Amplified gDNA) Replicate 1 | 99.9 | 80.7 | 98.3 | 78.5 | 80 | 36.3 | 43.2 |
| P2 (Positive control- Amplified gDNA) Replicate 2 | 99.9 | 79.6 | 98.5 | 84.9 | 132 | 36.1 | 43.1 |
| S1 (Spike-in sample) Replicate 1 | 99.9 | 77.6 | 92.9 | 60.3 | 57 | 36.4 | 40.8 |
| S2 (Spike-in sample) Replicate 2 | 99.9 | 78.5 | 94.1 | 63.2 | 61 | 36.4 | 41.1 |
| WL1 (WBCs-Healthy control)-Replicate 1 | 99.9 | 75.0 | 93.2 | 70.3 | 70 | 36.3 | 40.2 |
| WL2 (WBCs-Healthy control)-Replicate 2 | 99.9 | 76.4 | 88.7 | 54.7 | 55 | 36.6 | 40.2 |
Figure 1Overview of whole exome sequencing (WES) analysis in spike-in and unspiked breast cancer cells (a) Scatter plot of variant allele fractions (VAFs) detected using WES for replicate 1 and 2 of amplified and non-amplified bulk cancer MDA-MB-231 cells. For replicate 1, data points are for 133 shared variants between P1 and MDA1 (Pearson’s r2 = 0.9) and 163 shared variants for P2 and MDA2 (Pearson’s r2 = 0.9). (b) Bar graph depicting the percentage of coding variants in experimental samples (Bulk cells = MDA1 and MDA2 without Repli-g; Bulk cells = P1 and P2 with Repli-g; and, spike-in cells= S1 and S2 with Repli-g). The Mann–Whitney test was used to evaluate the differences between coding variants within technical replicates. (c) Copy number alteration (CNA) profile heatmap for amplified (P1, P2, S1, and S2) and non-amplified samples (MDA1 and MDA2) with gain (red), loss (blue), and neutral (white). Both Repli-g amplified DNA and non-amplified DNA showed a similar pattern of copy number profiles. The Pearson r test was used for correlation analysis between amplified and non-amplified samples. (d) Oncoplot showing the 40 most frequently mutated genes color-coded by type of mutations in experimental samples. The percentage to the right of the oncoplot shows the percent of samples with variants for the corresponding gene. Right, barplot shows the mutation rate in each of 40 mutated genes; Top, barplot shows the mutation rate for each patient for 40 most frequently mutated genes. By default, samples are ordered by the most frequently mutated genes. (e) Distribution of base substitutions in experimental samples revealed a signature characterized predominantly by C > T transition substitutions. The percent mutations are shown on the x-axis and substitution mutation types are on the y-axis. (f) The number of overlapping somatic single nucleotide variants (SNVs) and genes in Repli-g amplified bulk MDA-MB-231 (P1 + P2) and spike-in samples (S1 + S2). (g) The number of overlapping chromosomes cytobands (cytogenic bands) with copy number gains and losses in Repli-g amplified bulk MDA-MB-231 (P1 + P2) and spike-in samples (S1 + S2).
Summary of sequencing and alignment statistics across all clinical samples.
| Patient | Total Reads Alignment (%) | On-Target Reads (%) | On-Target Bases that are Covered at Least 1x (%) | On-Target Bases that are Covered at Least 20x (%) | Coverage | Quality Score | GC (%) |
|---|---|---|---|---|---|---|---|
| Patient 1_CTC | 99.9 | 73.1 | 99.1 | 79.3 | 142 | 38.2 | 42.0 |
| Patient 1_Met | 99.9 | 59.0 | 71.5 | 50.2 | 86 | 33.6 | 40.0 |
| Patient 1_WBC | 99.5 | 14.5 | 79.8 | 21.7 | 17 | 38.2 | 38.2 |
| Patient 2_CTC | 99.8 | 76.4 | 98.1 | 72.0 | 95 | 38.3 | 43.7 |
| Patient 2_Met | 99.9 | 64.0 | 43.8 | 13.4 | 72 | 32.0 | 45.0 |
| Patient 2_WBC | 99.9 | 13.0 | 73.4 | 18.5 | 15 | 38.4 | 38.3 |
| Patient 3_CTC | 98.3 | 71.5 | 97.0 | 72.8 | 82 | 38.0 | 43.8 |
| Patient 3_Met | 99.9 | 62.4 | 71.1 | 32.9 | 59 | 32.8 | 43.1 |
| Patient 3_WBC | 99.3 | 14.0 | 80.7 | 18.1 | 14 | 38.2 | 40.0 |
| Patient 4_CTC | 99.8 | 74.2 | 98.9 | 75.8 | 81 | 38.2 | 44.2 |
| Patient 4_Met | 99.9 | 60.9 | 57.5 | 23.2 | 75 | 33.1 | 43.8 |
| Patient 4_WBC | 99.7 | 14.1 | 82.4 | 22.1 | 17 | 38.3 | 38.4 |
| Patient 5_CTC | 99.8 | 73.4 | 98.4 | 70.8 | 68 | 38.1 | 43.5 |
| Patient 5_WBC | 98.8 | 15.7 | 87.2 | 21.7 | 17 | 38.1 | 40.0 |
Figure 2Overview of somatic mutations in circulating tumor cells (CTCs) and matched metastatic tissues (a) Violin plot showing the distribution of VAFs across four CTC samples with matched formalin-fixed paraffin-embedded (FFPE) tissue (two-tailed paired t-test, p < 0.05). The black dots indicate the median and quartile of VAFs and violins extended to represent the entire data range. (b) Oncoplot showing the 40 most frequently mutated genes color-coded by type of mutations in CTCs and matched metastatic tissues. The percentage to the right of the oncoplot shows the percent of samples with variants for the corresponding gene. Right, barplot shows the mutation rate in each of the 40 mutated genes; Top, barplot shows the mutation rate for each patient for 40 most frequently mutated genes. By default, samples are ordered by the most frequently mutated genes. (c) Overlap of the most frequently mutated genes identified by Maftools and IVA to identify shared cancer drivers in CTCs and metastases. (d) Distribution of base substitutions in CTCs and metastases revealing a signature characterized predominantly by C > T followed by C > A substitutions. The percentage of mutations are shown on the x-axis and substitution mutation types are on the y-axis. (e) Bar chart showing the enriched canonical pathways in CTCs vs. white blood cells (WBCs), Metastases vs. WBCs, and CTCs + Metastases vs. WBCs. The y-axis on the left shows the percentage of genes overlapping in each pathway having pathogenic (pink) and likely-pathogenic (blue) variants. The y-axis on the right shows the significance level. The number on the top of each stacked bar indicates the total number of genes present in each pathway. The orange line represents the threshold value (0.05) for the significance level for -log(p-value)). The graph is displaying only those entities that have a -log(p-value) greater than 1.3. (f) The number of overlapping somatic SNVs and genes in CTCs and metastases.
Common actionable genes found in CTCs and FFPE metastatic tissues.
| CTCs | Metastases | ||||||
|---|---|---|---|---|---|---|---|
| Patient | Gene | Mutation Type | Protein Sequence Change | VAF (%) | Mutation Type | Protein Sequence Change | VAF (%) |
| Patient 1 | SLIT2 | Frameshift | p.S1686fs*2 | 0.12 | Missense | p.I94T; p.I8T | 0.07 |
| FAT1 | Missense | p.H2768R; p.H2770R | 0.44 | Missense | p.H2768R; p.H2770R | 0.75 | |
| Missense | p.D4070N; p.D4072N | 0.20 | |||||
| Missense | p.P1349S | 0.03 | |||||
| PTCH1 | Frameshift | p.S1203fs*17; p.S1052fs*17; p.S1137fs*17; p.S1202fs*17; p.S1151fs*17 | 0.05 | Synonymous | p.E200E; p.E134E; p.E199E; p.E49E | 0.29 | |
| CHD2 | Frameshift | p.A253fs*8 | 0.08 | Synonymous | p.E23E | 0.04 | |
| Missense | p.G685S | 0.06 | |||||
| Patient 3 | SPTA1 | Stop gain | p.Q340* | 0.09 | Missense | p.H2293N | 0.30 |
| Missense | p.M121I | 0.50 | |||||
| CDC73 | Missense | p.T334I; p.T253I | 0.03 | Missense | p.R171G | 0.06 | |
| LRP1B | Frameshift | p.H4368fs*6 | 0.14 | Synonymous | p.C4299C | 0.55 | |
| Synonymous | p.F2419F | 0.18 | |||||
| Missense | p.F2419I | 0.18 | |||||
| Missense | p.D1807E | 0.11 | |||||
| Missense | p.T1147S | 0.42 | |||||
| Missense | p.V652L | 0.12 | |||||
| STAT4 | Synonymous | p.N59N | 0.07 | Missense | p.A117T | 0.62 | |
| NSD1 | Missense | p.D1466N; p.D1197N | 0.09 | Synonymous | p.L207L; p.L476L | 0.19 | |
| Missense | p.A1645T; p.A1376T | 0.05 | Stop gain | p.S478*; p.S209* | 0.19 | ||
| Missense | p.R890Q; p.R1159Q | 0.19 | |||||
| Missense | p.R1608C; p.R1877C | 0.09 | |||||
| Missense | p.V1699D; p.V1968D | 0.09 | |||||
| ROS1 | Missense | p.664R | 0.07 | Missense | p.P1133Q; p.P1138Q | 0.14 | |
| Missense | p.E1062K; p.E1057K | 0.05 | |||||
| GRM3 | Missense | p.H94Y | 0.04 | Missense | p.I578N | 0.33 | |
| Missense | p.A609V | 0.21 | |||||
| MET | Synonymous | p.Q165Q | 0.06 | Missense | p.Y369H | 0.14 | |
| Missense | p.M33I; p.M463I | 0.43 | |||||
| Missense | p.C770Y | 0.21 | |||||
| FANCF | Missense | p.P76S | 0.05 | Synonymous | p.L321L | 0.43 | |
| CHD2 | Missense | p.897T | 0.08 | Missense | p.P208T; p.P195T; p.P159T | 0.12 | |
| GNAS | Frameshift | p.V118fs*23; p.V102fs*23; p.V746fs*23; p.V103fs*23; p.V117fs*23; p.V760fs*23; p.V58fs*23 | 0.10 | Missense | p.R150H; p.R106H; p.R166H; p.R808H; p.R151H; p.R794H; p.R165H | 0.12 | |
| KDM6A | Stop gain | p.Q753*; p.Q457*; p.Q760*; p.Q781*; p.Q708*; p.Q805*; p.Q736*; p.Q684*; p.Q702*; p.Q674* | 0.02 | Missense | p.R649L; p.R628L; p.R604L; p.R570L; p.R542L; p.R576L; p.R673L; p.R552L; p.R325L; p.R621L | 0.25 | |
| Stop gain | p.E1062*; p.E993*; p.E941*; p.E931*; p.E965*; p.E1038*; p.E1017*; p.E714*; p.E959*; p.E1010* | 0.15 | |||||
| ATRX | Missense | p.K1294E; p.K1332E | 0.03 | Synonymous | p.Q1603Q; p.Q1565Q | 0.50 | |
| Stop gain | p.E1231*; p.E1193* | 0.06 | |||||
| Missense | p.N1176K; p.N1214K | 0.34 | |||||
| Synonymous | p.D705D; p.D667D | 0.10 | |||||
| Missense | p.L582M; p.L620M | 0.11 | |||||
| CSF3R | Frameshift | p.S469fs*5 | 0.10 | Missense | p.G487A | 0.75 | |
| REL | Missense | p.P580S; p.P548S | 0.03 | Missense | p.G347R; p.G379R | 0.12 | |
| DROSHA | Frameshift | p.C96fs*33 | 0.03 | Missense | p.L804V; p.L767V | 0.07 | |
| Missense | p.W486C; p.W523C | 0.33 | |||||
| MSH3 | Missense | p.G485D | 0.06 | Synonymous | p.L601L | 0.07 | |
| Missense | p.Q750H | 0.45 | |||||
| NCOR1 | Synonymous | p.G1553G; p.G1569G; p.G1458G | 0.05 | Missense | p.S2341G; p.S2335G; p.S2438G | 0.33 | |
| Missense | p.S787N; p.S771N; p.S678N | 0.07 | Missense | p.P1317R; p.P1426R; p.P1410R | 0.42 | ||
| Stop gain | p.R305*; p.R414* | 0.77 | |||||
| Missense | p.S103P | 0.06 | |||||
| Missense | p.D100N | 0.08 | |||||
| PIK3C3 | Frameshift | p.T790fs*4; p.T727fs*4 | 0.04 | Missense | p.Q860E; p.Q797E | 0.13 | |
| NCOA3 | Missense | p.P261S | 0.11 | Missense | p.R198P | 0.27 | |
| Missense | p.Q496K; p.Q506K | 0.43 | |||||
| Missense | p.A511E; p.A501E | 0.28 | |||||
| Synonymous | p.L761L; p.L751L | 0.43 | |||||
| Missense | p.S794Y; p.S804Y | 0.27 | |||||
| Missense | p.Q1123E; p.Q1118E | 0.03 | |||||
| Missense | p.P1405R; p.P1409R; p.P1400R; p.P1408R | 0.62 | |||||
Figure 3Overview of somatic structural variants (SVs) in CTCs and matched metastatic tissues. (a) Distribution of somatic SVs in CTCs and metastatic tissue depicting a high proportion of translocation breakends and tandem duplications. (b) Screenshot from IGV showing the coverage and read alignments (gray) in CTCs, metastases, and WBCs of patient #1 and inversions in colored bars (teal and blue) in the GOLGA5 gene. There is no inversion in the matched WBC sample. (c) Screenshot from IGV showing the coverage and read alignments (gray) in CTCs, metastases, and WBCs of patient #4 and duplications (green) in the TPR gene. There is no duplication in the matched WBC sample. (d) Split-screen view from IGV showing the translocation breakpoint in the FGF14 gene on chromosome 13 in patient #4 metastases. There is no translocation breakpoint in CTC and WBC samples.
Figure 4Overview of copy number alterations (CNAs) in CTCs and matched metastatic tissues (a) Overall copy number landscape showing the high prevalence of focal alterations in all CTCs and metastases. The y-axis shows the frequencies of the occurrence of somatic gains (red) and losses (blue) in sub-cytoband regions. (b) Heatmap showing the segmented copy number profiles at chromosomal arm level in each of the samples in CTCs and metastases (gain = red, loss = blue, and neutral = white). (c) The number of overlapping cytobands depicting copy number gain and loss in CTCs and metastases. (d) The heatmap showing the gains and losses in 49 potentially actionable genes in CTCs and metastases.
Clinicopathological information of breast cancer patients.
| Patient | Sample ID | Histology | ER | PR | HER2 | Metastatic Site | Genomic or Genetic Testing Results | Timeframe between FFPE Biopsies and CTCs Collection |
|---|---|---|---|---|---|---|---|---|
| 1 | 19065 | Invasive ductal carcinoma (IDC) | Y | N | N | Liver | PIK3CA (C420R)- Everolimus, Temsirolimus CGN-NTRK1 fusion-Crizotinib CHD4 loss exons 3–40 CHEK2 M1T | 5 days |
| 2 | 78440 | IDC | Y | Y | N | Pleural effusion | Germline BRCA2 mutation carrier | 39 months |
| 3 | 79130 | IDC | Y | Y | N | Breast | None | 29 months |
| 4 | 80192 | IDC | Y | N | N | Brain | None | 19 months |
| 5 | 28412 | IDC | N | N | Y | Brain metastasis, unavailable for research | None | - |
Y-Yes, N-No.