Literature DB >> 20393555

Genome remodelling in a basal-like breast cancer metastasis and xenograft.

Li Ding1, Matthew J Ellis, Shunqiang Li, David E Larson, Ken Chen, John W Wallis, Christopher C Harris, Michael D McLellan, Robert S Fulton, Lucinda L Fulton, Rachel M Abbott, Jeremy Hoog, David J Dooling, Daniel C Koboldt, Heather Schmidt, Joelle Kalicki, Qunyuan Zhang, Lei Chen, Ling Lin, Michael C Wendl, Joshua F McMichael, Vincent J Magrini, Lisa Cook, Sean D McGrath, Tammi L Vickery, Elizabeth Appelbaum, Katherine Deschryver, Sherri Davies, Therese Guintoli, Li Lin, Robert Crowder, Yu Tao, Jacqueline E Snider, Scott M Smith, Adam F Dukes, Gabriel E Sanderson, Craig S Pohl, Kim D Delehaunty, Catrina C Fronick, Kimberley A Pape, Jerry S Reed, Jody S Robinson, Jennifer S Hodges, William Schierding, Nathan D Dees, Dong Shen, Devin P Locke, Madeline E Wiechert, James M Eldred, Josh B Peck, Benjamin J Oberkfell, Justin T Lolofie, Feiyu Du, Amy E Hawkins, Michelle D O'Laughlin, Kelly E Bernard, Mark Cunningham, Glendoria Elliott, Mark D Mason, Dominic M Thompson, Jennifer L Ivanovich, Paul J Goodfellow, Charles M Perou, George M Weinstock, Rebecca Aft, Mark Watson, Timothy J Ley, Richard K Wilson, Elaine R Mardis.   

Abstract

Massively parallel DNA sequencing technologies provide an unprecedented ability to screen entire genomes for genetic changes associated with tumour progression. Here we describe the genomic analyses of four DNA samples from an African-American patient with basal-like breast cancer: peripheral blood, the primary tumour, a brain metastasis and a xenograft derived from the primary tumour. The metastasis contained two de novo mutations and a large deletion not present in the primary tumour, and was significantly enriched for 20 shared mutations. The xenograft retained all primary tumour mutations and displayed a mutation enrichment pattern that resembled the metastasis. Two overlapping large deletions, encompassing CTNNA1, were present in all three tumour samples. The differential mutation frequencies and structural variation patterns in metastasis and xenograft compared with the primary tumour indicate that secondary tumours may arise from a minority of cells within the primary tumour.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20393555      PMCID: PMC2872544          DOI: 10.1038/nature08989

Source DB:  PubMed          Journal:  Nature        ISSN: 0028-0836            Impact factor:   49.962


Introduction

Basal-like breast cancer is characterized by the absence of estrogen receptor (ER) expression, the lack of ERBB2 gene amplification, and a high mitotic index. The consequent absence of approved targeted therapy options and frequently poor response to standard chemotherapy often result in a rapidly fatal clinical course. The disease also accounts for an elevated percentage of breast cancers in patients with African ancestry1. Clinical progress has been limited by a poor understanding of the genetic events responsible for this tumor subtype and by limited preclinical models to study the disease. Since basal-like breast cancer has a highly unstable genome, a key question is whether the fatal metastatic process is driven by mutations that occur after the tumor cells arrive at the distant site, or whether the primary tumor generates cells with a complete repertoire of somatic mutations required for metastatic growth. The rapid advancement of next generation sequencing technologies allows comprehensive characterization of genomic changes, facilitating the comparison of multiple samples taken from the same patient to address the genetic basis for tumor progression and metastasis.

Results

Case presentation and prior characterization of samples

A 44-year old African-American woman was diagnosed with an ERBB2 negative and ER negative inflammatory breast cancer. She was treated with neoadjuvant dose-dense chemotherapy2, but significant residual tumor was present in the breast and axillary lymph nodes at mastectomy. This indicated chemotherapy resistance and she underwent radiation therapy. Eight months later, she developed a cerebellar metastasis and, despite resection, rapidly succumbed to widely disseminated disease. A transplantable Human-in-Mouse (HIM) xenograft tumor line was generated from a sample of her primary tumor biopsied before treatment3. The xenograft in the mammary fat pad was locally invasive and produced metastatic deposits in lymph nodes and ovaries. Informed consent for full genome sequencing was obtained and DNA samples were prepared from her peripheral blood, primary tumor, brain metastasis, and an early passage xenograft (harvested 101 days after initial engrafting into the mouse host). Application of the PAM50 intrinsic subtype algorithm identified the primary tumor, brain metastasis, and xenograft line as basal-like subtype, with high risk of relapse (ROR) scores4.

Sequence coverage and mutation analysis

Using a paired-end sequencing strategy, we generated 130.7, 124.9, 111.8, and 149.2 billion base pairs of sequence data from genomic DNA derived from blood, primary tumor, brain metastasis, and xenograft samples, respectively, with corresponding haploid coverages of 38.8X, 29.0X, 32.0X, and 23.8X (Supplementary Table 1). These genome-wide coverages were assessed by comparing SNVs detected by MAQ5 with SNPs genotyped using Illumina 1M duo arrays for all tissues excluding the xenograft. Array data from the metastasis were used as a surrogate for monitoring the xenograft SNP coverage and confirmed bi-allelic detection of 98.27%, 96.79%, 96.17%, and 88.77% of the heterozygous array SNPs in the normal, primary tumor, metastasis, and xenograft sequence datasets, respectively (Supplementary Table 1). The process for selecting somatic mutations is shown in Supplementary Table 2 and is detailed in Supplementary Information. Putative somatic SNVs and indels that overlap with coding sequences, splice sites, and RNA genes were included as “tier 1”. We combined tier 1 sites identified in all three tumor samples and obtained deep read count data for all four samples from Illumina and/or 454 platforms (Supplementary Information). Based on pathology review, the tumor cellularity estimates were 70% for the primary tumor and 90% for both the brain metastasis and xenograft. Utilizing these estimates, we calculated the tumor read counts by proportionally removing the counts derived from the normal tissue reads from the counts obtained from primary tumor and metastasis reads (Supplementary Table 3a). Using the Illumina platform, we also generated 15.6 Gb (4.4X haploid coverage) of sequence data for the NOD/SCID mouse genome used as the host for the xenograft line. The mapping rates of NOD/SCID data to human and mouse C57BL/6 reference sequences were 3.17% and 95.85%, respectively. Since the non-malignant contamination in xenograft is largely from murine cells (which do not significantly affect read mapping), no correction was applied for the xenograft data. Adjusted tumor read counts were used to calculate mutant allele frequencies. Somatic changes were validated by comparing mutant allele frequencies in the three tumor genomes against the germline DNA sample, combined with a manual review of ABI 3730 data from PCR products (Supplementary Information). In summary, a total of 50 somatic sites, including 28 missense, 11 silent, 2 splice site, 1 RNA, 1 nonsense, 4 insertions, and 3 deletions, were validated in at least one of the three tumor genomes. Of coding point mutations, the observed nonsynonymous:synonymous (NS:SS) ratio of 2.64:1 (29:11) is not significantly different from that expected by chance6 (P = 0.51), suggesting that the majority of coding mutations do not confer a selective advantage to the basal tumor. This is similar to the NS:SS ratio reported in the small-cell lung cancer cell line NCI-H2097, but higher than the ratio reported in the melanoma cell line COLO-8298.

Mutation spectrum in basal breast tumor

We investigated the spectrum of DNA sequence changes in this basal tumor and found 55% (22/40) of coding point mutations represent C:G->T:A transitions. A similar frequency of C:G->T:A transitions (56% (18/32)) was observed in the lobular breast tumor recently reported9 (Figure 1a). In addition, 15% (6/40) of coding point mutations representing C:G->A:T transversions were detected in the basal tumor, but none were found in the lobular tumor. The statistical significance of these observations should be explored with the comparative analysis of a larger number of basal and lobular breast tumors. Moreover, the observed C:G->T:A transition frequency is notably higher than those observed in a previous breast cancer study10 (P = 0.027; Figure 1b). A set of extremely high confidence tier 1-4 mutations (somatic score > 55 and average mapping quality > 79) was used to explore the genome-wide mutation spectrum. We found that mutations at A:T bases are significantly expanded in the genome-wide set compared to the coding mutations, especially for A:T->G:C transitions (P = 0.0065). This is consistent with the higher A:T content in non-coding sequences than in coding sequences. Comparison to the whole-genome mutation spectrum reported for the melanoma cell line (COLO-829)8 and a small cell lung cancer cell line (NCI-H209)7 suggests that the tumor genome under study shows no sign of tobacco or ultraviolet influence. We then compared the fraction of the three classes of guanine mutations occurring at CpG dinucleotides in primary tumor, brain metastasis, and xenograft and found that the frequencies of G->A mutations are 27.54%, 27.60%, and 28.05% in each respective tumor, significantly higher than both the genome average of 4.45% (P < 10-10) and the frequency reported in NCI-H209 (P < 10-10; Figure 1c).
Figure 1

Mutational signatures in the basal breast tumor

a, Fraction of mutations in each of the transition and transversion categories in the metastasis of a lobular breast tumor9, the metastasis of the basal breast tumor under study, and the 11 breast tumors reported by Wood et al.29 from which 1,104 coding mutations identified in the discovery set were used in the analysis. b, Fraction of mutations in each of the transition and transversion categories in 43 tier1 mutations and 3,204 tier 1-4 mutations in the metastasis under study. c, Fraction of guanine mutations at CpGs in primary tumor, metastasis, xenograft, and NCI-H209 as reported by Pleasance et al.7.

Distribution of mutations among tumors

1. Common mutations detected in three tumor genomes

Of the 50 validated point mutations and small indels, 48 are detectable in all three tumors. We performed a statistical enrichment test that takes the variations of different platforms, experiments, and primer pairs into consideration (Supplementary Information). These 48 sites consist of (1A) 20 sites with relatively comparable frequencies across tumors, (1B) 26 sites significantly enriched (FDR ≤ 0.05) in the metastasis and/or xenograft, and (1C) two sites with significant enrichment (FDR ≤ 0.05) in the primary tumor (Figure 2 and Table 1). The affected genes and the likely consequences of these mutations are summarized in Table 1 and Supplementary Table 3b.
Figure 2

Mutant allele frequency from deep read count data

The mutant allele frequency of each somatic mutation is shown. Mutations were validated using both 454 and Illumina sequencing. Each bar represents the average of the frequency yielded by the two technologies for a single primer pair and the error bars represent the standard deviation. Data were considered only if there were at least 200 reads from Illumina sequencing and at least 20 reads from 454 sequencing. If no error bar exists, then data were only available from a single sequencing platform.

Table 1

Summary of point mutations and small indels identified in the primary tumor, brain metastasis, and xenograft.

ChrStartAllele ChangeGeneAmino Acid ChangeMutant Allele FrequencyCopy NumberEnrichment FDR
NTMXTMXMX
126062702G>APAQR7p.A720.17%5.78%34.55%13.70%2229.00e-40.011
126646672G>ADHDDSp.R159H0.14%21.12%40.24%88.29%2225.73e-52.46e-5
143684654G>AKIAA0467p.G2119R0.13%7.00%40.02%84.84%2220.0015.98e-5
145068225C>GPTCH2p.W293S0.02%13.70%36.03%43.61%2220.0850.381
1152723308delGCAACTTTTCATTSHEp.LPFKG476in_frame_delW0.19%19.28%21.33%7.69%4.615.346.920.8200.065
1226395989C>AGUK1p.P11Q0.01%36.29%33.12%40.17%3.663.964.640.3740.365
1242935580A>TPPPDE1p.T151S0.12%3.39%48.57%11.47%3.453.714.380.0120.063
224994872G>AADCY3p.H1630.06%7.10%37.49%48.71%3.173.243.780.0070.029
256273320delGCCDC85Ap.E161fs0.16%1.71%17.11%28.78%2.93.243.340.0020.006
2197349569G>CGTF3C3p.R474G0.11%29.42%37.75%11.08%1.41.311.240.3160.065
2229835724C>TPID1p.S140.11%8.95%38.89%66.13%221.360.0010.166
2241641282C>TSNED1p.T708I0.04%0.32%36.52%2.30%2221.58e-40.719
310236363G>TIRAK2e8-10.38%48.37%52.69%53.33%2220.1560.762
3139505123G>ATXNDC6p.R221W0.29%39.50%58.62%15.81%2220.0390.012
3150728323A>CWWTR1p.F299V0.03%6.87%28.14%9.53%2221.43e-40.020
440051165C>ACHRNA9p.D437E0.10%28.38%36.82%90.26%2220.0739.67e-5
440134827delGRBM47p.I280fs0.05%8.62%79.15%79.74%2220.0300.124
482232630C>TPRKG2p.R7090.11%6.99%82.99%91.51%2220.0830.094
5135422725G>ATGFBIp.E576K0.17%89.09%37.58%18.45%2223.34e-63.23e-5
5169466048C>TFOXI1p.S170F0.15%69.28%78.33%93.61%2220.4730.009
7100463999G>CMUC17p.S861T1.69%9.22%1.46%14.43%22.764.040.0730.816
7128284099C>TFLNCp.N24830.11%0.17%18.21%0.16%2.542.82.930.0020.193
7148400407G>AZNF786p.F1300.13%13.61%62.86%81.04%2.512.853.543.01e-43.23e-5
83232441C>ACSMD1p.A409S0.04%29.75%54.22%65.18%2220.3550.141
88477326C>TENSG00000222487NULL0.11%9.61%28.43%13.74%222.670.1200.787
95040714T>CJAK2p.I166T0.09%61.63%21.93%47.40%2.832.672.840.2460.999
9107137789G>ASLC44A1p.A132T0.08%2.59%76.14%85.31%21.291.191.43e-41.05e-4
1014603968C>TFAM107Bp.R237Q2.65%13.53%63.25%97.88%3.74.044.763.29e-68.54e-8
1030789749C>TMAP3K8p.P461L0.11%13.33%31.72%77.47%3.443.714.210.0029.67e-5
1079240899G>ADLG5p.D14740.07%32.94%76.10%74.72%2226.12e-50.011
1112496610insATGGAGPARVAp.338in_frame_insDG0.00%1.41%10.75%10.58%2220.3470.365
1148128224A>TPTPRJp.K1017N0.20%1.25%32.08%57.23%222.993.48e-42.20e-4
11102687902G>ADYNC2H1p.R3867Q0.06%12.81%25.78%15.69%2220.0020.023
1231122692T>GDDX11p.V33G0.02%44.35%40.39%57.88%1.491.371.240.3160.386
1376628331G>AMYCBP2p.Q2222*0.10%87.84%43.76%36.95%2220.0040.003
13100688137A>TNALCNp.D468E0.16%18.60%87.66%1.65%22.742.920.0040.216
1419285546G>TOR4Q3p.L400.22%36.94%40.31%32.28%2220.3130.107
1666569387T>GDPEP3p.R262S0.84%45.61%39.43%76.59%22.93.020.2936.93e-4
1682828230C>AKCNG4p.G1210.04%4.15%26.82%69.89%2.433.093.490.0830.259
177519157insGTP53p.Q167fs4.61%79.40%62.62%97.96%2220.0850.003
1732904736C>TTADA2Lp.R339W0.12%17.49%59.92%79.47%2220.0020.002
1912363315G>AZNF799p.H2990.17%2.05%26.23%11.81%223.060.0620.618
1916006577insAENSG00000167459p.I38fs4.82%26.53%48.47%37.74%223.060.2860.809
205851563G>ACHGBp.R258Q0.14%35.64%45.50%54.87%2.572.863.640.0570.005
2145015744G>AUBE2G2p.I1580.12%21.57%26.72%20.89%2220.5220.728
X15731812C>GZRSR2p.A95G1.01%64.01%58.66%72.08%2.512.772.990.1370.969
X43893087C>GEFHC2e15-10.01%9.88%23.15%7.35%22.682.820.1140.381
X46318872insACHST7p.T188fs0.19%3.67%54.36%38.84%22.682.820.0730.058
X105040331C>ANRKp.A681E0.12%4.08%30.84%52.45%2220.0850.017
X129374039A>GRBMX2p.K169E0.30%11.88%38.36%69.46%22.652.770.0020.003

Gene sets from Ensembl build 54 and Genbank (downloaded in May 2009) were used for annotation of mutations. T: primary tumor; M: metastasis; X: xenograft.

1A. Mutations with comparable frequencies in three tumors

We detected a JAK2 mutation (I166T), residing in the FERM domain, which is different from the previously reported activating mutations in myeloproliferative diseases, often found in the pseudokinase domain11. Screening of an additional 116 breast tumors identified another mutation (R1122P) in the kinase domain of JAK2 from a Luminal B-type breast cancer. A splice site mutation (e8-1) was found in IRAK2. We performed an RT-PCR experiment using RNAs from the brain metastasis and xenograft and found that the first 30 nucleotides of exon 8 (IRAK2, NM_001570) were skipped and an internal exonic AG site was used as a splice acceptor, resulting in an in-frame deletion. A missense mutation (A401S) in CSMD1 was found in all three tumors. Loss of CSMD1 expression is associated with poor survival in invasive ductal breast carcinoma12 and it is frequently deleted in colorectal adenocarcinoma and head/neck carcinomas13. We also identified 3 missense (E608K, T1456R, and Q2204R) and 1 nonsense (Q3005*) mutations in CSMD1 in 4 breast cancers out of 116 screened. A binomial test shows that CSMD1 is significantly mutated in breast cancer (P = 0.022 and FDR = 0.197)(Supplementary Table 4).

1B. Mutations highly enriched in metastasis and/or xenograft

A missense mutation (A681E) in NRK, a protein kinase involved in activating JNK, was found to be present in all three tumors, but at 8- and 13-fold increased allele frequencies in the metastasis and xenograft, respectively (Figure 2 and Table 1). Two somatic mutations (S424C and Q521*) in NRK have been previously reported in breast cancer14. The missense mutation (P461L) identified in the C-terminus of MAP3K8 was present at a roughly 6-fold increase in the xenograft compared to the primary tumor. C-terminal truncation of MAP3K8 has been shown to activate this oncogenic kinase15,16, raising the possibility that this C-terminal substitution (P461L) is an activating mutation. Another missense mutation (K1017N) in PTPRJ, a protein tyrosine phosphatase, had a mutant allele frequency of 32% in the metastasis and 57% in the xenograft compared to just 1.3% in the primary tumor. This K1017N mutation in PTPRJ is among the most highly enriched mutations in both the metastasis (FDR = 0.00035) and xenograft (FDR = 0.00022). The mutation site is in the juxtamembrane domain (a basic residue motif) and is in close proximity to the tyrosine-protein phosphatase domain (aa 1041-1298). Sacco et al.17 reported that the PTPRJ charged peptide (aa 1013-1024) is responsible for interaction with its substrates, such as ERK1/2. The K1017N mutation found in the basal tumor and the K1016A mutation described in Sacco's report both change a basic residue to a neutral residue, suggesting these two mutations may be functionally similar. A missense mutation (F299V) in WWTR1, assigned as deleterious by SIFT18, was detected at 28% mutant allele frequency in metastasis, but only at 7% and 10% in primary tumor and xenograft, respectively (Figure 2 and Table 1). WWTR1, a 14-3-3 binding protein with a PDZ binding motif, has been shown to modulate mesenchymal stem cell differentiation19. Over-expression of WWTR1 has also been implicated in promoting the migration, invasion, and tumorigenesis of breast cancer cells20. Another point mutation (R258Q) was identified in CHGB (chromogranin B) encoding a tyrosine-sulfated secretory protein. A SNP at the same position was reported to dbSNP in January 2009 for a Yoruba sample. It was also assigned as a germline site in another African-American with breast cancer when we genotyped this mutation in 112 additional primary tumors and 73 metastatic tumors of various expression classes (Supplementary Information). To investigate this variant further, 84 cancer-free African American women with an average age of 71.2 yrs (low risk for developing breast cancer) and 38 early-onset African American breast cancer patients with an average age of 35.6 yrs were genotyped. The results indicated that 8 out of 84 controls and 3 out of 38 cases carried the variant allele, suggesting this variant is unlikely to be a breast cancer susceptibility allele. Three validated indels were enriched in the metastasis and/or xenograft. One was the 1-bp insertion in exon 4 of the TP53 gene, which creates a frameshift mutation (Q167fs) in the DNA binding domain and results in a truncated protein. We found the TP53 mutation significantly enriched in the xenograft, while present at a relatively constant frequency in primary tumor and metastasis (Figure 2 and Table 1).

1C. Mutations enriched in the primary tumor

A nonsense mutation (Q2222*) in MYCBP2 and a missense mutation (E576K) in TGFBI, both found in all three tumors, had higher mutant allele frequencies in the primary tumor (88% for MYCBP2 and 89% for TGFBI) than in the metastasis (44% for MYCBP2 and 38% for TGFBI) or the xenograft (37% for MYCBP2 and 18% for TGFBI) (Figure 2 and Table 1).

2. De novo mutations identified in the metastasis

Two de novo mutations were discovered in the metastatic tumor, neither of which was detected in the primary or xenograft tumor genomes. One was a missense mutation (T708I) in SNED1, with a mutant allele frequency of 37%. The other was a silent mutation (N2483) in FLNC with a mutant allele frequency of 18% (Figure 2 and Table 1). Since the xenograft line, without these two mutations, exhibits metastatic lesions in ovarian, lymphoid, and subcutaneous tissue (data not shown), it is unlikely that these mutated genes are essential to the metastatic process.

Elevated copy number alterations in metastasis and xenograft

The cnvHMM algorithm (unpublished) was applied to the aligned sequence reads to detect regions of copy number alterations in all three tumors. Using pathology-based purity estimates for the primary tumor and brain metastasis, we calculated the read depth contributed from the tumor cells alone and then computed the copy number for all genomic positions. Read depth correction was not applied to the xenograft, as stated earlier. We subsequently compared the copy number data from all three tumors with those from peripheral blood, to identify genomic segments with significant copy number alterations (CNAs) (Supplementary Information). A total of 516.5 Mb, 640.4 Mb, and 754.5 Mb were amplified, while 342.5 Mb, 383.1 Mb, and 562.5 Mb were deleted, in primary tumor, metastasis, and xenograft, respectively (Supplementary Table 5-7). Moreover, 96.11% and 93.98% of CNA sequences in the primary tumor also were found in CNA segments in the metastasis and xenograft respectively, suggesting most primary tumor CNAs are preserved during disease progression and engraftment. On the other hand, only 80.65% of metastasis and 61.29% of xenograft CNA sequences overlap with primary tumor CNAs. Furthermore, 155 regions with focal copy number segments (≤ 2Mbp) were detected in the primary tumor, but only 101 and 97 regions in the metastasis and xenograft (Supplementary Table 8-10). Our result also shows that 111 (average span = 745,183 bp) and 99 (average span = 799,395 bp) focal copy number segments (≤ 2Mbp) in the primary tumor overlap with broader copy number segments in the metastasis (average span = 2,245,546 bp) and xenograft (average span = 3,565,456 bp), suggesting possible expansion of primary focal regions or selection of new adjacent events during disease progression and in the mouse host. Sequence depth-based copy number analysis shows overall the highest concordance with other platforms, including the array CGH and Illumina SNP array, and also provided the highest concordance of copy number (correlation coefficients: 0.89-0.97) between primary tumor, metastasis, and xenograft(Supplementary Table 11).

Common and unique structural variations in three tumors

We used BreakDancer 21 to detect structural variants (SV) in sequencing data from paired end libraries (Supplementary Table 12) and applied a set of thresholds to identify putative somatic structural events.

1. Deletions, insertions, and inversions

Breakpoint-containing contigs from the three tumor samples, that were not present in the matched normal genome, were successfully assembled for 137 deletions, 15 insertions, and 38 inversions using the TIGRA assembler (unpublished), suggesting they were putative somatic events. We then re-mapped individual reads to these assembled contigs to screen out germline SVs and to confirm somatic SVs (Supplementary Information), resulting in the detection of 59 deletions and 18 inversions. PCR primers were designed successfully to validate 73 out of 77 putative SV events and the resulting amplicons were sequenced by either the Roche454 or ABI 3730 platform. Subsequently, 28 deletions and 6 inversions were validated as somatic events (Table 2). Among them, a 46,462 bp heterozygous deletion in FBXW7 removes the last 10 exons and a portion of the first exon of NM_018315, likely inactivating FBXW7. FBXW7 targets Cyclin E and mTOR for ubiquitin-mediated degradation22,23. Numerous cancer-associated mutations in FBXW7 have been previously reported, and loss of FBXW7 function causes chromosomal instability and tumorigenesis24. Two overlapping deletions (538,467 bp and 515,465 bp in length) on chromosome 5, affecting CTNNA1 along with LRRTM2, MATR3, SNORA74A, and SIL1 also were validated. This result is consistent with the detection of a focal copy number deletion encompassing this region in both metastasis (copy number = 0.65) and xenograft (copy number = 0.03) (Figure 3 and Supplementary Table 9 and 10). Careful examination of this region in the aligned sequence reads for the primary tumor confirms the existence of copy number deletion. Loss of CTNNA1 was shown to result in global loss of cell adhesion in human breast cancer cells25 and increased in vitro tumorigenic characteristics26, suggesting this bi-allelic deletion has functional importance. A 109,563 bp heterozygous deletion on chromosome 8 was assembled and validated in all three tumors. This event removed three exons of NRG1, which encodes a peptide growth factor that binds to ERBB3 and ERBB4. Interestingly, a 26,919 bp deletion in MECR was only identified, assembled, and validated in the metastasis, suggesting its de novo nature in this sample.
Table 2

Validated structural variations.

TypeTumor sourceChromosome ABreakpoint AOrientation AChromosome BBreakpoint BOrientation BEvent sizeGene
TranslocationT,M,X1245548334minus264855174plusZNF496
TranslocationT,M1245548342Plus6144243130plusZNF496, C6orf94
TranslocationT,M,X264855565Plus6144243118minusC6orf94
TranslocationT,M,X2165126335plus164537866plusGRB14
TranslocationT,M,X4188855443plus9139022260plusABCA2
TranslocationT,M,X1210874022plus1499382256minusEML1
TranslocationT,M1917188977minus3188010735plusUSE1
InversionT,M,X13570368213573214828465KIAA0319L
InversionT,M,X1959195291959209401410
InversionT,M,X120445909712044612972200
InversionT,M,X120445954712044605811033
InversionT,M,X417788604141778901714129VEGFC
InversionT,M,X19178008611917801858996JAK3
DeletionM12938921312941613326919MECR
DeletionT,M,X17649671917649679779ST6GALNAC3
DeletionT,M,X188291885188292292406
DeletionT,M,X218629189219196656567466NT5C1B
DeletionT,M,X264853205265010694157488
DeletionT,M,X21287453032128898612153308HS6ST1
DeletionT,M,X412033954126556062164CTBP1
DeletionT,M,X413573739941357387181318
DeletionT,M,X4147221480414729462873147AK057233
DeletionT,M,X4153446894415349335746462FBXW7
DeletionT,M,X515572469515572649179FBXL7
DeletionT,M,X51307436045130743718113CDC42SE2
DeletionT,M,X51381314955138669963538467CTNNA1, LRRTM2, MATR3, SNORA74A, SIL1
DeletionT,M,X51381417535138657219515465CTNNA1, LRRTM2, MATR3, SNORA74A, SIL1
DeletionT,M,X639689264639689652387KIF6
DeletionT,M,X79997437999984240
DeletionT,M,X71354192327135419453220
DeletionT,M,X832597100832706664109563NRG1
DeletionT,M,X8116552846811663466581818TRPS1
DeletionT,M,X81365957958136596285489KHDRBS3
DeletionT,M,X9274653492746735200
DeletionT,M,X10771423781077142881502C10orf11
DeletionT,M,X1111597441811115974688269
DeletionT,M,X1112547937711125479744366
DeletionT,M,X1724451601172447525523653MYO18A
DeletionT,M,X17737334461773733547100BIRC5
DeletionT,M,X184676551018467680172507ELAC1
DeletionT,M,XX149511547X14954864237094MTM1

T: primary tumor; M: metastasis; X: xenograft.

Figure 3

Two overlapping CTNNA1 deletions on chromosome 5 in three tumors

A graph of sequence depths, read pairs, and genes in a 638,468 bp region containing two overlapping deletions. The top four panels display the read depths at each base and the reads within the region whose mates mapped at an abnormal distance are displayed as blue bars, with matched pairs connected by arcs. Two different shades of blue indicate the two separate allelic deletion events (538,467 bp and 515,465 bp in length). The bottom panel displays genes annotated in this genomic region.

2. Translocations

Of the 112 assembled putative translocations, 34 passed manual review using Pairoscope graphs (unpublished), and 19 with an assembly score greater than our experimentally-supported cutoff of 10 were included in Supplementary Table 13. Seven translocations were experimentally validated (Table 2). One validated translocation t(4;9)(188855443;139022258), assembled in all three tumors, involved an LTR from the ERVL-MaLR family on chromosome 4 and ABCA2 on chromosome 9. The translocation removes the final exon of the ABCA2 gene (NM_001606). Two other validated translocations, identified in all three tumors, are t(1;2)(245548338;64855172) and t(2;6)(64855607;144243116) (Supplementary Figure 1). Noticeably, the breakpoints on chromosome 2 for these two translocations are only separated by 393 bp in a TcMar-Tigger repeat. The chromosome 1 breakpoint of t(1;2)(245548338;64855172) is in intron 5 of NM_032752 in ZNF496. We expect the translation of ZNF496 to continue through exon 5 into intron 5 due to lack of a splice acceptor site. On the other hand, t(2;6)(64855565;144243116) involves FAM164B on chromosome 6 and the translocation contig retains 3 exons of XM_928657. We have also validated t(1:6)(245548342;144243110) (not detected by BreakDancer), whose breakpoints are only 4 bp and 6 bp away from the breakpoints identified on chromosomes 1 and 6 for t(1;2)(245548338;64855172) and t(2;6)(64855607;144243116), respectively (Supplementary Figure 1). This translocation is found in both the primary tumor and the metastasis, but apparently is lost in the xenograft (Supplementary Figure 1 and 4). Sequencing of two PCR products generated using two primer pairs from chromosomes 1 and 6 demonstrated the presence of two forms of genomic fusions: one includes chromosomes 1 and 6 and the other includes chromosomes 1, 2, and 6. The former is only present in the primary tumor and the metastasis.
Figure 4

Circos plots for the primary tumor, metastasis, and xenograft genomes

Circos30 plots display the validated tier 1 somatic mutations, DNA copy number, and validated structural rearrangements in the primary tumor (a), metastasis (b), and xenograft (c). Mutations enriched in the primary tumor (a) are labeled in red and mutations enriched in the metastasis or xenograft are in red (b and c). Mutations and the large deletion unique to the metastasis are in blue (b). Translocations only present in primary tumor and metastasis are in green. All shared events are in black. The copy number difference between the tumor and normal is shown (scale: -4 to 4). No purity-based copy number corrections were used for plotting.

Discussion

Our comprehensive analysis of this sample set identified 50 novel somatic point mutations and small indels in coding sequences, RNA genes, and splice sites as well as 28 large deletions, 6 inversions and 7 translocations. In terms of functional annotation, a hierarchy can be suggested. The first level includes somatic changes likely to be functional, such as the small indel in TP53, the large heterozygous deletion in FBXW7, and the bi-allelic deletion in CTNNA1. The second level consists of non-synonymous mutations in genes previously noted to be targeted for somatic mutation in cancer or found to be recurrently mutated in this study, although the exact mutations are novel and their functional importance requires further investigation (JAK2, PTCH2, CSMD1, and NRK). The third level contains mutations known to be related to signal transduction in the malignant cells and/or found to be enriched during disease progression (MAP3K8, PTPRJ, and WWTR1). The final level, by far the largest group, awaits the acquisition of new data. Analysis of germline variants for over 500 classic tumor suppressor genes and oncogenes27 identified a large number of SNPs, none of which were unequivocal hereditary breast cancer susceptibility alleles (data not shown). The wide range of mutant allele frequencies suggests considerable genetic heterogeneity in the cellular population at the primary site. The mutation frequency range narrowed in brain metastasis and xenograft, suggesting the metastatic and transplantation processes selected for cells utilizing a distinct subset of the primary tumor mutation repertoire. The overlap between the mutation frequency changes seen in the metastatic and xenograft samples argues that genomic progression during xenograft formation is similar to that during metastasis. Moreover, it suggests that the changes were not therapy-related, since the xenograft was established prior to any treatment. GO annotation of enriched mutations suggests that transcription factor activity is possibly selected in xenograft (Supplementary Table 14). In contrast to our observation of only two new tier 1 mutations at the metastatic site, sequencing of an indolent metastatic lobular breast tumor showed that the great majority of the mutations detected were completely novel when compared to the primary tumor9. However, in this instance, the metastatic process evolved over nine years, as opposed to less than one year in the case we describe here. Another difference relative to the lobular cancer genome, where no structural variants were validated, was that paired-end sequencing detected 41 structural variations within this basal-like tumor genome. Our study of a primary tumor-metastasis-xenograft trio therefore demonstrates that, while additional somatic mutations, copy number alterations, and structural variations do occur during the clinical course of the disease, most of the original mutations and structural variants present in the primary tumor are propagated. The preservation of all primary mutations in the xenograft suggests that early passage xenograft lines are valid for functional and therapeutic studies. However, the altered mutation frequency and elevated degree of copy number alterations suggest caution when interpreting the results of such experiments. In conclusion, the first completed basal-like breast cancer genome is highly complex, as would be anticipated from a tumor-type associated with chromosomal instability and DNA repair defects. Indeed this cancer genome, in comparison with the two AML cases published recently by our group27,28, revealed a 3-4 fold increase in high confidence SNVs genome-wide, suggesting a much greater background mutation rate. Future studies should extend our analysis approach of primary, metastatic, and normal tissue trios and include affected individuals with diverse geographic origins to produce a complete catalog of recurrent somatic and inherited variants associated with the development of this common malignancy.

Methods Summary

Illumina reads from peripheral blood, primary tumor, metastasis, and xenograft were aligned to NCBI build36 using MAQ5 and coverage levels were defined by comparison of SNPs identified by Illumina 1M duo arrays to SNVs called by MAQ. Somatic mutations were identified using our in-house programs glfSomatic and a modified version of the Samtools indel caller (http://samtools.sourceforge.net/). Putative variants were manually reviewed and then validated by Illumina, 3730 or 454 sequencing. Structural variations were identified using BreakDancer21, manually reviewed and validated by a combination of localized Illumina read assembly, PCR and either 3730 or 454 sequencing. A complete description of the materials and methods used to generate this data set and results is provided in the Supplementary Information.
  30 in total

1.  Reconstruction of functionally normal and malignant human breast tissues in mice.

Authors:  Charlotte Kuperwasser; Tony Chavarria; Min Wu; Greg Magrane; Joe W Gray; Loucinda Carey; Andrea Richardson; Robert A Weinberg
Journal:  Proc Natl Acad Sci U S A       Date:  2004-03-29       Impact factor: 11.205

2.  Race, breast cancer subtypes, and survival in the Carolina Breast Cancer Study.

Authors:  Lisa A Carey; Charles M Perou; Chad A Livasy; Lynn G Dressler; David Cowan; Kathleen Conway; Gamze Karaca; Melissa A Troester; Chiu Kit Tse; Sharon Edmiston; Sandra L Deming; Joseph Geradts; Maggie C U Cheang; Torsten O Nielsen; Patricia G Moorman; H Shelton Earp; Robert C Millikan
Journal:  JAMA       Date:  2006-06-07       Impact factor: 56.272

3.  A role for TAZ in migration, invasion, and tumorigenesis of breast cancer cells.

Authors:  Siew Wee Chan; Chun Jye Lim; Ke Guo; Chee Peng Ng; Ian Lee; Walter Hunziker; Qi Zeng; Wanjin Hong
Journal:  Cancer Res       Date:  2008-04-15       Impact factor: 12.701

4.  TAZ, a transcriptional modulator of mesenchymal stem cell differentiation.

Authors:  Jeong-Ho Hong; Eun Sook Hwang; Michael T McManus; Adam Amsterdam; Yu Tian; Ralitsa Kalmukova; Elisabetta Mueller; Thomas Benjamin; Bruce M Spiegelman; Phillip A Sharp; Nancy Hopkins; Michael B Yaffe
Journal:  Science       Date:  2005-08-12       Impact factor: 47.728

5.  Fbw7 isoform interaction contributes to cyclin E proteolysis.

Authors:  Wei Zhang; Deanna M Koepp
Journal:  Mol Cancer Res       Date:  2006-12       Impact factor: 5.852

6.  The consensus coding sequences of human breast and colorectal cancers.

Authors:  Tobias Sjöblom; Siân Jones; Laura D Wood; D Williams Parsons; Jimmy Lin; Thomas D Barber; Diana Mandelker; Rebecca J Leary; Janine Ptak; Natalie Silliman; Steve Szabo; Phillip Buckhaults; Christopher Farrell; Paul Meeh; Sanford D Markowitz; Joseph Willis; Dawn Dawson; James K V Willson; Adi F Gazdar; James Hartigan; Leo Wu; Changsheng Liu; Giovanni Parmigiani; Ben Ho Park; Kurtis E Bachman; Nickolas Papadopoulos; Bert Vogelstein; Kenneth W Kinzler; Victor E Velculescu
Journal:  Science       Date:  2006-09-07       Impact factor: 47.728

7.  FBXW7 targets mTOR for degradation and cooperates with PTEN in tumor suppression.

Authors:  Jian-Hua Mao; Il-Jin Kim; Di Wu; Joan Climent; Hio Chung Kang; Reyno DelRosario; Allan Balmain
Journal:  Science       Date:  2008-09-12       Impact factor: 47.728

Review 8.  FBW7 ubiquitin ligase: a tumour suppressor at the crossroads of cell division, growth and differentiation.

Authors:  Markus Welcker; Bruce E Clurman
Journal:  Nat Rev Cancer       Date:  2008-02       Impact factor: 60.716

9.  Mutational activation of the MAP3K8 protooncogene in lung cancer.

Authors:  Adam Michael Clark; Steven H Reynolds; Marshall Anderson; Jonathan S Wiest
Journal:  Genes Chromosomes Cancer       Date:  2004-10       Impact factor: 5.006

10.  The genomic landscapes of human breast and colorectal cancers.

Authors:  Laura D Wood; D Williams Parsons; Siân Jones; Jimmy Lin; Tobias Sjöblom; Rebecca J Leary; Dong Shen; Simina M Boca; Thomas Barber; Janine Ptak; Natalie Silliman; Steve Szabo; Zoltan Dezso; Vadim Ustyanksky; Tatiana Nikolskaya; Yuri Nikolsky; Rachel Karchin; Paul A Wilson; Joshua S Kaminker; Zemin Zhang; Randal Croshaw; Joseph Willis; Dawn Dawson; Michail Shipitsin; James K V Willson; Saraswati Sukumar; Kornelia Polyak; Ben Ho Park; Charit L Pethiyagoda; P V Krishna Pant; Dennis G Ballinger; Andrew B Sparks; James Hartigan; Douglas R Smith; Erick Suh; Nickolas Papadopoulos; Phillip Buckhaults; Sanford D Markowitz; Giovanni Parmigiani; Kenneth W Kinzler; Victor E Velculescu; Bert Vogelstein
Journal:  Science       Date:  2007-10-11       Impact factor: 47.728

View more
  608 in total

1.  The JAK2/STAT3 signaling pathway is required for growth of CD44⁺CD24⁻ stem cell-like breast cancer cells in human tumors.

Authors:  Lauren L C Marotta; Vanessa Almendro; Andriy Marusyk; Michail Shipitsin; Janina Schemme; Sarah R Walker; Noga Bloushtain-Qimron; Jessica J Kim; Sibgat A Choudhury; Reo Maruyama; Zhenhua Wu; Mithat Gönen; Laura A Mulvey; Marina O Bessarabova; Sung Jin Huh; Serena J Silver; So Young Kim; So Yeon Park; Hee Eun Lee; Karen S Anderson; Andrea L Richardson; Tatiana Nikolskaya; Yuri Nikolsky; X Shirley Liu; David E Root; William C Hahn; David A Frank; Kornelia Polyak
Journal:  J Clin Invest       Date:  2011-07       Impact factor: 14.808

2.  Establishment and characterization of novel human primary and metastatic anaplastic thyroid cancer cell lines and their genomic evolution over a year as a primagraft.

Authors:  Manoj Garg; Ryoko Okamoto; Yasunobu Nagata; Deepika Kanojia; Subhashree Venkatesan; Anand M T; Glenn D Braunstein; Jonathan W Said; Ngan B Doan; Quoc Ho; Tadayuki Akagi; Sigal Gery; Li-Zhen Liu; Kar Tong Tan; Wee Joo Chng; Henry Yang; Seishi Ogawa; H Phillip Koeffler
Journal:  J Clin Endocrinol Metab       Date:  2014-11-03       Impact factor: 5.958

Review 3.  Practical implications of gene-expression-based assays for breast oncologists.

Authors:  Aleix Prat; Matthew J Ellis; Charles M Perou
Journal:  Nat Rev Clin Oncol       Date:  2011-12-06       Impact factor: 66.675

Review 4.  Clinical implementation of comprehensive strategies to characterize cancer genomes: opportunities and challenges.

Authors:  Laura E MacConaill; Paul Van Hummelen; Matthew Meyerson; William C Hahn
Journal:  Cancer Discov       Date:  2011-09       Impact factor: 39.397

Review 5.  Molecular prescreening to select patient population in early clinical trials.

Authors:  Jordi Rodón; Cristina Saura; Rodrigo Dienstmann; Ana Vivancos; Santiago Ramón y Cajal; José Baselga; Josep Tabernero
Journal:  Nat Rev Clin Oncol       Date:  2012-04-03       Impact factor: 66.675

6.  Obesity-insulin targeted genes in the 3p26-25 region in human studies and LG/J and SM/J mice.

Authors:  Aldi T Kraja; Heather A Lawson; Donna K Arnett; Ingrid B Borecki; Ulrich Broeckel; Lisa de las Fuentes; Steven C Hunt; Michael A Province; James Cheverud; D C Rao
Journal:  Metabolism       Date:  2012-03-03       Impact factor: 8.694

7.  Clonal architecture of secondary acute myeloid leukemia.

Authors:  Matthew J Walter; Dong Shen; Li Ding; Jin Shao; Daniel C Koboldt; Ken Chen; David E Larson; Michael D McLellan; David Dooling; Rachel Abbott; Robert Fulton; Vincent Magrini; Heather Schmidt; Joelle Kalicki-Veizer; Michelle O'Laughlin; Xian Fan; Marcus Grillot; Sarah Witowski; Sharon Heath; John L Frater; William Eades; Michael Tomasson; Peter Westervelt; John F DiPersio; Daniel C Link; Elaine R Mardis; Timothy J Ley; Richard K Wilson; Timothy A Graubert
Journal:  N Engl J Med       Date:  2012-03-14       Impact factor: 91.245

8.  On oncogenes and tumor suppressor genes in the mammary gland.

Authors:  Rushika M Perera; Nabeel Bardeesy
Journal:  Cold Spring Harb Perspect Biol       Date:  2012-06-01       Impact factor: 10.005

9.  Evolution of tumour biology upon progression. Do we know our enemy?

Authors:  Ana Lluch; Ana Bosch
Journal:  Clin Transl Oncol       Date:  2012-06       Impact factor: 3.405

10.  Biological resonance for cancer metastasis, a new hypothesis based on comparisons between primary cancers and metastases.

Authors:  Dongwei Gao; Sha Li
Journal:  Cancer Microenviron       Date:  2013-11-10
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.