| Literature DB >> 30267146 |
Ke Xu1, Liang Ding1, Ti-Cheng Chang1, Ying Shao1, Jason Chiang2, Heather Mulder1, Shuoguo Wang1, Tim I Shaw1, Ji Wen2, Laura Hover3, Clay McLeod1, Yong-Dong Wang1, John Easton1, Michael Rusch1, James Dalton2, James R Downing2, David W Ellison4, Jinghui Zhang5, Suzanne J Baker6, Gang Wu7.
Abstract
Double minute chromosomes are extrachromosomal circular DNA fragments frequently found in brain tumors. To understand their evolution, we characterized the double minutes in paired diagnosis and relapse tumors from a pediatric high-grade glioma and four adult glioblastoma patients. We determined the full structures of the major double minutes using a novel approach combining multiple types of supporting genomic evidence. Among the double minutes identified in the pediatric patient, only one carrying EGFR was maintained at high abundance in both samples, whereas two others were present in only trace amounts at diagnosis but abundant at relapse, and the rest were found either in the relapse sample only or in the diagnosis sample only. For the EGFR-carrying double minutes, we found a secondary somatic deletion in all copies at relapse, after erlotinib treatment. However, the somatic mutation was present at very low frequency at diagnosis, suggesting potential resistance to the EGFR inhibitor. This mutation caused an in-frame RNA transcript to skip exon 16, a novel transcript isoform absent in EST database, as well as about 700 RNA-seq of normal brains that we reviewed. We observed similar patterns involving longitudinal copy number shift of double minutes in another four pairs (diagnosis/relapse) of adult glioblastoma. Overall, in three of five paired tumor samples, we found that although the same oncogenes were amplified at diagnosis and relapse, they were amplified on different double minutes. Our results suggest that double minutes readily evolve, increasing tumor heterogeneity rapidly. Understanding patterns of double minute evolution can shed light on future therapeutic solutions to brain tumors carrying such variants.Entities:
Keywords: Clonal evolution; Copy number alteration; Double minutes; Structural variation; Tumor heterogeneity
Mesh:
Year: 2018 PMID: 30267146 PMCID: PMC6338707 DOI: 10.1007/s00401-018-1912-1
Source DB: PubMed Journal: Acta Neuropathol ISSN: 0001-6322 Impact factor: 17.088
Fig. 1Predicted structures of double minutes and FISH analysis in the diagnosis and relapse samples of SJHGG019. a CIRCOS plots from the most inner circle to the most outer circle represent the highly amplified genomic segments, sequencing coverage of the diagnosis/relapse sample (blue/maroon) and genomic coordinates (×1000 kb) of the chromosomes. SVs between the segment boundaries are shown as arrows indicating the orientations of the joined segments constituting a double minutes. For each sample, the SVs associated with different double minute structures are colored differently. b Tri-Color FISH experiment shows that EGFR (green) in both samples, CDK6 (aqua) in both samples, MYC (magenta) in the relapse sample only, are localized to multiple punctae that are widely distributed in the nucleus of representative cells, suggesting that the presence of different double minutes. For each sample, the four panels represent FISH results for all three genes together and each gene separately
Fig. 2Linked-reads data validation for dmV, dmVI, and dmVII in the relapse sample of SJHGG019. a Validation of dmV. The heat maps above dmV represent barcode sharing of the linked-reads between two immediately joined segments, and the heat maps below dmV represent barcode sharing of the linked-reads between two segments spaced by another segment. The red arrows represent the SVs validated by barcode sharing of linked-reads with strong evidence. The gray arrows represent the SV validated by barcode sharing of linked-reads with less strong evidence due to the long distance in between. The scale of the segments’ lengths is slightly adjusted to accommodate all the heat maps. b Invalidation of dmVI and dmVII. The heat maps represent barcode sharing of the linked-reads between two immediately joined segments. While seg27 is indeed linked with 5′ of seg9 in dmVI, it stops at 3′ of seg9 and does not extend to seg10 as predicted in the structure. Similarly, while seg19 is indeed linked with 3′ of seg9 in dmVII, it stops at 5′ of seg9 and does not extend to seg8 as predicted in the structure
Fig. 3Copy number estimation of the double minutes and expression analysis of the genes carried by them in the relapse sample of SJHGG019. a Variant allele frequency (VAF) vs. coverage for SNVs in segments specific to dmI, dmII, dmIII, and dmIV. b Distribution of the ratio of alternative allele reads count versus reference allele reads count of the germline SNVs on the upper half of panel A for dmI, dmII, dmIII, and dmIV. The vertical dashed line represents the median of the distribution, i.e., the estimated copy number of the double minute per cell. c Gene expression (characterized by log2FPKM) comparison between the relapse sample of SJHGG019 and the other non-brainstem pediatric HGG samples for genes carried by one or two identified double minutes. Each violin plot represents the distribution of gene expression levels across the samples, and the red dot represents the expression level of SJHGG019 relapse sample
Fig. 4Characterization and evolution of the shared double minutes between the diagnosis and relapse samples of SJHGG019. a Highly amplified segments around EGFR are involved in multiple double minutes in both samples. Seg14 deleted in dm1 results in EGFRvIII. b Variant allele frequency comparison of the SNVs shared by dm1-specific segments in the diagnosis sample and dmIII-specific segments in the relapse sample. On the upper left corner are four somatic SNVs with low frequency (0.02) in the diagnosis sample and high frequency (0.96) in the relapse sample. c Among the four somatic SNVs, three are located close to each other on exon 16 of EGFR. Along with these three SNVs is an eight base pair deletion that disrupts the intron15/exon16 splicing acceptor site, leading to skipping of exon 16 in the EGFR RNA transcript, indicated by the missing coverage of exon16 in RNA-seq coverage track and large number of supporting reads for the novel splice junction. Consequently, the skipping of exon 16 produces an EGFR protein product with an in-frame deletion of 13-amino acids (GCTGPGLEGCPTN), which partially code for two protein domains: growth factor receptor domain IV (GF_recep_IV) and transmembrane ERBB1 like domain (TM_ErbB1). d Evolution of the EGFR-containing double minutes from diagnosis (dm1) to relapse (dmIII). Black circles represent double minutes carrying wild-type EGFR (wtEGFR), blue circles represent double minutes carrying EGFRvIII, and red circles represent double minutes carrying EGFR with exon 16 deletion in RNA transcript (EGFRxE16). The counts of circles in different colors reflect the relative abundance of each type of EGFR as estimated from sequencing data
Fig. 5A schematic diagram illustrating the evolutionary trajectory of double minutes. The timeline at the bottom shows the time at diagnosis, treatment and relapse. Each circle represents a double minutes with unique structure. Circles in blue and maroon represent highly amplified double minutes in the diagnosis and relapse samples; circles in light blue and light maroon represent double minutes with low abundance in diagnosis and relapse samples. Every double minutes is mutable and can acquire new somatic SNVs, INDELs, and SVs which are then subject to selection. While some diagnosis double minutes may be eliminated by treatment, some may survive and expand. In addition, new double minutes can form after the treatment