Tamara J Hagoel1,2, Eduardo Cortes Gomez3, Ajay Gupta1,2, Clare J Twist1,2, Rafal Kozielski4, Jeffrey C Martin5, Lingui Gao5, Joseph Kuechle6, Prashant K Singh7, Miranda Lynch8, Lei Wei8, Song Liu3, Jianmin Wang3, Joyce E Ohm5. 1. Department of Pediatrics, University at Buffalo Jacobs School of Medicine and Biomedical Sciences, Buffalo, New York 14203, USA. 2. Department of Pediatric Hematology-Oncology, Roswell Park Comprehensive Cancer Center, Buffalo, New York 14203, USA. 3. Department of Biostatistics & Bioinformatics, Roswell Park Comprehensive Cancer Center, Buffalo, New York 14203, USA. 4. Department of Pathology, Oishei Children's Hospital, Buffalo, New York 14203, USA. 5. Department of Cancer Genetics and Genomics, Roswell Park Comprehensive Cancer Center, Buffalo, New York 14203, USA. 6. Department of Medicine, Roswell Park Comprehensive Cancer Center, Buffalo, New York 14203, USA. 7. Center for Personalized Medicine, Roswell Park Comprehensive Cancer Center, Buffalo, New York 14203, USA. 8. Hauptman-Woodward Structural Institute, Buffalo, New York 14203, USA.
Abstract
Undifferentiated soft tissue sarcomas (UDSTSs) are a group of mesenchymal tumors that remain a diagnostic challenge because of their morphologic heterogeneity and unclear histologic origin (Peters et al., Mod Pathol 28: 575 [2015]). In this case report, we present the first multiomics molecular signature for a BCOR-CCNB3 sarcoma (BCS) that includes mutation analysis, gene expression, DNA methylation, and micro RNA (miRNA) expression. We identify a paucity of additional mutations in this tumor and detail that there is significant dysregulation of gene expression of epigenetic remodeling agents including key members of the PRC, Sin3A/3b, NuRD, and NcoR/SMRT complexes and the DNA methyltransferases DNMT1, DNMT3a, and DNMT3b. This is accompanied by significant DNA methylation changes and dysregulation of multiple miRNAs with known links to tumorigenesis. This study significantly increases our understanding of the BCOR effects on fusion-positive undifferentiated sarcomas at both the genomic and epigenomic level and suggests that as better-tailored and more refined treatment algorithms continue to evolve, epigenetic modifying agents should be further evaluated for their efficacy against these tumors.
Undifferentiated soft tissue sarcomas (UDSTSs) are a group of mesenchymal tumors that remain a diagnostic challenge because of their morphologic heterogeneity and unclear histologic origin (Peters et al., Mod Pathol 28: 575 [2015]). In this case report, we present the first multiomics molecular signature for a BCOR-CCNB3 sarcoma (BCS) that includes mutation analysis, gene expression, DNA methylation, and micro RNA (miRNA) expression. We identify a paucity of additional mutations in this tumor and detail that there is significant dysregulation of gene expression of epigenetic remodeling agents including key members of the PRC, Sin3A/3b, NuRD, and NcoR/SMRT complexes and the DNA methyltransferases DNMT1, DNMT3a, and DNMT3b. This is accompanied by significant DNA methylation changes and dysregulation of multiple miRNAs with known links to tumorigenesis. This study significantly increases our understanding of the BCOR effects on fusion-positive undifferentiated sarcomas at both the genomic and epigenomic level and suggests that as better-tailored and more refined treatment algorithms continue to evolve, epigenetic modifying agents should be further evaluated for their efficacy against these tumors.
Undifferentiated soft tissue sarcomas (UDSTSs) are a group of mesenchymal tumors that remain a diagnostic challenge because of their morphologic heterogeneity and unclear histologic origin (Peters et al. 2015b). Recurrent gene fusions have been implicated in many UDSTSs and their presence is often used to refine diagnosis (Mertens et al. 2016; Laetsch et al. 2018). In 2012, Pierron et al. discovered a subset of round-cell undifferentiated sarcomas that share morphologic and immunohistochemical properties with Ewing sarcoma (ES), yet lack the defining ES chimeric gene fusions between EWSR1 and FUS to the ETS family of transcription factors. These rare mimickers acquired the title “Ewing-like” sarcomas and result from an intrachromosomal paracentric inversion on the short arm of Chromosome X fusing the BCOR and CCNB3 genes together. Multiple breakpoints have been identified in the 3′ untranslated region (UTR) or the MID1IP1 intergenic region of BCOR and intron 4 of CCNB3. A splice acceptor site is embedded in the stop codon of BCOR’s exon 15 that splices with exon 5 of CCNB3. The fusion transcript thus is the product of BCOR exons 1–15 fusing to CCNB3 exons 5–12, and its expression relies on the BCOR promoter (Pierron et al. 2012; Puls et al. 2014; Peters et al. 2015b; Machado et al. 2016; Le Loarer et al. 2017; Kao et al. 2018; Renzi et al. 2019). Gene expression profiling suggests that these tumors are biologically distinct from ES and other pediatric round-cell sarcomas (Pierron et al. 2012; Kao et al. 2018; Renzi et al. 2019).Since their discovery, BCOR–CCNB3 sarcomas (BCSs) continue to be a subject of investigation and clinical interest. Several additional case series have highlighted the clinical, morphologic, and genomic similarities and differences between BCS and ES (Puls et al. 2014; Peters et al. 2015b; Argani et al. 2017; Kao et al. 2018; Koelsche et al. 2018). Other reports suggest an overlap between clear cell sarcoma of the kidney (CCSK) and other BCOR-related sarcomas of the kidney and soft tissues including BCS and primitive myxoid mesenchymal tumor of infancy (PMMTI) (Argani et al. 2017; Kao et al. 2018), many of which contain BCOR internal tandem duplications (BCOR-ITDs) (Astolfi et al. 2019). A distinctive BCOR-driven transcriptional profile has been identified, suggesting that all of these tumors are genetically related (Argani et al. 2017), although additional research is needed at the molecular level to determine whether these tumors should be considered the same entity for clinical purposes. We now present the first case report detailing a multiomics molecular signature for BCS that includes mutation analysis, gene expression, DNA methylation, and micro RNA (miRNA) expression.BCS is most often detected in bone with equal distribution in the axial skeleton and long bones and less commonly present in soft tissue and rarely the viscera (most often the kidney) of primarily adolescent males (Pierron et al. 2012; Puls et al. 2014; Peters et al. 2015b; Le Loarer et al. 2017; Kao et al. 2018). Histologically, BCS has a round or ovoid and sometimes spindle-shaped appearance. Molecularly, they exhibit patchy cytoplasmic expression of CD99 and diffuse intranuclear expression of CCNB3 and BCOR (Kao et al. 2016; Le Loarer et al. 2017; Kao et al. 2018). Currently, a definitive diagnosis of BCS relies on the detection of the BCOR–CCNB3 translocation by reverse transcriptase-polymerase chain reaction (RT-PCR) and/or fluorescence in situ hybridization (FISH) (Pierron et al. 2012; Puls et al. 2014; Matsuyama et al. 2017; Kao et al. 2018; Renzi et al. 2019).Because of the rarity and novelty of BCS, prospective clinical trial investigation of children with such diagnoses is lacking. Therefore, the clinical course and therapy-related response of patients with BCS can only be gleaned from case studies and retrospective reviews. As no uniform standardized treatment algorithm exists for BCS, patients identified with BCS sarcomas have been managed according to a variety of more established sarcoma regimens; most commonly utilized are regimens designed to treat ES. Treatment modalities have included a combination of surgical resection, focal radiation, and/or multiagent systemic neoadjuvant versus adjuvant chemotherapy with agents such as doxorubicin, etoposide, vincristine, cyclophosphamide, and ifosfamide (Pierron et al. 2012; Cohen-Gogo et al. 2014; Puls et al. 2014; Peters et al. 2015b; Kao et al. 2018). Retrospective data suggest that patients with BCS have a 5-yr survival rate equal to that of the localized ES (Cohen-Gogo et al. 2014; Puls et al. 2014; Peters et al. 2015b; Machado et al. 2016; Laetsch et al. 2018).Here we present the clinical presentation, morphologic characteristics, treatment course and outcome, and associated multiomics molecular profile of a single adolescent male patient diagnosed with a BCOR–CCNB3 undifferentiated round-cell sarcoma of the kidney. Treatment of our patient according to a regimen used to treat CCSK resulted in complete remission and no evidence of disease >36 mo following completion of therapy.
RESULTS
Clinical Presentation and Family History
A previously healthy 14-yr-old male with a history of a well-controlled seizure disorder presented to the emergency room with a 2-d history of left flank pain and a 3-d history of brown-tinged urine progressing to frank hematuria. There was no notable history of trauma, night sweats, chills, weight loss, or fevers. Upon initial evaluation, blood chemistries showed no evidence of tumor lysis syndrome, and urinalysis and urine cultures were negative for a urinary tract infection. His hemoglobin was 13.6 mg/dL and creatinine was 0.82 mg/dL. Family history was notable for maternal history of kidney stones, but no family history of cancer was documented.Renal ultrasound with Doppler computed tomography (CT) scan of the torso and magnetic resonance imaging (MRI) scan of the abdomen and pelvis revealed a well-circumscribed enhancing focal left posterior upper pole 3.2 × 2.0 × 2.2-cm hypoechoic intrarenal lesion concerning a primary renal neoplasm with secondary hemorrhage into the upper pole calyces and resultant central collecting system dilation with hazy perinephric fat stranding adjacent to the renal hilum. No renal calculi or vascular thromboses were identified and no other lesions concerning pathologic lymphadenopathy or metastatic disease were noted in the abdomen or chest.Urine cytology was sent for pathologic examination with no evidence of malignancy. The patient underwent an open left nephrectomy with regional lymph node sampling. Our patient did well postoperatively with no complications. The mass exhibited focal extension from the left renal pelvicalyceal system into the renal sinus soft tissue with no lymph node involvement. The preliminary pathology report morphologically identified a spindle cell neoplasm of the kidney. The following immunohistochemical stains were performed: vimentin, CD99, CD10, ER, PR, CD34, EMA, CK, desmin, Myod1, actin, Bcl-2, PAX8, WT1, and TLE1. Vimentin and Bcl-2 stains were strongly positive and suggestive of a CCSK, but nondiagnostic. Consultation with the Children's Oncology Group (COG) renal tumor pathologist facilitated further immunohistochemical testing which identified strong nuclear BCOR positivity and a BCOR–CCNB3 fusion transcript was confirmed by PCR; our patient was therefore diagnosed with a stage II BCOR–CCNB3 undifferentiated round-cell sarcoma of the kidney.Baseline brain MRI and positron emission tomography scans were performed prior to initiation of treatment with no evidence of residual or metastatic disease. The patient was treated as per a COG protocol designed for the treatment of CCSK (AREN0322, Regimen I) given the BCOR abnormalities associated with CCSK. Our patient received a total of six fractions of radiation to the left flank totaling 1080 cGy followed by a multiagent chemotherapy regimen consisting of alternating cycles of cyclophosphamide, doxorubicin, and vincristine with cyclophosphamide and etoposide over a course of 24 wk. His cumulative doses equaled the following: 14,000 mg/m2 of cyclophosphamide, 225 mg/m2 of doxorubicin, and 2000 mg/m2 of etoposide and vincristine 22 mg/m2.
Treatment Outcomes
Our patient suffered a total of three generalized tonic–clonic seizures during his treatment course. The seizures were managed by his neurologist and they subsided following adjustments to his anti-epileptic medications. He suffered no additional adverse events during the course of treatment. He completed his therapy following 6 mo of multimodal therapy. After therapy completion, our patient was evaluated for relapsed disease and secondary complications every 3 mo in our pediatric clinic with surveillance scans, a physical exam, and routine laboratory screening. He is presently 36 mo out from completion of therapy without any evidence of recurrent or relapsed disease.
Genomic Analyses
To gain a better understanding of the molecular events driving tumorigenesis in this patient, we performed whole-exome sequencing (WES), RNA-sequencing (RNA-seq), global methylation analysis (Illumina EPIC array), and miRNA-sequencing (miRNA-seq) on pretreatment formalin-fixed paraffin-embedded (FFPE) abnormal and adjacent matched normal kidney tissue belonging to our patient that was taken at the time of surgical resection.
Mutation Analysis
To identify possible additional mutations in our pretreatment tumor sample, WES was performed to identify somatic mutations in intronic, exonic, and protein-coding regions. Somatic DNA mutations were determined based on a comparison of the patient's tumor to germline from his matched normal tissue using multiple independent mutations’ calling algorithms. The somatic variants were called using a multiple-caller ensembled method, including MuTect2 (v4.1.2.0) (Cibulskis et al. 2013), MuSE (v1.0) (Fan et al. 2016), Strelka2 (v2.9.10) (Saunders et al. 2012), and VarDict (v1.8) (Lai et al. 2016) and manually reviewed to remove false-positive calls. The analysis pipelines are reproducible and publicly available through the Rcwl package (Hu et al. 2021). A total of 32 genes were identified to potentially harbor mutations, five of which were common among both mutations’ calling packages and with sufficient read support (>20 reads from each of the tumor and normal samples spanning the mutation region) for consideration as potential mutations in this patient's tumor (Table 1). A sixth mutation in the NOX4 was identified to be in a repetitive region and likely attributed to being caused by a mapping artifact. Silent mutations were detected in two genes (IGFBP3 and FAM134C), whereas missense mutations were identified in the three other genes (ACP2, MLC1, and MPHOSPH9).
Table 1.
Genetic mutations detected in tumor specimen
Gene
Chromosome
HGVS DNA reference
HGVS protein reference
Variant type
Predicted effect
dbSNP/dbVarID
Genotype
Positiona
Mutant reads
Coverage
Reference allele
Mutant allele
IGFBP3
7
NM_001013398
L16L
Silent
N/A
N/A
N/A
45960694
10
58
G
A
ACP2
11
NM_001610
T215M
Missense
N/A
N/A
N/A
47266414
41
127
G
A
MPHOSPH9
12
NM_022782
S562L
Missense
N/A
N/A
N/A
123687267
14
69
G
A
FAM134C
17
NM_178126
E443E
Silent
N/A
N/A
N/A
40733903
16
73
C
T
MLC1
22
NM_015166
F76L
Missense
N/A
N/A
?N/A
50521554
15
75
A
G
(N/A) Not available.
aReference assembly GRCh37.
Genetic mutations detected in tumor specimen(N/A) Not available.
aReference assembly GRCh37.The ACP2 gene encodes lysosomal acid phosphatase 2, an enzyme found in the lysosome organelle. Mutations in the ACP2 gene are associated with bone structure abnormalities, lysosomal storage defects, metabolic disorders, and abnormalities in the central nervous system. ACP2 overexpression has been implicated in human colorectal cancer (Lee et al. 2017). MLC1 mutations have been associated with megalencephalic leukoencephalopathy with subcortical cysts, a neurodegenerative disorder, as well as seizures (Dubey et al. 2018). MPHOSPH9 polymorphisms are associated with multiple sclerosis susceptibility (Mowry et al. 2013). None of the above genes have been previously implicated in sarcomagenesis. The functional effects of our patient's specific mutations and their correlations to any clinical phenotypes are unknown and remain an area for future investigation. Overall, the mutation rate was very low compared to other cancer types, and after all quality filters and manual review, it was determined that this cancer appears to have a very quiet genome.
Transcriptome Analysis
To identify differentially expressed genes between tumor and normal kidney and to identify genes and pathways that may be driving tumorigenesis in this patient, RNA-seq was performed comparing tumor to the adjacent normal kidney. Importantly, RNA-seq confirmed the presence of a BCOR–CCNB3 fusion in the patient tumor sample that was absent from the normal kidney. Several other gene fusion events were identified using the STAR-Fusion program from STAR package (Haas et al. 2019). However, only two of these gene fusion calls were supported by a significant number of read counts in the RNA-seq data. First, the BCOR–CCNB3 fusion was present as expected. The fusion-specific breakpoint was identified in BCOR exon 15 at Chr X: 39,911,366 and exon 5 of CCNB3 at Chr X: 50,051,505 (Fig. 1). A fusion between Chr Y and Chr 16 resulting in an RNA5 8SP6–RNA5 8SP2 fusion was also identified (data not shown), but was deemed to be a low confidence observation, as no spanning fragment reads were called. RNA5-8SP6 (RNA, 5.8S Ribosomal Pseudogene 6) and RNA5-8SP2 (RNA, 5.8S Ribosomal Pseudogene 2) are both pseudogenes, nonfunctional segments of DNA that resemble functional genes that arise as extra copies of functional genes, either directly by DNA duplication or indirectly by reverse transcription of an mRNA transcript. The biological significance of the presence of this potential second fusion is unclear and may represent a sequencing artifact.
Figure 1.
Clinical evaluation. (A) Genomic locations for BCOR (Xp11.4, Chr X:40,049,815-40,177,390) and CCNB3 (Xp11.22, Chr X:50,202,713–50,351,914) genes on Chromosome X. (B) A schematic of the X chromosome with strand orientation indicated for the BCOR and CCNB3 genes. Exonic regions for the BCOR and CCNB3 genes are shown. The breakpoints are indicated by the thunderbolt with the resultant paracentric inversion. (C) Schematic depicting the resulting fusion between BCOR exons 1–15 and CCNB3 exons 5–12. (D) Depiction of BCOR and CCNB3 gene protein domains. (E) Computed tomography (CT) scan of pelvis with IV contrast. The left kidney demonstrates diffusely abnormal contrast uptake, which is delayed in comparison to the right side. There is moderate left hydronephrosis with dilatation of the major calyces. At the upper pole of the left kidney, there is some segmental hypoperfusion with a round enhancing structure projecting into one of the upper pole major calyces. (F) Gross pathology of a BCOR–CCNB3 sarcoma of the kidney. A cut section of the solid neoplasm with areas of hemorrhage and necrosis, bulging into the renal pelvis. Although the tumor appears well-circumscribed, the lesion infiltrates soft tissue of the renal sinus. (G) Tumor cells show positivity for bcl-2 (Bcl-2, IHC, 10× objective). (H) Strong vimentin expression is also present (vimentin, immunohistochemistry [IHC], 10× objective). (I) Hematoxylin and eosin (H&E) stain, 20× objective. Histologic findings include monomorphic spindle cell morphology focally arranged in a vague fascicular pattern intermixed with a few small round blue cells (arrow) and alternating hypocellular and hypercellular areas (not shown). (J) H&E stain, 20× objective. Histologic findings show infiltration of tumor cells.
Clinical evaluation. (A) Genomic locations for BCOR (Xp11.4, Chr X:40,049,815-40,177,390) and CCNB3 (Xp11.22, Chr X:50,202,713–50,351,914) genes on Chromosome X. (B) A schematic of the X chromosome with strand orientation indicated for the BCOR and CCNB3 genes. Exonic regions for the BCOR and CCNB3 genes are shown. The breakpoints are indicated by the thunderbolt with the resultant paracentric inversion. (C) Schematic depicting the resulting fusion between BCOR exons 1–15 and CCNB3 exons 5–12. (D) Depiction of BCOR and CCNB3 gene protein domains. (E) Computed tomography (CT) scan of pelvis with IV contrast. The left kidney demonstrates diffusely abnormal contrast uptake, which is delayed in comparison to the right side. There is moderate left hydronephrosis with dilatation of the major calyces. At the upper pole of the left kidney, there is some segmental hypoperfusion with a round enhancing structure projecting into one of the upper pole major calyces. (F) Gross pathology of a BCOR–CCNB3 sarcoma of the kidney. A cut section of the solid neoplasm with areas of hemorrhage and necrosis, bulging into the renal pelvis. Although the tumor appears well-circumscribed, the lesion infiltrates soft tissue of the renal sinus. (G) Tumor cells show positivity for bcl-2 (Bcl-2, IHC, 10× objective). (H) Strong vimentin expression is also present (vimentin, immunohistochemistry [IHC], 10× objective). (I) Hematoxylin and eosin (H&E) stain, 20× objective. Histologic findings include monomorphic spindle cell morphology focally arranged in a vague fascicular pattern intermixed with a few small round blue cells (arrow) and alternating hypocellular and hypercellular areas (not shown). (J) H&E stain, 20× objective. Histologic findings show infiltration of tumor cells.Recognizing the absence of significant mutations beyond the BCOR–CCNB3 fusion and in an attempt to increase our understanding of the genes and pathways driving tumorigenesis in our patient's tumor, differential gene expression (DGE) analysis was performed on aligned reads. Fold-change (FC) values and differential expression analyses for the two samples were estimated using EBSeq (Leng et al. 2013) (v1.18.1), which utilizes an empirical Bayesian model and can adequately handle data without replicates. Pathway gene expression analysis was performed using gene set enrichment analysis (GSEA). The top 10 up-regulated pathways in our tumor sample in comparison to paired normal kidney were analyzed and are shown in Table 2 with corresponding enrichment plots in Figure 2. Our analysis demonstrates up-regulated pathways related to cell cycle processes, DNA damage repair, transcriptional regulation, epigenetic remodeling, and activation of HOX genes.
Table 2.
Gene set enrichment analysis: top 10 up-regulated pathways in tumor versus normal
Gene set enrichment analysis (GSEA) shows the top 10 up-regulated pathways in tumor versus normal. Enrichment plots are shown for GSEA top 10 pathways up-regulated in BCOR–CCNB3 sarcoma versus normal kidney control. Enrichment values can be seen in Table 2. Pathways are enriched for cell cycle processes, DNA damage repair, transcriptional regulation, epigenetic remodeling, and activation of HOX genes.
Gene set enrichment analysis (GSEA) shows the top 10 up-regulated pathways in tumor versus normal. Enrichment plots are shown for GSEA top 10 pathways up-regulated in BCOR–CCNB3 sarcoma versus normal kidney control. Enrichment values can be seen in Table 2. Pathways are enriched for cell cycle processes, DNA damage repair, transcriptional regulation, epigenetic remodeling, and activation of HOX genes.Gene set enrichment analysis: top 10 up-regulated pathways in tumor versus normal(ES) Enrichment score, (NES) normalized enrichment score, (NOM) nominal p-value, (FDR) false discovery rate.The top 10 down-regulated pathways (Table 3; Fig. 3) identified in our tumor sample compared to paired normal kidney were primarily related to cellular and drug metabolism, cholesterol and electrolyte homeostasis, and biologic processes related to the kidney.
Table 3.
GSEA: top 10 down-regulated pathways in tumor versus normal
Gene set enrichment analysis (GSEA) shows the top 10 down-regulated pathways in tumor versus normal. Enrichment plots are shown for GSEA top 10 pathways down-regulated in the BCOR–CCNB3 sarcoma versus normal kidney control. Enrichment values can be seen in Table 3. Pathways are primarily focused on cellular and drug metabolism, cholesterol and electrolyte homeostasis, and biologic processes related to normal kidney function.
Gene set enrichment analysis (GSEA) shows the top 10 down-regulated pathways in tumor versus normal. Enrichment plots are shown for GSEA top 10 pathways down-regulated in the BCOR–CCNB3 sarcoma versus normal kidney control. Enrichment values can be seen in Table 3. Pathways are primarily focused on cellular and drug metabolism, cholesterol and electrolyte homeostasis, and biologic processes related to normal kidney function.GSEA: top 10 down-regulated pathways in tumor versus normal(ES) Enrichment score, (NES) normalized enrichment score, (NOM) nominal p-value, (FDR) false discovery rate.This is not unexpected with normal kidney serving as the control sample. The cell of origin of this cancer is unknown, and the normal adjacent kidney sample taken at the time of surgical resection was the best available control for this study. Many of the pathways identified as being enriched in the tumor contained overlapping lists of genes with known roles in epigenetic regulation. Multiple genes encoding key players of the PRC2, PRC4, NuRD, NCoR, and mSIN3A epigenetic remodeling complexes were found to be differentially expressed in this tumor, suggesting an extensive dysregulation of the epigenome in this patient's tumor. These results are summarized in Figure 4.
Figure 4.
Differential gene expression in epigenetic remodeling complexes. Messenger RNA sequencing (mRNA-seq) identified numerous epigenetic regulatory genes that are differentially expressed in the BCOR–CCNB3 tumor versus normal. Genes associated with the canonical and noncanonical Polycomb repressive complex 1 (PRC1) and PRC2, SIN3A/3B, NuRD, and NCoR/SMRT chromatin-remodeling complexes and DNA methyltransferases are shown. Genes with greater than twofold up-regulation are shown in green text. Genes with greater than twofold down-regulation in tumor versus normal are shown in red text. Genes that are not differentially expressed are shown in black text.
Differential gene expression in epigenetic remodeling complexes. Messenger RNA sequencing (mRNA-seq) identified numerous epigenetic regulatory genes that are differentially expressed in the BCOR–CCNB3 tumor versus normal. Genes associated with the canonical and noncanonical Polycomb repressive complex 1 (PRC1) and PRC2, SIN3A/3B, NuRD, and NCoR/SMRT chromatin-remodeling complexes and DNA methyltransferases are shown. Genes with greater than twofold up-regulation are shown in green text. Genes with greater than twofold down-regulation in tumor versus normal are shown in red text. Genes that are not differentially expressed are shown in black text.
DNA Methylation
Because of the known roles of BCOR in the regulation of the epigenome and results of the transcriptomic data suggesting a dramatic dysregulation of multiple epigenetic complexes, we next focused our attention on the epigenome. DNA methylation analysis was performed using the Illumina EPIC DNA methylation array to identify differentially methylated CpG dinucleotides in tumor versus normal and to determine if this differential methylation had an impact on gene expression. Differentially methylated CpG dinucleotides were identified as those having a loss or gain of methylation >20% in tumor versus normal (corresponding to a Delta beta value of <−0.2 or >0.2, respectively). Because of the single patient sample included in this analysis, it is difficult to draw statistically relevant conclusions, but we identified a total of 99,720 CpG sites as being potentially differentially methylated in this tumor. Consistent with what is frequently observed in other cancers, the majority of these sites exhibited a loss of methylation, with 55,779 sites with a Delta beta of <−0.2 and 43,824 sites showing a gain of methylation with a Delta beta of >0.2. A closer look at this data also reveals that CpG with a loss of methylation in the tumor were more likely to be found outside of and not associated with CpG islands (Fig. 5A), whereas gains in methylation were more likely to occur within or adjacent to a CpG island (Fig. 5B). This is true independently of the location of the CpG island in proximity to known gene features (Fig. 5C), an observation that is also consistent with what is known about global methylation changes in cancer. Particularly striking in the disproportionate gain in methylation in CpG islands in the 3′ UTR of genes, within gene bodies, and at exon boundaries within genes (Fig. 5C). This suggests that altered methylation may particularly affect enhancer/transcriptional regulatory regions and isoform expression. Next, we used the aggregate methylation analysis and the DMRcate (Lent et al. 2021) package to identify differentially methylated regions (DMRs; multiple CpG sites showing differential methylation that are close to each other on genomic positions). A total of 4785 potential DMRs were identified including regions spanning multiple genes with known links to cancer including GATA3, PTEN, HIC1, etc. (Supplemental Table 1). An example DMR associated with the GATA3 gene is shown in Figure 5D.
Figure 5.
Differential methylation analysis Illumina EPIC arrays were used to identify differentially methylated CpG loci in tumor versus normal. (A) Total number of CpG dinucleotides shows a >20% (Delta beta >0.2) loss of methylation associated with CpG islands. (B) Total number of CpG dinucleotides showing a >20% (Delta beta >0.2) gain of methylation in relation to position with regard to CpG islands. (C) Density of gain and loss of methylation in relation to gene features and CpG island features. (D) Differentially methylated region associated with the GATA3 gene is shown. CpG dinucleotides are shown in green. Methylation from 0.0 (blue bars) to 1.0 (red bars) shown for individual CpG sites (middle panel) and across the region (lower panel) b-value for tumor (orange) and normal (blue) are shown.
Differential methylation analysis Illumina EPIC arrays were used to identify differentially methylated CpG loci in tumor versus normal. (A) Total number of CpG dinucleotides shows a >20% (Delta beta >0.2) loss of methylation associated with CpG islands. (B) Total number of CpG dinucleotides showing a >20% (Delta beta >0.2) gain of methylation in relation to position with regard to CpG islands. (C) Density of gain and loss of methylation in relation to gene features and CpG island features. (D) Differentially methylated region associated with the GATA3 gene is shown. CpG dinucleotides are shown in green. Methylation from 0.0 (blue bars) to 1.0 (red bars) shown for individual CpG sites (middle panel) and across the region (lower panel) b-value for tumor (orange) and normal (blue) are shown.To determine whether highly differentially methylated CpG dinucleotides were associated with corresponding changes in gene expression, we also compared the methylation data to our transcriptomic data. Each methylation array probe represents specific location related to corresponding genes. For the methylation array, a log2 ratio of M-values between tumor and the normal sample was estimated for each probe for which log2FC expression of normalized tumor and normal sample is estimated for each gene. Methylation and expression data were matched according to gene symbols and plotted only selecting the most extreme points based on the corresponding distributions. Notice that this matching may entail a many-to-one mapping (e.g., a methylation site is related to many different genes and would be matched to the corresponding expression entry). Our analysis identified a total of 221 genes with increased expression and at least one CpG with increased methylation >20%; 39 genes with decreased expression and at least one CpG with increased methylation; 78 genes with increased expression and at least one CpG with decreased methylation; and only 20 genes with decreased expression and at least one CpG with increased methylation (data not shown). This relatively low correlation between gene expression and DNA methylation changes suggests that DNA methylation may be a relatively minor contributor to the observed gene expression changes in our BCOR–CCNB3 tumor.
miRNA Analysis
Next, we wanted to determine whether the BCOR–CCNB3 fusion may be altering the expression of miRNA, thus contributing to the altered transcriptional/epigenetic programming in these cells. To look at global miRNA expression levels, we performed miRNA-seq according to established protocols. Raw counts were obtained through the miRDeep2 (Friedländer et al. 2012) pipeline. and then the anamiR (Wang et al. 2019) pipeline was used to integrate miRNA and RNA data in order to detect (inversely) correlated features. We identified six miRNAs that were detected as differentially expressed. These are shown in Figure 6 (gray circles), and several of these miRNA have previously been implicated in tumorigenesis. For example, the expression of miRNA-199a-3p has been shown to suppress tumor growth, migration, invasion, and angiogenesis through negative regulation of various target genes in hepatocellular carcinoma, multiple myeloma, papillary thyroid carcinoma, and ovarian cancer (Minna et al. 2014; Raimondi et al. 2014; Deng et al. 2017; Ghosh et al. 2017). The under expression of miR-199a-3p in osteosarcoma plays an important role in the development of metastasis, recurrence, and drug resistance (Gao et al. 2015). Therefore, miR-199a-3p appears to have many important tumor suppressor roles. The miRNA-199b-3p is an isoform of miRNA-199a-3p with an identical sequence and similar functions (Dabbah et al. 2017). miR-196a-5p has been shown to promote metastasis of colorectal cancer (Xin et al. 2019). One study found that miR-23a-3p is significantly up-regulated in renal cell carcinoma tissue samples and that low expression of miR-23a-3p in patients with renal cell carcinoma was associated with longer overall survival (Quan et al. 2019). miR-204-5p is known to suppress the proliferation, migration, and invasion of endometrial carcinoma cells (Bao et al. 2013) and colorectal carcinomas cells (Yin et al. 2014). To help elucidate the role that differential miRNA expression may play in regulating gene expression, we also performed an integrated miRNA/mRNA analysis. Genes and miRNAs with small read counts were considered as unexpressed and removed from the analysis. Posterior fold-change values were estimated on miRNA and mRNA using EBseq. Two thousand four hundred and ninety-seven genes identified as differentially expressed with cutoff values of false discovery rate (FDR) < 0.05 and absolute fold change >2 were compared with the six miRNAs detected as significantly differentially expressed. We then used anamiR (Wang et al. 2019) (v1.13.0), an integrated analysis package of miRNA and mRNA expression data, to find the negatively correlated points and also to query these into 10 different known databases (such as miRDB, miRanda, etc.). We identified 86 genes that are known targets and were inversely correlated with the differentially expressed miRNA, further suggesting that these miRNAs are altering downstream target gene expression in this tumor. A summary of these results is shown in Figure 6.
Figure 6.
Detected micro RNA (miRNA)–messenger RNA (mRNA) associations. Plot depicts detected miRNA–mRNA associations for six miRNA (gray circles) identified by miRNA-seq and are differentially expressed in tumor versus normal and 86 negatively correlated, differentially expressed genes (orange circles) by mRNA-seq, which are known targets of these miRNA.
Detected micro RNA (miRNA)–messenger RNA (mRNA) associations. Plot depicts detected miRNA–mRNA associations for six miRNA (gray circles) identified by miRNA-seq and are differentially expressed in tumor versus normal and 86 negatively correlated, differentially expressed genes (orange circles) by mRNA-seq, which are known targets of these miRNA.
DISCUSSION
BCOR–CCNB3 fusion sarcomas are a rare and newly described subtype of soft-tissue sarcomas, primarily presenting in young male patients. Because of the rarity of this disease, little is known about the molecular characteristics of this tumor. In this study, WES and RNA-seq were combined with DNA methylation and miRNA analysis and performed on our patient sample with the goal of increasing our understanding the genomic and epigenomic landscape of this disease. Because of the paucity of other mutations in this tumor, it is probable that our patient's tumor is primarily driven by the in-frame fusion between BCOR and CCNB3, as expected, which is identical to those described in other case reports (Pierron et al. 2012; Peters et al. 2015b).The combined data suggest few somatic mutations and a high frequency of deregulated gene expression involving both chromatin-modifying enzymes and HOX genes. Epigenetic alterations due to the up-regulation of HOX transcription factors and proteins involved in epigenetic regulation appear to lead to the dysregulation of genes encoding several developmental programs and stem cell differentiation, which is similarly seen across a spectrum of pediatric sarcomas (Lawlor and Thiele 2012). This is not surprising given that BCOR is known to epigenetically regulate the transcription of specific genes, especially those of mesenchymal stem cell function. The Bcl6 co-repressor, or BCOR gene, encodes a transcriptional co-repressor that binds the Bcl6 oncoprotein as part of a large Polycomb repressive complex, the PRC1.1 complex that functions to repress transcriptional start sites. Interestingly, none of the common components of PRC1 were differentially expressed in our tumor compared to normal (Fig. 4). Of note, exon 15 of the BCOR gene is implicated in the development of sarcoma across a spectrum of BCOR-related diseases including BCS, CCSK, and PMMTI (Kao et al. 2018; Santiago et al. 2018). BCOR internal tandem duplications (ITDs) involving exon 15 of BCOR are well-known genetic alterations seen in CCSK and PMMTI (Roy et al. 2015). Exon 15 encodes the PCGF ubiquitin-like fold discriminator (PUFD) domain, a region that binds selectively to another domain, the ubiquitin-like RAWUL domain. This interactive binding between PUFD and ubiquitin-like RAWUL is crucial for assembly of the Polycomb group RING finger complex, a distinct Polycomb repressive complex necessary to mediate epigenetic modifications of histones (Junco et al. 2013; Kao et al. 2018). Disruption of the PRC1.1/BCOR complex and up-regulation of PRC2 targets has been described in ITD-driven CCSK and PMMTI, leading to overall Polycomb complex dysregulation. BCS tumors similarly have abnormalities involving the PUFD domain of the BCOR gene increasing our understanding of the similarities between these BCOR mutant tumors (Roy et al. 2015).Numerous epigenetically regulated genes and pathways are identified in our analysis, which may serve as the foundation for future studies focused on BCOR-associated malignancies, but only a small percentage of these DNA methylation changes is associated with corresponding changes in gene expression (data not shown), suggesting that abnormal DNA methylation is likely a consequence, not a leading cause of tumorigenesis in this patient. We did observe multiple changes in miRNA expression with known links to tumorigenesis. The clinical significance of the key miRNAs related to BCOR–CCNB3 fusion sarcomas is still unclear, but this raises important questions with regard to the role of miRNA in BCOR–CCNB3 fusion-driven sarcomas. In the absence of additional mutations in this tumor, it appears that the BCOR–CCNB3 fusion drives a global epigenetic reprogramming event that is likely to be the primary driver of transformation and tumorigenesis.As we continue to increase our understanding of the BCOR effects on fusion-positive undifferentiated sarcomas, better-tailored and more refined treatment algorithms will continue to evolve. Targeted inhibition of the BCOR–CCNB3 fusion transcript may be an efficacious treatment strategy for this disease, possibly in combination with epigenetic modifying agents. Our molecular analysis suggests the possibility that a trial of epigenetic modifying agents in patients with BCS may restore a functional epigenome and, in the absence of a high number of secondary mutations in these patients, may prove an efficacious therapeutic strategy, particularly where epigenome targeted therapeutic agents are already being tested in the clinic (i.e., HDAC inhibitors and EZH2 inhibitors). We recognize this study includes only one BCS patient analysis and a further in-depth analysis of additional tumor samples at the molecular level may lend support to this approach. Furthermore, in vitro and in vivo modeling of the fusion will be necessary to increase our understanding of how the BCOR–CCNB3 fusion may promote epigenetic remodeling and whether targeted therapies may be efficacious for patients with BCS tumors.
Future Research Plans
A cellular model will facilitate the search for a better understanding of BCSs, their driver mutations, cellular effects, and epigenetic alterations. As BCSs appear to be fusion-driven sarcomas, a search for targeted and refined treatments is important. With additional functional studies of a BCS-driven cellular model, we may be able to (1) clarify the role of BCOR epigenetic dysregulation, (2) test/develop novel therapeutics (epigenetic modifiers vs. oncogene-targeted therapies), and (3) identify a distinct epigenetic signature crafted by BCSs and establish a reference methylation standard to identify tumors in cases that are diagnostically challenging.
METHODS
Tissue Source and Processing
Acquisition of specimen from a 14-yr-old pediatric male patient who was diagnosed with a BCS of the kidney was done following IRB approval and patient consent. FFPE samples of the tumor with adjacent paired normal (germline) from the patient's kidney specimen were obtained from the Department of Pathology at Oishei Children's Hospital. The tumor sample was dissected from an FFPE block to a depth of 5 µm and fixed on slides. Paired normal was provided as shaved curls from an FFPE13 block to a depth of 20 µm. Our tumor and matched normal specimens were sent to the Genomics Shared Resource of Roswell Park Comprehensive Cancer Center for DNA and RNA extraction, WES, RNA-seq, miRNA-seq, and EPIC arrays analyses. Briefly, WES libraries were prepared using Agilent, SureSelectXTHS Human All Exon V7 capture library and sequenced on Illumina NovaSeq using 100PE sequencing. For RNA-seq, libraries were prepared using SureSelect XT RNA Direct kit and sequenced on Illumina NovaSeq using 100PE sequencing. The small RNA libraries were prepared using SMARTer smRNA-Seq kit and sequenced on Illumina NovaSeq for single-end sequencing.
Clinical Case Data Review
The patient presentation and follow-up information are obtained from review of the electronic health records and direct conversation with the treating physician and pathologist.
Genomic Analysis
Detection of Somatic Mutations
Exome sequencing reads were aligned to the human reference genome (GRCh37 b37) using BWA-MEM (v0.7.12) (Li and Durbin 2009). Sequencing coverage is shown in Supplemental Figure 1. PCR duplicates were marked by Picard Mark Duplicates (v2.18.2) and base quality recalibration was performed by GATK (v4.1.2.0) (McKenna et al. 2010). The somatic variants were called using a multiple-caller ensembled method, including MuTect2 (v4.1.2.0) (Cibulskis et al. 2013) for sensitive detection of somatic point mutations in impure and heterogeneous cancer samples, MuSE (v1.0) (Fan et al. 2016), which accounts for intertumor heterogeneity using a sample-specific error model and improves sensitivity and specificity in mutation calling for sequencing data, Strelka2 (v2.9.10) (Saunders et al. 2012), and VarDict (v1.8) (Lai et al. 2016) and manually reviewed to remove false-positive calls. The analysis pipelines are reproducible and publicly available through the Rcwl package (Hu et al. 2021). Putative mutations are identified by running NeuSomatic (Sahraeian et al. 2019) with default parameters. All putative mutations are manually inspected to filter out potential false-positives introduced by sequencing and mapping artifacts. The identified somatic mutations are compared to the public human germline databases including dbSNP (Sherry et al. 2001), 1000 Genomes Project (Genomes Project Consortium et al. 2012), and the National Heart, Lung, and Blood Institute's Exome Sequencing Project (NHLBI 2013) to further exclude remaining germline polymorphisms. All mutations are annotated using ANNOVAR (Wang et al. 2010) with the National Center for Biotechnology Information (NCBI) RefSeq database.
RNA-seq Data Analysis
Paired-end reads are generated by Illumina sequencer and QCed using fastqc (Andrews 2010) (v0.10.1) and mapped to the GRCh38 human reference genome and ENSEMBLE (v25) annotation database using STAR [2] (v2.7.0f). Aligned reads are further checked with RSeQC (Wang et al. 2012) (v2.6.3) in order to identify potential library preparation-related problems. Gene level expression is estimated with featureCounts from Subread (Liao et al. 2014) (v1.6.0) using the fracOverlap 1 option. FC estimation and differential expression analyses for the two samples are carried out using EBSeq (Leng et al. 2013) (v1.18.1), which utilizes the empirical Bayesian model and can handle data without replicates. Heatmaps are generated using pheatmap (Kolde and Kolde 2015) (v1.0.8) R library using data generated by DESeq2's (Love et al. 2014) regularized-log transformation. The fusion results are generated from STAR-Fusion (Haas et al. 2019) program from STAR package.
miRNA-seq
Raw single-end reads are examined using fastqc to check for low-quality within-read position trends. Reads are filtered by removing the first 16 bp and last 23 bp to keep the miRNA sequence only. Resulting reads are then aligned and quantified using the miRDeep2 (Friedländer et al. 2012) (v2.0.1.2) pipeline. Briefly, miRDeep2 removes reads smaller than 18 bp, collapses duplicated reads, and maps them against the reference genome (GRCh38) using Bowtie (Langmead 2010) (v1.2.3). Mapped reads are then identified and quantified to known miRNAs (mature and hairpin) accounting for secondary structures.
Methylation Array
EPIC array (Illumina Human Methylation 850k) files are read, processed, filtered, and normalized using minfi (Aryee et al. 2014) (v1.32) R package. M and beta values are then extracted for further analysis. Delta beta values (tumor-normal beta values) are estimated, removing observations with absolute values of <0.2 to identify candidate methylation sites with different methylation levels between tumor and normal samples. Dmrcate (Peters et al. 2015a) Bioconductor package is used to identify differentially methylated regions, and sites are annotated using both the UCSC refGene and CpG island databases to identify the relative position to each gene and CpG islands.
Integrated Analysis
mRNA and miRNA integration: Genes and miRNAs with small read count are considered as unexpressed and removed from the analysis. Posterior FC values were estimated on miRNA and mRNA using EBseq. Two thousand four hundred and ninety-seven genes are differentially expressed with cutoff values of FDR < 0.05 and absolute fold change > 2 and six miRNAs are detected as significant according to FDR < 0.25 (hsa-miR-199a-3p, hsa-miR-199b-3p, hsa-miR-196a-5p, hsa-miR-23a-3p, hsa-miR-204-5p, hsa-miR-1260a). Then anamiR (Wang et al. 2019) (v1.13.0), an integrated analysis package of miRNA and mRNA expression data, was used to find the negatively correlated points and also to query these into 10 different known databases (such as miRDB, miRanda, etc.).mRNA and methylation integration: mRNAs are filtered and processed as described in the miRNA integration analysis, but with a log2FC cutoff value of 6. The log2 ratio of M-values forms tumor and normal samples are calculated and methylation sites with absolute log2 > 3 are kept as candidate differential methylation sites. A final list is obtained matching overlapped genes found in the methylation filtered data annotation with the filtered mRNA data.
ADDITIONAL INFORMATION
Data Deposition and Access
Sequencing data within this paper are deposited in the GEO (https://www.ncbi.nlm.nih.gov/geo/) repository and are now publicly available. The reference series GEO accession number is GSE188555. The data set accession numbers are GSE188552, GSE188553, and GSE188554.
Ethics Statement
The research project used tissue specimens that had been collected as part of an ongoing biorepository. The biorepositories obtain written informed consent from participants and/or their legal guardians for the collection and research use of the specimens. All aspects of the study were approved by the Roswell Park Comprehensive Cancer Center Institutional Review Board.
Acknowledgments
The work herein was supported by the laboratory of Joyce E. Ohm, PhD. We also acknowledge the members of the Ohm lab—Joyce E. Ohm, Kristen M. Humphrey, Lyn Gao, and Jeffrey Martin—for their continued support and mentorship throughout this project. The tissue donation provided by the patient in order to conduct this research is also greatly appreciated. We thank our colleagues at Oishei Children's Hospital, Victoria Frano and Dr. Rafal Kozielski, MD, who provided additional pathologic insight for this case and allowed for the acquisition of our patient's tumor. We thank Dr. Elizabeth Perlman, MD, and Dr. Mariana Cajaiba, MD, at the Division of Pathology at Lurie Children's Hospital in consultation with the Children's Oncology Group Renal Tumor Pathology Center.
Author Contributions
All of the molecular experiments presented here were performed by T.J.H. under the guidance of J.E.O. and with the assistance of J.C.M. and L.G. Bioinformatics and biostatistical analysis of all data were performed by the team of E.C.G., M.L., L.W., S.L., and J.W. P.K.S. led all of the multiomics analysis in the Genomics Shared Resource. T.J.H., A.G., C.J.T, R.K., and J.K. contributed clinical and pathological input. All authors have contributed to writing and/or editing components of this manuscript.
Funding
This work was funded by a grant from the Roswell Park Alliance Foundation to J.E.O. This work was partly supported by National Cancer Institute (NCI) grant P30CA016056 involving the use of Roswell Park Comprehensive Cancer Center's Genomic Shared Resource and Bioinformatics Shared Resource.
Authors: Martin J Aryee; Andrew E Jaffe; Hector Corrada-Bravo; Christine Ladd-Acosta; Andrew P Feinberg; Kasper D Hansen; Rafael A Irizarry Journal: Bioinformatics Date: 2014-01-28 Impact factor: 6.937
Authors: Sarah E Junco; Renjing Wang; John C Gaipa; Alexander B Taylor; Virgil Schirf; Micah D Gearhart; Vivian J Bardwell; Borries Demeler; P John Hart; Chongwoo A Kim Journal: Structure Date: 2013-03-21 Impact factor: 5.006
Authors: Christopher T Saunders; Wendy S W Wong; Sajani Swamy; Jennifer Becq; Lisa J Murray; R Keira Cheetham Journal: Bioinformatics Date: 2012-05-10 Impact factor: 6.937
Authors: Ellen M Mowry; Robert F Carey; Maria R Blasco; Jean Pelletier; Pierre Duquette; Pablo Villoslada; Irina Malikova; Elaine Roger; R Phillip Kinkel; Jamie McDonald; Peter Bacchetti; Emmanuelle Waubant Journal: PLoS One Date: 2013-10-09 Impact factor: 3.240
Authors: Zhongwu Lai; Aleksandra Markovets; Miika Ahdesmaki; Brad Chapman; Oliver Hofmann; Robert McEwen; Justin Johnson; Brian Dougherty; J Carl Barrett; Jonathan R Dry Journal: Nucleic Acids Res Date: 2016-04-07 Impact factor: 16.971