| Literature DB >> 30837567 |
Paul M Krzyzanowski1, Fabrice Sircoulomb2, Fouad Yousif3, Josee Normand2,4, Jose La Rose2, Kyle E Francis2, Fernando Suarez2, Tim Beck5, John D McPherson3,6, Lincoln D Stein7,8, Robert K Rottapel9,10.
Abstract
Genomic rearrangements are a hallmark of cancer biology and progression, allowing cells to rapidly transform through alterations in regulatory structures, changes in expression patterns, reprogramming of signaling pathways, and creation of novel transcripts via gene fusion events. Though functional gene fusions encoding oncogenic proteins are the most dramatic outcomes of genomic rearrangements, we investigated the relationship between rearrangements evidenced by fusion transcripts and local expression changes in cancer using transcriptome data alone. 9,953 gene fusion predictions from 418 primary serious ovarian cancer tumors were analyzed, identifying depletions of gene fusion breakpoints within coding regions of fused genes as well as an N-terminal enrichment of breakpoints within fused genes. We identified 48 genes with significant fusion-associated upregulation and furthermore demonstrate that significant regional overexpression of intact genes in patient transcriptomes occurs within 1 megabase of 78 novel gene fusions that function as central markers of these regions. We reveal that cancer transcriptomes select for gene fusions that preserve protein and protein domain coding potential. The association of gene fusion transcripts with neighboring gene overexpression supports rearrangements as mechanism through which cancer cells remodel their transcriptomes and identifies a new way to utilize gene fusions as indicators of regional expression changes in diseased cells with only transcriptomic data.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30837567 PMCID: PMC6401071 DOI: 10.1038/s41598-019-39878-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Characteristics of chimeric transcripts in ovarian cancer, in transcriptomes derived from TCGA primary tumors and ovarian cell lines. (A) Rearrangement classifications associated with fusion events, in TCGA primary transcriptomes and ovarian cancer cell lines. (B) Profiles of cDNA positions associated with expressed fusion transcripts in TCGA primaries. (C) Copy number status associated with evidence of TCGA fusion transcripts. (D) Density plots of fusion counts per tumor, separated by TCGA subgroups. (E) Proportions of OVCA subtypes vs expressed fusion load.
Figure 2Gene fusion breakpoint locations within protein coding transcripts. (A) Expected distributions of cDNA breakpoint positions according to a random genomic breakage model. Red thresholds indicate observed positions. (B) Numerical data for A. (C) Normalized density plot illustrating observed breakpoint positions within cDNAs (bar plot) vs expected distribution (ribbon plot) with enrichment within C-terminal breakpoints of the CDS (star; Permutation test P < 10−4).
Genes significantly upregulated with positive fusion status.
| Genes | Description | q-value | Fold Change |
|---|---|---|---|
| AQP6 | aquaporin 6 | 2.30E-08 | 8.91 |
| DAPL1 | death associated protein like 1 | 7.82E-08 | 6.94 |
| VPREB3 | pre-B lymphocyte 3 | 1.00E-07 | 2.16 |
| INSL3 | insulin like 3 | 1.96E-06 | 3.71 |
| KISS1 | KiSS-1 metastasis-suppressor | 5.90E-05 | 3.73 |
| AQP5 | aquaporin 5 | 1.30E-04 | 3.67 |
| C1orf186 | chromosome 1 open reading frame 186 | 1.59E-04 | 1.80 |
| C22orf31 | chromosome 22 open reading frame 31 | 1.59E-04 | 15.28 |
| TCL6 | T-cell leukemia/lymphoma 6 (non-protein coding) | 1.59E-04 | 9.12 |
| KCTD1 | potassium channel tetramerization domain containing 1 | 1.90E-04 | 1.66 |
| HKR1 | HKR1, GLI-Kruppel zinc finger family member | 2.20E-04 | 1.36 |
| KIAA1683 | KIAA1683 | 2.39E-04 | 4.13 |
| CRYGC | crystallin gamma C | 2.56E-04 | 126.49 |
| ZNF732 | zinc finger protein 732 | 5.28E-04 | 2.64 |
| KREMEN1 | kringle containing transmembrane protein 1 | 5.39E-04 | 2.88 |
| WNT10A | wingless-type MMTV integration site family member 10A | 5.40E-04 | 2.79 |
| DIO3 | deiodinase, iodothyronine, type III | 5.76E-04 | 26.20 |
| C2orf48 | chromosome 2 open reading frame 48 | 6.74E-04 | 2.60 |
| FGF12 | fibroblast growth factor 12 | 7.37E-04 | 2.53 |
| GOLT1A | golgi transport 1A | 8.46E-04 | 2.22 |
| DAND5 | DAN domain family member 5, BMP antagonist | 9.66E-04 | 9.91 |
| RRM2 | ribonucleotide reductase M2 | 9.87E-04 | 1.81 |
| LRRC69 | leucine rich repeat containing 69 | 1.20E-03 | 7.85 |
| ATP5I | ATP synthase membrane subunit e | 1.24E-03 | 2.01 |
| C7 | complement component 7 | 1.33E-03 | 10.87 |
| ARID3C | AT-rich interaction domain 3C | 1.33E-03 | 15.54 |
| BACE2 | beta-site APP-cleaving enzyme 2 | 1.64E-03 | 2.09 |
| COL6A3 | collagen, type VI, alpha 3 | 2.13E-03 | 6.51 |
| IGLL5 | immunoglobulin lambda-like polypeptide 5 | 2.28E-03 | 7.15 |
| SHISA9 | shisa family member 9 | 2.53E-03 | 8.50 |
| CACNA2D3 | calcium channel, voltage-dependent, alpha 2/delta subunit 3 | 2.53E-03 | 4.04 |
| DEFB126 | defensin beta 126 | 2.66E-03 | 11.54 |
| CHCHD10 | coiled-coil-helix-coiled-coil-helix domain containing 10 | 2.70E-03 | 1.55 |
| MFSD7 | major facilitator superfamily domain containing 7 | 2.70E-03 | 1.80 |
| JAK3 | Janus kinase 3 | 4.29E-03 | 1.63 |
| HLA-DRB1 | major histocompatibility complex, class II, DR beta 1 | 4.31E-03 | 2.56 |
| IRX5 | iroquois homeobox 5 | 5.36E-03 | 30.66 |
| SIX4 | SIX homeobox 4 | 5.48E-03 | 2.07 |
| CCDC120 | coiled-coil domain containing 120 | 5.51E-03 | 3.05 |
| C2orf74 | chromosome 2 open reading frame 74 | 6.34E-03 | 3.13 |
| KLHL23 | kelch like family member 23 | 6.34E-03 | 3.78 |
| SLC29A1 | solute carrier family 29 (equilibrative nucleoside transporter), member 1 | 7.38E-03 | 2.34 |
| MSN | moesin | 7.57E-03 | 1.45 |
| MUC16 | mucin 16, cell surface associated | 7.68E-03 | 2.35 |
| KCNIP3 | potassium voltage-gated channel interacting protein 3 | 7.74E-03 | 5.60 |
| PARD3 | par-3 family cell polarity regulator | 9.01E-03 | 1.57 |
| GLTPD2 | glycolipid transfer protein domain containing 2 | 9.01E-03 | 7.07 |
| NAA15 | N(alpha)-acetyltransferase 15, NatA auxiliary subunit | 9.82E-03 | 1.38 |
Q-values represent FDR corrected p-values of fusion-positive versus fusion-negative expression levels (two-sided Wilcoxon test). Fold change of fusion-positive versus fusion-negative expression levels are shown.
Figure 3Selected genes with expression changes significantly correlated with their fusion status in primary ovarian cancer tumors. Expression levels (FPKM) shown on y-axis with fusion status presented on x-axis.
Figure 4High levels of MUC16 are associated with fusion status. (A) Tumors with detected MUC16 fusion transcripts exhibit significantly higher MUC16 expression levels (Wilcox q-value = 0.008). Breakpoint distribution of MUC16 fusion events in primary ovarian transcriptomes, with classifications as outgoing (MUC16 N-terminal; red) or incoming (MUC16 C-terminal; blue). Mapping on to NM_024690 is shown.
Figure 5Fusions act as markers of localized expression dysregulation. (A) Fusion correlated fold changes of genes in the proximity of fusion genes. Figure illustrates aggregated fold change data for all genes within 1 megabase windows of fusion anchor gene, which lie at the central coordinate (distance = 0). Expected range of fold change (ribbon plot) computed using loess smoothing. (B) Identical figure but post-randomization of TCGA expression data and gene fusion status. (C) Circos plot indicating selected fusion genes (black), fusion associated dysregulated genes (blue), and ratio of observed vs. expected fusion frequencies (red barplot) across the human genome. Chromosomes 1–22, X, Y presented sequentially in clockwise order from the 12 O’clock position.