| Literature DB >> 28983113 |
Helena Persson1, Rolf Søkilde1, Jari Häkkinen1, Anna Chiara Pirona1,2, Johan Vallon-Christersson1, Anders Kvist1, Fredrik Mertens3, Åke Borg1,4, Felix Mitelman3, Mattias Höglund1,4, Carlos Rovira5,6.
Abstract
Studies of fusion genes have mainly focused on the formation of fusions that result in the production of hybrid proteins or, alternatively, on promoter-switching events that put a gene under the control of aberrant signals. However, gene fusions may also disrupt the transcriptional control of genes that are encoded in introns downstream of the breakpoint. By ignoring structural constraints of the transcribed fusions, we highlight the importance of a largely unexplored function of fusion genes. Here, we show, using breast cancer as an example, that miRNA host genes are specifically enriched in fusion genes and that many different, low-frequency, 5' partners may deregulate the same miRNA irrespective of the coding potential of the fusion transcript. These results indicate that the concept of recurrence, defined by the rate of functionally important aberrations, needs to be revised to encompass convergent fusions that affect a miRNA independently of transcript structure and protein-coding potential.Fusion gene research traditionally focuses on fusions that result in hybrid proteins or promoter switching events. Here, the authors demonstrate enrichment of fusions in miRNA host genes in breast cancer, highlighting that disparate fusions could have convergent impact on miRNA.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28983113 PMCID: PMC5629207 DOI: 10.1038/s41467-017-01176-1
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
MicroRNA precursors encoded in host genes with fusion transcripts in our data and in TCGA data
| Any fusion | 5′ Fusion partner | 3′ Fusion partner | |
|---|---|---|---|
| ≥ 1 Tumor | 802 (61%) | 691 (52%) | 640 (48%) |
| ≥ 1 Tumor and in TCGA | 163 (12%) | 155 (12%) | 153 (12%) |
| ≥ 1 Tumor and in TCGA breast | 142 (11%) | 66 (5%) | 80 (6%) |
| Recurrent (≥ 3 tumors) | 514 (39%) | 426 (32%) | 380 (29%) |
| Recurrent and in TCGA | 124 (9%) | 119 (9%) | 92 (7%) |
| Recurrent and in TCGA breast | 101 (8%) | 51 (4%) | 47 (4%) |
| Recurrent our data and TCGA | 44 (3%) | 24 (2%) | 18 (1%) |
| Recurrent our data and TCGA breast | 14 (1%) | 8 (1%) | 6 (0.5%) |
Number of miRNA precursors encoded in host genes with fusion transcripts with the corresponding percentage of all 1320 analyzed pre-miRNAs in parenthesis. 5ʹ or 3ʹ fusion partner refers to the position of the miRNA host gene and the column “any fusion” combines all precursors present in 5ʹ and/or 3ʹ fusion partner genes. Recurrent fusions were defined as occurring in at least three tumors with the host gene in the same position
Fig. 1MicroRNA host genes are overrepresented among fusion genes. Logistic regression with a model including host gene status, gene size, and the interaction between the two, showed that miRNA host genes were significantly more likely to be involved in fusion transcripts, both when considering all fusions (a) or recurrent fusions (b). No significant association was found between gene fusions and the presence of several other genetic elements that occur with a similar frequency in protein-coding genes according to RepeatMasker annotation (c–h)
Fig. 2Fusion transcripts involving miRNA host genes as 3ʹ partners have fewer in-frame fusions. Cumulative distribution of the fraction of in-frame fusions among all fusion transcripts, plotted separately for 5ʹ and 3ʹ partner genes that lack or include miRNAs within the fused gene segments in our breast cancer data (a) and among fusion transcripts from approximately 30 different normal tissue types from ref. [27] (b) Percentage of genes with fusions that include miRNAs when analyzing all fusion types or only in-frame fusion transcripts for all protein-coding genes involved in fusions or among 5ʹ and 3ʹ partners separately, and, for comparison, among manually curated fusions in the COSMIC database (c). The 5ʹ partners of miRNA host genes have higher average expression than those of non-host genes, especially when the miRNA is included in the fused segment of the 3ʹ partner (d). Average expression was calculated separately for each 5ʹ partner gene among tumors with a fusion transcript matching either of the three 3ʹ partner categories. No fusion represents expression for the full set of 5ʹ partner genes in samples without fusion transcripts involving them. Expression is in log2(FPKM) (fragments per kilobase of exon model and million reads)
MicroRNA precursors encoded in host genes with fusion transcripts in our data and in the Mitelman Database of Chromosome Aberrations and Gene Fusions in Cancer
| Any fusion | 5′ Fusion partner | 3′ Fusion partner | |
|---|---|---|---|
| ≥ 1 Tumor | 802 | 691 | 640 |
| ≥ 1 Tumor and in Mitelman | 320 (40%) | 192 (28%) | 162 (25%) |
| Recurrent (≥ 3 tumors) | 514 | 426 | 380 |
| Recurrent and in Mitelman | 222 (43%) | 137 (32%) | 94 (25%) |
| Recurrent in both our data and Mitelman | 9 (2%) | 4 (1%) | 4 (1%) |
Number of miRNA precursors encoded in host genes with fusion transcripts with the corresponding percentage previously reported in breast cancer in the Mitelman Database of Chromosome Aberrations and Gene Fusions in Cancer in parenthesis. 5ʹ or 3ʹ fusion partner refers to the position of the miRNA host gene and the column “any fusion” combines all precursors present in 5ʹ and/or 3ʹ fusion partner genes. Recurrent fusions were defined as occurring in at least three tumors with the host gene in the same position in our data
MicroRNA-convergent fusions involving precursors of upregulated miRNAs
| miRNA | Number 5′ partner genes | Number tumors with more than one 5′ partner | Number tumors 3′ fusion | Host gene | Cytogenetic location |
|---|---|---|---|---|---|
|
| 20 | 2 | 19 |
| 17q23.1 |
|
| 12 | 1 | 14 |
| 7q22.1 |
|
| 3 | 0 | 3 |
| 3p22.2 |
|
| 3 | 0 | 3 |
| 2q35 |
|
| 62 | 19 | 82 |
| 17p11.2 |
|
| 11 | 2 | 11 |
| 8q24.3 |
|
| 4 | 0 | 4 |
| 5q15 |
|
| 38 | 11 | 34 |
| 14q32.2 |
|
| 107 | 17 | 49 |
| 11p15.5 |
|
| 6 | 0 | 6 |
| 4q21.1 |
|
| 5 | 0 | 5 |
| 8p11.21 |
|
| 8 | 0 | 14 |
| 15q23 |
|
| 2 | 0 | 2 |
| 19q13.2 |
|
| 5 | 1 | 6 |
| 11p15.5 |
|
| 4 | 1 | 3 |
| 5q35.3 |
|
| 6 | 2 | 29 |
| 22q13.31 |
|
| 2 | 0 | 2 |
| 11p13 |
|
| 5 | 0 | 5 |
| 6q23.3 |
|
| 4 | 0 | 4 |
| 20q13.2 |
|
| 5 | 0 | 5 |
| 20q13.12 |
|
| 3 | 0 | 3 |
| 22q12.3 |
|
| 103 | 6 | 29 |
| 15q26.3 |
|
| 338 | 92 | 155 |
| 17q12 |
|
| 5 | 0 | 5 |
| 4p14 |
|
| 22 | 5 | 24 |
| 17q21.2 |
MicroRNA-convergent fusions involving the precursors of miRNAs upregulated in tumors with 3ʹ host fusions vs tumors without host gene fusions (mature miRNAs are listed in Supplementary Table 3). For clustered miRNAs, all miRNA loci included in the host gene fusions are listed
Fig. 3Novel fusion genes involving mir-33b in SREBF1. Breakpoints for 5ʹ and 3ʹ SREBF1 fusion transcripts (a). Gene symbols refer to the corresponding partner gene for each fusion transcript. Red marks in-frame fusions between partner gene coding sequences (CDSs) and blue other fusion transcripts with out-of-frame fusions between partner gene CDSs in bold italic font. Expression of miR-33b-5p (b) but not miR-33b-3p (c) was significantly increased in 13 tumors with 3ʹ SREBF1 fusions and miRNA expression data compared with tumors without fusions of the host gene. Samples with SREBF1 fusions are shown as filled circles for in-frame fusions between partner gene CDSs, open circles with thick line for out-of-frame fusions between partner gene CDSs, and open circles with thin line for all other fusion transcripts
Fig. 4Overexpression of mir-21 in tumors with 3ʹ VMP1 fusion transcripts. Breakpoints and fusion partners for 5ʹ and 3ʹ VMP1 fusion transcripts (a). Gene symbols refer to the corresponding partner gene for each fusion transcript. Red marks in-frame fusions between partner gene coding sequences (CDSs) and blue other fusion transcripts with out-of-frame fusions between partner gene CDSs in bold italic font. Tumors with genomic amplification of the oncogene HER2 (ERBB2) and tumors negative for expression of estrogen receptor (ER) alpha (ESR1) were overrepresented among samples with VMP1 fusion transcripts with p = 0.0018 and p = 0.030, respectively (b). In TCGA data, breast cancer and lung adenocarcinoma were enriched among samples with 3ʹ VMP1 fusions (p = 0.025 and p = 0.0011, respectively); breast cancer also in tumors with 5ʹ VMP1 fusions (p = 0.019, c). Expression of both mature miRNAs from the mir-21 locus, miR-21-5p (d) and miR-21-3p (e), was significantly higher in tumors with 3ʹ VMP1 fusions compared to tumors with no VMP1 fusion transcripts (p = 3.36 × 10−6 and p = 0.048, respectively). Samples with VMP1 fusions are shown as filled circles for in-frame fusions between partner gene CDSs, open circles with thick line for out-of-frame fusions between partner gene CDSs, and open circles with thin line for all other fusion transcripts. The mRNA expression of PDCD4, an experimentally confirmed target for miR-21-5p, was significantly lower in tumors with 3ʹ VMP1 fusions including mir-21 compared to tumors without mir-21 fusions (p = 0.0019, f). P-values were calculated by Fisher’s exact test for the marginal odds ratios in b, c and by Student’s t-test in d–f