| Literature DB >> 24213499 |
Eric Kowarz1, Theo Dingermann, Rolf Marschalek.
Abstract
We among others have recently demonstrated that normal cells produce "fusion mRNAs". These fusion mRNAs do not derive from rearranged genomic loci, but rather they are derived from "early-terminated transcripts" (ETTs). Premature transcriptional termination takes place in intronic sequences that belong to "breakpoint cluster regions". One important property of ETTs is that they exhibit an unsaturated splice donor site. This results in: (1) splicing to "cryptic exons" present in the final intron; (2) Splicing to another transcript of the same gene (intragenic trans-splicing), resulting in "exon repetitions"; (3) splicing to a transcript of another gene (intergenic trans-splicing), leading to "non-genomically encoded fusion transcripts" (NGEFTs). These NGEFTs bear the potential risk to influence DNA repair processes, since they share identical nucleotides with their DNA of origin, and thus, could be used as "guidance RNA" for DNA repair processes. Here, we present experimental data about four other genes. Three of them are associated with hemato-malignancies (ETV6, NUP98 and RUNX1), while one is associated with solid tumors (EWSR1). Our results demonstrate that all genes investigated so far (MLL, AF4, AF9, ENL, ELL, ETV6, NUP98, RUNX1 and EWSR1) display ETTs and produce transpliced mRNA species, indicating that this is a genuine property of translocating genes.Entities:
Year: 2012 PMID: 24213499 PMCID: PMC3712730 DOI: 10.3390/cancers4041036
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Figure 1Cancer genes are frequently involved in chromosomal translocations. According to the Cancer gene census, the human genome harbors 488 cancer genes that are recurrently diagnosed in different individuals. Of those, 327 genes are involved in recurrent genomic translocations, leading to the creation of “fusion genes” that encode fusion proteins with oncogenic potential. The group of translocating genes can be separated into 191 genes that are associated with hematomalignant tumors, while 131 are associated with solid tumors. Five genes were associated with both disease entities. Most of these genes are recombined by an NHEJ-mediated recombination mechanism (~83% vs. ~98% in hemato-malinant vs. solid tumors), while some are recombined by a RAG1/2-mediated mechanism (~17% vs. ~2% in hemato-malignant vs. solid tumors).
Figure 2ETTs cause 3 different splice events. (A) Genes normally produce a primary transcript that is subsequently spliced into the mature RNA; (B) The production of early-terminated transcripts (ETTs) causes three different and independent scenarios. Due to an unsaturated 3'-splicie donor site and based on the available sequences within a given gene, an ETT may cause splicing to a cryptic exon that is encoded by intronic sequences (a); In all cases investigated so far, intragenic trans-splicing reactions are causing “exon repetitions” (b); This can be explained by the fact that the unsaturated splice donor site is using a splice acceptor site of a second, nascent transcript of the same gene. If genes are co-transcribed in the same transcription factory, then intergenic trans-splicing reactions can be observed which will result in the production of a “non-genomically encoded fusion transcript” (NGEFT) (c); Here, the unsaturated splice donor site of the ETT is using a splice acceptor site of a nascent transcript of a second gene. White boxes: exons; lines: introns; grey boxes: 5'- and 3'-UTR sequences; light orange box: cryptic exon. Splice donor (SD) sites are marked as orange circles, while splice acceptor sites are marked as green circles.
Figure 3Genes involved in CT display ETTs and exon repetitions. The exon/intron structures of all four investigated genes are displayed: ETV6, EWSR1, NUP98 and RUNX1. Exons are indicated by black rectangles. The known breakpoint cluster regions of each gene are marked in green; thicker lines indicate major breakpoint cluster regions. Exon numbering is shown on top of each exon/intron structure. 5' and 3'-UTRs are depicted by white rectangles. Genes known to be recombined within a certain intronic region are indicated below. Primers used for exon repetition mapping are indicated above the exon as blue triangles. Oligonucleotides used for 3'-RACE or QRT-PCR experiments are indicated at their corresponding position below the gene structure and depicted as red and green triangles, respectively. Right panels: results of our exon repetition mapping analyses, exemplarily shown for the RS4;11 and Kasumi cell lines. Single bands were sequenced to reveal their exonic composition. Exon repetitions are indicated for each gene on the left of each panel. Sizes are given on the right of each panel.
Results of 3'-RACE experiments. All data concerning the four investigated genes. (A) ETV6; (B) EWSR1; (C) NUP98; (D) RUNX1. Data were separated into the identified cryptic exons, cryptic poly A sites and trans-spliced fusion transcripts. Only the non-genomically encoded polyA streches were indicated as (an).
|
|
| (tag|)agacggggtttcaccatgttagccaggatggtcttgatctcctgacctcgcgatccgcctgcgtcggcctcctaaagtgctaggattacaggcgtgagccaccgcgcctggccggtactgtcccttttttatttaaatcatgtatttctagatttttctctggccccaagtgcaccaccagtgttgtctcagtttcactgttataaatcatttacatcagggatgttgaatccagaaaataaaatggcaaacaaaaatga(an) |
| CACGCCATGCCCATTGGGAGAATAGCAG::AACCCAGCAAGCGTAGTCGAGGAGCCCT |
| [microsomal glutathione S-transferase 3 (MGST3) is located on chromosome 1q24, consists of 6 exons and encodes 152 amino acids] |
|
|
| (cag|)gttacaaagtgagagccttgtatacacttcaatacttaaaaagtacccgtactcagtactcagccggcagcataatgaaaagtgggactagacacggtgtccatatggagaggaaaaatatacatagaatatttttaacaaaatgtattcattgtataaatggaatccttctgtaactttggtaactgcatacttgttgtttggtaatgaaccagaggaggtataatactctagaattgtgtaacattaaagtgtaaacttttgtgtttaaa(an) |
| (cag|)gttacaaagtgagagccttgtatacacttcaatacttaaaaagtacccgtactcagtactcagccggcagcataatgaaaagtgggactagacacggtgtccatatggagaggaaaaatatacatagaatatttttaacaaaatgtattcattgta(an) |
| CAACAGAGCAGCAGCTACGGGCAGCAGA::GCTGTTTTGGAGTCAGCGGAAGATCACCA |
| [pre-mRNA processing factor 6 (PRP6) is located on 20q13.3, consists of 21 exons and encodes 941 amino acids] |
|
|
| additional exon repetition: E14_E13-14-15-16 (out-of-frame; poly-adenylated within exon 16) |
|
|
| (cag|)atggggtttcagcatgttggccaggctcgtctcgaactcctgaccacaggt---gctacaaaaggatttcttttaaccaa(an) |