| Literature DB >> 17986331 |
Britta Mersch1, Noa Sela, Gil Ast, Sándor Suhai, Agnes Hotz-Wagenblatt.
Abstract
BACKGROUND: Transposed elements (TEs) are known to affect transcriptomes, because either new exons are generated from intronic transposed elements (this is called exonization), or the element inserts into the exon, leading to a new transcript. Several examples in the literature show that isoforms generated by an exonization are specific to a certain tissue (for example the heart muscle) or inflict a disease. Thus, exonizations can have negative effects for the transcriptome of an organism.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17986331 PMCID: PMC2194731 DOI: 10.1186/1471-2156-8-78
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Figure 1The effects of TE insertions. a) (i) TE inserts into an intron of a gene. (ii-v) show the possible effects of this integration; (ii) alternatively exon is created, (iii) TE contributes alternative 5'splice site, (iv) TE contributes alternative 3'splice site, (v) TE creates a constitutively spliced exon. b) (i) TE inserts into the first or last exon of a gene. (ii – iv) show the possible effect of this integration: (ii, iii) enlargement of first or last exon, (iv) TE activates an alternative intron.
Figure 2Pie chart of EST numbers. Number of ESTs for the different tissues. This figure indicates the extremely different numbers of ESTs and mRNAs for the different tissues.
Figure 3Work flow of SERpredict. Programs and rules used for extracting tissue- or tumor-specific TE-containing exons, for details see Section "Work flow in SERpredict".
Figure 4Example output of SERpredict. The output of SERpredict for [EMBL:AF466401] is presented. One of the isoforms shows a tumor-specific exon which was generated through the exonization of an Alu element.
Exonizations in the human genome
| Human | |||||||
| Transposed element | Alu | L1 | L2 | CR1 | MIR | LTR | DNA |
| Number of exonizations | 432 | 88 | 40 | 4 | 86 | 119 | 90 |
Number of exonized transposed elements in the human genome which have a least three EST observations of the mRNA containing the TE exon. The Alu element is exonized most frequently among TEs.
Exonizations in the mouse genome
| Mouse | |||||||||
| transposed element | B1 | B2 | B4 | L1 | L2 | ID | MIR | LTR | DNA |
| number of exonizations | 55 | 33 | 22 | 37 | 9 | 4 | 20 | 72 | 8 |
Number of exonized transposed elements in the mouse genome which have a least three EST observations of the mRNA containing the TE exon. The B1 element, which is homologous to the left arm of the Alu element, is exonized most frequently among mouse TEs.
Human tissue-specific TE-exons
| Gene | TE | Chr. | Tissue | TS score |
| Zinc finger protein 195 | L1 | 11 | nerve | 92.41 |
| Zinc finger protein 33A | L1 | 10 | trachea | 96.81 |
| Ribonuclease P protein subunit p38 | Alu | 10 | bone | 92.30 |
| AP-3 complex subunit mu-1 | Alu | 10 | eye | 99.21 |
| Zinc finger protein 195. | Alu | 11 | nerve | 92.41 |
| 4F2 cell-surface antigen heavy chain | Alu | 11 | bone | 96.69 |
| uterus | 96.34 | |||
| Suppressor of G2 allele of SKP1 homolog | Alu | 13 | pancreas | 94.39 |
| muscle | 96.65 | |||
| Centrosomal protein of 27 kDa | Alu | 15 | uterus | 93.29 |
| Fumarylacetoacetate hydrolase domain-containing protein 1 | Alu | 16 | placenta | 94.01 |
| T-cell activation NFKB-like protein | Alu | 19 | ovary | 93.65 |
| Zinc finger protein 320 | Alu | 19 | embryo | 96.68 |
| Zinc finger MYM-type protein 1 | Alu | 1 | brain | 97.84 |
| Protein MANBAL | Alu | 20 | stomach | 93.75 |
| Serine/threonine-protein kinase 6 | Alu | 20 | mouth_oral | 94.16 |
| CDNA FLJ20699 fis. clone KAIA2372. | Alu | 22 | muscle | 93.75 |
| brain | 98.69 | |||
| DNA directed RNA polymerase II | Alu | 7 | colon | 93.45 |
| ovary | 93.45 | |||
| thymus | 98.29 | |||
| Putative ribosomal RNA methyltransferase 2 | Alu | 7 | breast | 93.74 |
| GPI ethanolamine phosphate transf. 3 | Alu | 9 | eye | 91.82 |
| Soluble calcium-activated nucl. 1 | L2 | 17 | placenta | 98.14 |
| Intraflagellar transport 20 homolog | L2 | 17 | colon | 98.92 |
| muscle | 93.73 | |||
| CDNA FLJ32655 fis | CR1 | 17 | testis | 99.99 |
| Retinol dehydrogenase 13 | MIR | 19 | testis | 93.74 |
| uterus | 92.69 | |||
| Tripartite motif-containing protein 14. | MIR | 9 | thymus | 93.39 |
| Salivary alpha-amylase precursor | ERVK | 1 | muscle | 97.67 |
| Thiamin pyrophosphokinase 1 | ERV1 | 7 | testis | 93.75 |
| Hypothetical protein Q8WZ27 | ERV1 | 4 | thyroid | 93.73 |
| Trafficking protein particle complex protein 2 | MER1 | X | testis | 92.55 |
| mTERF domain-containing protein 2. | MER2 | 2 | brain | 99.09 |
Potentially tissue-specific TE-exons in the human transcriptome. From left to right: the gene name in which the exonization occurred, the transposed elements family name, the chromosome number, the name of the tissue to which the exon is specific and the TS score.
Mouse tissue-specific TE-exons
| Gene | TE | Chr. | Tissue | TS score |
| RIKEN cDNA 9830124H08 gene | B1 | 14 | pancreas | 99.21 |
| Gametogenetin binding protein 1 | B1 | 17 | pancreas | 90.57 |
| G protein-coupled receptor 177 | B1 | 3 | pancreas | 98.05 |
| Hydroxysteroid dehydrogenase like 2 | B1 | 4 | pancreas | 98.44 |
| NFKB inhibitor interacting Ras-like protein 1 | B2 | 14 | pancreas | 96.48 |
| RIKEN cDNA 4930444A02 gene | B2 | 8 | pancreas | 92.81 |
| ADP-ribosylation factor-like 4A | B4 | 12 | pancreas | 93.75 |
| Cyclin-dependent kinase 2 | L2 | 10 | pancreas | 92.75 |
| Target of EGR1. member 1 | L2 | 4 | limb | 92.75 |
| ST6 | MIR | 2 | limb | 92.74 |
| intestine | 99.4 |
Potentially tissue-specific TE-exons in the mouse transcriptome. From left to right: the gene name in which the exonization occurred, the transposed elements family name, the chromosome number, the name of the tissue to which the exon is specific and the TS score.
Human tumor-specific TE-exons
| Gene | TE | Chr. | LOD |
| Centrosomal protein of 27 kDa | Alu | 15 | 2.93 |
| G-protein coupled receptor 56 precursor | Alu | 16 | 2.8 |
| T-cell activation NFKB-like protein | Alu | 19 | 2.76 |
| Dipeptidyl peptidase 9 | Alu | 19 | 3.49 |
| NADH dehydrogenase | Alu | 19 | 2.07 |
| YY1-associated protein 1 | Alu | 1 | 3.15 |
| Centromere protein R | Alu | 1 | 2.74 |
| Selenoprotein T precursor. | Alu | 3 | 2.64 |
| Putative ribosomal RNA methyltransferase 2 | Alu | 7 | 2.4 |
| GPI ethanolamine phosphate transferase 3 | Alu | 9 | 2.07 |
| Protein RMI1 homolog. | Alu | 9 | 2.03 |
| NHP2-like protein 1 | L1 | 22 | 9.1 |
| Zinc finger protein DZIP1 | L2 | 13 | 2.34 |
| Dynein light chain 2A. cytoplasmic | MIR | 20 | 4.43 |
| Protein NipSnap1. | MIR | 22 | 6.55 |
| ST6 | MIR | 9 | 2.69 |
| Tripartite motif-containing protein 14. | MIR | 9 | 2.86 |
| 40S ribosomal protein SA | ERV1 | 3 | 12.27 |
| DNA directed RNA polymerase II | MaLR | 7 | 2.34 |
| CDNA FLJ33708 fis, clone BRAWH2007862. | MaLR | 6 | 2.18 |
| Beta-microseminoprotein precursor | MER1 | 10 | 2.92 |
Potentially tumor-specific TE-exons in the human transcriptome. From left to right: the gene name in which the exonization occurred, the transposed elements family name, the chromosome number and the LOD score.
Mouse tumor-specific TE-exons
| Gene | TE | Chr. | LOD |
| Methylmalonic aciduria | L1 | 8 | 2.76 |
| ST6 | MIR | 2 | 2.56 |
Potentially tumor-specific TE-exons in the mouse transcriptome. From left to right: the gene name in which the exonization occurred, the transposed elements family name, the chromosome number and the LOD score.