| Literature DB >> 25056159 |
José Horacio Grau1, Albert J Poustka, Martin Meixner, Jörg Plötner.
Abstract
BACKGROUND: LTR retroelements (LTR REs) constitute a major group of transposable elements widely distributed in eukaryotic genomes. Through their own mechanism of retrotranscription LTR REs enrich the genomic landscape by providing genetic variability, thus contributing to genome structure and organization. Nonetheless, transcriptomic activity of LTR REs still remains an obscure domain within cell, developmental, and organism biology.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25056159 PMCID: PMC4131045 DOI: 10.1186/1471-2164-15-626
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Classification and structure of LTR retroelements in the frog genome and transcriptomes. Maximum-likelihood (ML) trees calculated on the basis of 256 known RT domains of eukaryotic LTR REs including amino acid sequences obtained from the Silurana tropicalis genome (a) and the transcriptomes of Pelophylax lessonae (b). Diagrammatic presentation of LTR REs (c) found in the Silurana genome (blue) and in the transcriptome of P. lessonae (red). The thin lines represent the overall length of the retroelement including the LTRs, while thick bars depict open reading frames for aspartic proteinase (AP), chromo domain (CHR), envelope protein (ENV), group-specific antigen (GAG), integrase (INT), RNase (RN), and reverse transcriptase (RT). Frameshifts are indicated by asterisks (*).
LTR retroelements detected in the genome of
| Family | Type | GSM1 (genomic ORFs) | GSM2 (LTR-harvest) | AAE | AEL [bp] | [%] |
|---|---|---|---|---|---|---|
| Bel/Pao | Kobel | 129 | 140 | 135 | 7000 | 0.06468 |
| Hydra3.1 | 0 | 2 | 1 | 7000 | 0.00048 | |
| Ty1/Copia | Hydra1.1 | 6 | 8 | 7 | 4000 | 0.00192 |
| Mtanga | 8 | 8 | 8 | 4000 | 0.00220 | |
| Ty3/Gypsy | Amn-san | 749 | 805 | 777 | 5000 | 0.26688 |
| Cer | 30 | 25 | 28 | 7000 | 0.01322 | |
| Gmr | 177 | 215 | 196 | 8000 | 0.10772 | |
| Mag | 65 | 102 | 84 | 4000 | 0.02294 | |
| Retroviridae | MuERV | 2 | 1 | 2 | 6000 | 0.00062 |
| SnRV | 7 | 11 | 9 | 10000 | 0.00618 | |
| XEN1 | 7 | 12 | 10 | 10000 | 0.00653 | |
|
|
|
|
|
| ||
Based on the results of two genome search methods (GSM1 and 2) the average amount of elements (AAE), the average element length (AEL), and the percentage [%] of the elements in the genome were calculated.
LTR retroelements discovered in the genome of (SIL-G) and different transcriptomes of (SIL-T: adult tissues; SIL-D: developmental stages), (CYC-T: adult individuals), and (PEL-T: adult tissues) with remarks on the occurrence and distribution of these elements among animals, plants, and fungi
| Family | Type | Occurrence/remarks | Ref. | Genome | Transcriptome | |||
|---|---|---|---|---|---|---|---|---|
| SIL-G | SIL-T | SIL-D | CYC-T | PEL-T | ||||
|
| Kobel | first detected in the genome of the hemichordate | [ | ● | ● | ● | ● | ● |
| Only known from animal genomes; relatively few elements are reported across diverse animal phyla | ||||||||
| Hydra3.1 | described from the genome of | [ | ● | |||||
|
| Hydra1.1 | includes two elements that have described from the invertebrate | [ | ● | ● | ● | ● | |
| Widespread in eukaryotic genomes; two main sub-clades can be distinguished | ||||||||
| Mtanga | so far only known from the genome of the mosquito | [ | ● | ● | ||||
| Zeco | restricted to crustaceans, urochordates, and fish | [ | ● | |||||
|
| Amn-san | belongs to the vertebrate lineage of chromoviruses, active in fish, amphibians, and reptiles | [ | ● | ● | ● | ● | ● |
| The largest family of LTR REs; widespread among the genomes of plants, animals, and fungi | ||||||||
| Cer | first described from nematodes | [ | ● | ● | ||||
| CsRN1 | characterized from the genome of the trematode | [ | ● | |||||
| Gmr | circulate within the genomes of deuterostomes; characterized by Ty1/Copia pol-domain organization | [ | ● | ● | ● | ● | ● | |
| Mag | widely spread through animal genomes including vertebrates | [ | ● | ● | ● | ● | ● | |
|
| MuERV | poorly known outside mammals (belongs to class 3 of retroviruses) | [ | ● | ● | ● | ||
| Exclusively found in vertebrate genomes; characterized by the presence of a gene encoding an envelope protein | ||||||||
| SnRV | described from the snakehead fish ( | [ | ● | ● | ● | ● | ||
| XEN1 | described from | [ | ● | ● | ● | ● | ● | |
Figure 2Diversity and expression patterns of LTR retroelements in the frog genome and transcriptomes. (a) Diversity of LTR REs in the genome of Silurana and in the frogs transcriptomes analyzed. (b) Number of LTR RE copies in the Silurana genome; (c) Proliferation patterns based on average relatedness of LTR REs in the Silurana genome. The average relatedness was calculated on the basis of amino acids as LOG (∑ (Alignment coverage *Alignment score)), in which a higher relatedness score indicates that the elements within that group are closer related to one another. (d) Arithmetic means of relative NRC values calculated for brain (B), heart (H), liver (L), and muscle (M) of S. tropicalis (left points) and P. lessonae (right points). (e) Relative amount of LTR REs in different frog transcriptomes.
Figure 3Normalized read counts and relative amount of expression of LTR retroelement (LTR RE) transcripts throughout the developmental progression of progession. The presence of each type of LTR RE found within the transcriptome of S. tropicalis throughout 23 distinct developmental stages is summarized.
Examples of protein domains found in LTR REs predicted from the genome of which might play a role in gene regulation and transcriptional networking
| Domain | Pfam No. | Included in | Domain description/function | Ref. |
|---|---|---|---|---|
| CHROMO (Chromatin organization modifier) | pfam00385 | This domain was exclusively found within Amn-san elements. Circa 80% of these elements within the | The chromo domain is about 40–50 amino acids long. It is contained in various proteins involved in chromatin remodeling and the regulation of gene expression in eukaryotes during development. | [ |
| PNMA (Para-neoplastic antigen MA) | pfam14893 | Found so far in about 30% of Cer elements and in about 16% of Gmr elements. | This protein domain has so far only been studied in mammals, where it has been associated with neurological disorders. | [ |
| Because of the homology between PNMA proteins and an apoptosis inducing protein (MOAP1), the involvement of PNMA proteins in apoptosis is hypothesized. | ||||
| SCAN | pfam02023 pfam00096 | This domain was found in over 50% of Gmr elements only. | The SCAN domain family of Zinc finger transcription factors, they are thought to be implicated in regulating genes involved in lipid metabolism, cell survival, and differentiation. | [ |
| Exo/endo phosphatase | pfam03372 pfam14529 | Found in about 12% of Mtanga elements. | The exo-/endonuclease phosphatase family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signaling. | [ |
| Zinc Fingers Zf-H2C2_2 (Zinc-finger double domain) | pfam13465 | About 40% of SnRVs contained a Zinc-finger double domain; about 20% of XEN1 elements contained a Zinc knuckle. | Zinc finger (Znf) domains are relatively small but very diverse protein motifs which can target specific molecules. Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organization, epithelial development, cell adhesion, protein folding, chromatin remodeling, and Zinc sensing, to name but a few. | [ |
| pfam00098 | A Zinc knuckle is a Zinc binding motif of the general structure CX2CX4HX4C where X can be any amino acid. The motifs mostly originate from retroviral gag proteins (nucleocapsid). Zinc knuckles are involved in eukaryotic gene regulation. | |||
| UBN2 gag-polypeptide of LTR copia-type | pfam14223 | Found in Copia-type elements, in about 30% of Mtanga elements and in about 80% of Hydra1.1 elements. | Ubinucleins are members of a protein family that contain a conserved HIRA binding domain which interacts with the N-terminal WD repeats of HIRA/Hir proteins. UBN1 and UBN2 are believed to be the orthologs of Hpc2p, a subunit of a nucleosome assembly complex in budding yeast (HIR), involved in regulation of histone gene transcription. | [ |
Pfam: Protein family database.
Figure 4Work flow diagram summarizing data flow from sequencing to statistical analysis. Abbreviations used in the Figure: DB, database; LTR REs, long terminal repeat retroelements.