| Literature DB >> 29308093 |
Kenji K Kojima1,2.
Abstract
Repbase is a comprehensive database of eukaryotic transposable elements (TEs) and repeat sequences, containing over 1300 human repeat sequences. Recent analyses of these repeat sequences have accumulated evidences for their contribution to human evolution through becoming functional elements, such as protein-coding regions or binding sites of transcriptional regulators. However, resolving the origins of repeat sequences is a challenge, due to their age, divergence, and degradation. Ancient repeats have been continuously classified as TEs by finding similar TEs from other organisms. Here, the most comprehensive picture of human repeat sequences is presented. The human genome contains traces of 10 clades (L1, CR1, L2, Crack, RTE, RTEX, R4, Vingi, Tx1 and Penelope) of non-long terminal repeat (non-LTR) retrotransposons (long interspersed elements, LINEs), 3 types (SINE1/7SL, SINE2/tRNA, and SINE3/5S) of short interspersed elements (SINEs), 1 composite retrotransposon (SVA) family, 5 classes (ERV1, ERV2, ERV3, Gypsy and DIRS) of LTR retrotransposons, and 12 superfamilies (Crypton, Ginger1, Harbinger, hAT, Helitron, Kolobok, Mariner, Merlin, MuDR, P, piggyBac and Transib) of DNA transposons. These TE footprints demonstrate an evolutionary continuum of the human genome.Entities:
Keywords: Crypton; DNA transposons; Human repeat; LTR retrotransposons; MER; Non-LTR retrotransposons; Repbase; SINE; Transposable elements; UCON
Year: 2018 PMID: 29308093 PMCID: PMC5753468 DOI: 10.1186/s13100-017-0107-y
Source DB: PubMed Journal: Mob DNA
Ancient repeat sequences not classified yet
| Header | Consensus sequences |
|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
TE classification in Repbase
| Class | Clade/Superfamilya |
|---|---|
| Non-LTR retrotransposon |
|
| LTR retrotransposon |
|
| DNA transposon |
|
aBold faces of clades/superfamilies show the presence of their traces (as repeats and/or domesticated genes) in the human genome
Non-LTR retrotransposons (LINEs, SINEs, and composites)
| Class | Group/Clade | Consensus sequences |
|---|---|---|
| LINE |
|
|
|
|
| |
|
|
| |
|
|
| |
|
|
| |
|
|
| |
|
|
| |
|
|
| |
|
|
| |
|
|
| |
| SINE |
|
|
|
| ||
|
| ||
|
| ||
|
|
| |
|
|
| |
| Unclassified |
| |
| Composite |
|
|
Consensus sequences of Alu are further classified into reported lineages (AluY, AluS, AluJ and monomeric Alu)
Fig. 1Nucleotide sequence alignments of ancient repeats with characterized TEs. Nucleotides identical to the uppermost sequence are shaded. Numbers in parentheses indicate the nucleotide position in the consensus. a UCON82 is an RTE non-LTR retrotransposon family. b UCON39 is an ancient Mariner DNA transposon family. c Eulor5 and Eulor6 are ancient Crypton DNA transposon families
LTR retrotransposons and endogenous retroviruses
| Superfamily | Groupa | Internal portion | Associated LTRs |
|---|---|---|---|
|
|
|
|
|
|
|
|
| |
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
|
| |
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
|
| |
|
|
| ||
|
|
|
| |
|
|
| ||
|
|
| ||
|
|
|
| |
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
|
| |
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
|
| |
|
|
| ||
|
|
| ||
|
|
| ||
|
| |||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
|
| |
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
| Unclassified |
|
| |
|
|
| ||
|
|
| ||
| LTRs not associated with characterized internal portions |
| ||
|
|
|
|
|
|
|
|
| |
|
|
|
| |
|
|
|
| |
|
|
|
| |
|
|
|
| |
|
|
|
| |
|
|
|
| |
|
|
|
| |
|
|
|
| |
| LTRs not associated with characterized internal portions |
| ||
|
|
|
|
|
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
|
| |
|
|
| ||
|
|
|
| |
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
| Unclassified |
|
| |
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
| (LTRs not associated with characterized internal portions) |
| ||
|
|
|
| |
| (LTRs not associated with characterized internal portions) |
| ||
| (Internal portions not associated with characterized LTRs) |
| ||
| Unclassified | (LTRs not associated with characterized internal portions) |
| |
aClassification is based on [64]
DNA transposons
| Superfamily | Consensus sequences |
|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Unclassified |
|