| Literature DB >> 29325122 |
Kenji K Kojima1,2.
Abstract
Short interspersed elements (SINEs), which are nonautonomous transposable elements, require the transposition machinery of long interspersed elements (LINEs) to mobilize. SINEs are composed of two or more independently originating parts. The 5' region is called the "head" and is derived mainly from small RNAs, and the 3' region ("tail") originates from the 3' region of LINEs and is responsible for being recognized by counterpart LINE proteins. The origin of the middle "body" of SINEs is enigmatic, although significant sequence similarities among SINEs from very diverse species have been observed. Here, a systematic analysis of the similarities among SINEs and LINEs deposited on Repbase, a comprehensive database of eukaryotic repeat sequences was performed. Three primary findings are described: 1) The 5' regions of only two clades of LINEs, RTE and Vingi, were revealed to have contributed to the middle parts of SINEs; 2) The linkage of the 5' and 3' parts of LINEs can be lost due to occasional tail exchange of SINEs; and 3) The previously proposed Ceph-domain was revealed to be a fusion of a CORE-domain and a 5' part of RTE clade of LINE. Based on these findings, a hypothesis that the 5' parts of bipartite nonautonomous LINEs, which possess only the 5' and 3' regions of the original LINEs, can contribute to the undefined middle part of SINEs is proposed.Entities:
Keywords: Ceph-domain; LINE; RTE; SINE; internal deletion; nonautonomous
Mesh:
Year: 2018 PMID: 29325122 PMCID: PMC5786205 DOI: 10.1093/gbe/evy008
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
. 1.—Schematic representation of SINE structures. The origins of head (tRNA or 5 S rRNA), body (CORE), and LINE-derived parts (RTE 5′-UTR, RTE 3′-UTR, Vingi 5′-UTR) are indicated. Regions whose origins are unknown are indicated by “?.”
SINEs Whose Two Parts of Sequences Show Similarity to LINEs
| SINE | Region | LINE | Region | Identity |
|---|---|---|---|---|
| 84–134 | 53–107 | 81% | ||
| 141–194 | 3040–3093 | 76% | ||
| 117–168 | 115–166 | 83% | ||
| 173–216 | 5034–5077 | 76% | ||
| 255–311 | 188–243 | 77% | ||
| 341–385 | 144–186 | 91% | ||
| 194–229 | 151–186 | 83% | ||
| 242–299 | 504–558 | 84% | ||
| 235–288 | 188–239 | 80% | ||
| 78–223 | 295–439 | 87% | ||
| 223–267 | 3616–3659 | 89% | ||
| 85–302 | 1–221 | 88% | ||
| 317–406 | 3182–3269 | 83% | ||
| 199–277 | 152–233 | 78% | ||
| 197–296 | 152–250 | 73% | ||
| 10–220 | 6–246 | 69% | ||
| 16–107 | 29–119 | 87% | ||
| 134–239 | 1–122 | 74% | ||
| 170–335 | 27–196 | 69% | ||
| 338–378 | 3726–3767 | 86% | ||
| 415–564 | 147–292 | 74% | ||
| 570–609 | 3854–3893 | 83% | ||
| 134–266 | 159–289 | 74% | ||
| 275–317 | 3854–3895 | 88% |
Note.—If the same region of SINE hits several different LINEs, only the LINE with the highest CENSOR score is shown. The length of LINE/SINE is shown in parenthesis.
SINEs originated by the internal deletion of LINEs.
Internal Fragments of LINE 5′-UTRs Seen in the Middle of SINEs
| SINE | Region | LINE | Region | Identity |
|---|---|---|---|---|
| 82–116 | 2–37 | 92% | ||
| 270–321 | 129–176 | 86% | ||
| 101–178 | 220–298 | 69% | ||
| 391–455 | 294–359 | 76% | ||
| 403–467 | 286–348 | 78% | ||
| 61–127 | 2–74 | 84% | ||
| 130–367 | 259–486 | 75% | ||
| 130–220 | 325–414 | 79% | ||
| 225–281 | 182–237 | 74% | ||
| 127–213 | 325–414 | 76% | ||
| 134–239 | 325–423 | 79% | ||
| 133–238 | 325–423 | 77% |
Note.—If the same region of SINE hits several different LINEs, only the LINE with the highest CENSOR score is shown. The length of LINE/SINE is shown in parenthesis.
. 2.—Sequence similarity of Ceph-domains with some RTE LINEs. Nucleotides identical to those in representative Ceph-domains (SepiaSINE and IdioSINE2) are shaded. The positions of consensus sequences are shown in parentheses with their origins.
. 3.—Alignment of CORE-domains and Ceph-domains. Nucleotides identical to MIR CORE-domain is colored in red, whereas nucleotides identical to Sepioth-SINE1 but not to MIR CORE-domain are in yellow. RTE-like sequences in Ceph-SINEs are in lower cases.
. 4.—Hypothetical origin of SINE body.