| Literature DB >> 20615209 |
Abstract
BACKGROUND: Long interspersed nuclear element-1 (LINE-1 or L1) is a dominant repetitive sequence in the human genome. Besides mediating its own retrotransposition, L1 can mobilize Alu and messenger RNA (mRNA) in trans, and probably also SVA and non-coding RNA. The structures of L1 copies and trans-mobilized retrocopies are variable and can be classified into three categories: full-length; 5'-truncated; and 5'-inverted insertions. These structures may be generated by different 5' integration mechanisms.Entities:
Year: 2010 PMID: 20615209 PMCID: PMC2912911 DOI: 10.1186/1759-8753-1-17
Source DB: PubMed Journal: Mob DNA
Figure 1Models for retrotransposition. Five models have been proposed for three types of retrotransposition and all are variations of target-primed reverse transcription (TPRT); the first two steps are common in all models. (A) bottom strand cleavage; (B) target-primed reverse transcription; (C) top strand cleavage; (D) completion of first strand reverse transcription; (E) template jumping; (F) second strand cDNA synthesis by either RT or cellular DNA polymerase; (G) rejoining junctions; (H) annealing between top strand and first strand cDNA; (I) target-primed second strand cDNA synthesis with the usage of short sequence complementarity by either RT (double TPRT model), DNA polymerases in microhomolgy-mediated end-joining (MMEJ) pathway (MMEJ model), or DNA polymerases in non-homlogous end-joining (NHEJ) pathway (NHEJ model); (J) rejoining junctions; (K) annealing between top strand and template RNA; (L) target-primed reverse transcription from the top strand cleavage site; (M) RNA degradation and annealing between two cDNA strands; (N) DNA synthesis and rejoining junctions by unknown mechanisms.
Human-specific and chimpanzee-specific insertions with ≥5bp TSDs
| Human-specific | Chimp-specific | |
|---|---|---|
| Total | 6197 | 2910 |
| L1 | 1120 | 948 |
| | 4197 | 1357 |
| SVA | 368 | 109 |
| HERV and solo LTR | 63 | 41 |
| mRNA retrocopy | 48 | 95 |
| ncRNA retrocopy | 14 | 13 |
| DNA transposon | 0 | 0 |
| Uncharacterized* | 387 | 347 |
Uncharacterized includes elements with nested insertion, internal deletion, internal duplication, 3' truncation, and template switch.
L1, long interspersed nuclear element-1; HERV, human endogenous retrovirus; LTR, long terminal repeat; mRNA, messenger RNA; ncRNA, non-coding RNA.
Figure 2Junction sequences of L1 insertions starting between positions 6-25. Target site duplications are shaded in grey; microhomologies are underlined; 5' extranucleotides are colored in red and in lowercases; and YY1-binding sequences are shaded in yellow.
Three types of L1 insertions in human and chimpanzee
| Human | Chimp | Total | |
|---|---|---|---|
| Full-length | 286 (25.5%) | 49 (5.2%) | 335 (16.2%) |
| 5'-truncated | 490 (43.8%) | 604 (63.7%) | 1094 (52.9%) |
| 5'-inverted | 344 (30.7%) | 295 (31.1%) | 639 (30.9%) |
| Total | 1120 | 948 | 2068 |
Three types of Alu insertions in human and chimpanzee
| Human | Chimp | Total | |
|---|---|---|---|
| Full-length | 3449 (82.2%) | 1023 (75.4%) | 4472 (80.5%) |
| 5'-truncated | 748 (17.8%) | 333 (24.5%) | 1081 (19.5%) |
| 5'-inverted | 0 (0%) | 1 (0.1%) | 1 (0.0%) |
| Total | 4197 | 1357 | 5554 |
The frequency of insertions with 5' extranucleotides
| Type of insertion | No. of insertions with extranucleotides | Frequency |
|---|---|---|
| 5'-inverted L1 | 25 | 3.9% |
| 5'-truncated L1 | 160 | 14.6% |
| 5'-truncated | 126 | 11.7% |
| Full-length | 554 | 12.4% |
Figure 3Target site duplication (TSD) and mircohomogous (MH) length distribution. (A) TSD length distribution of L1 and Alu. (B) TSD length distribution of SVA and messenger RNA retrocopy. (C) Length distribution of MH at 5' junctions of L1 and Alu. (D) Length distribution of MH at inversion junction of 5'-inverted L1, 5' junctions of SVA and mRNA retrocopies and 3' junctions of non-canonical L1 insertion (NCLI) and transfer RNA tailless retrocopies.
Chi-square or Fisher's exact test for the frequency of ≤ 9-bp target site duplications
| Comparison | |
|---|---|
| 5'-truncated L1 > full-length L1 | 5.7e-12*** |
| 5'-truncated L1 > 5'-inverted L1 | 1.2e-19*** |
| 5'-truncated L1 > full-length | 5.3e-43*** |
| 5'-truncated L1 > 5'-truncated | 5.9e-14*** |
| 5'-truncated L1 > 5'-truncated SVA | 7.7e-5*** |
| 5'-truncated L1 > 5'-truncated mRNA retrocopy | 0.0038** |
** P- value < 0.01
*** P- value < 0.001
Figure 4Insertion site sequences. (A) Target sequence logos. Arrow indicates the position of 3' insertion site. Sample numbers are represented at upper right. As insertions with 20-bp target site duplications (TSDs) are not included, the sums of sample numbers are not equal to those in Tables 2 and 3. (B) An example of short TSDs that were likely to be parts of long TSDs including substitution. TSDs are underlined and identical sequences in TSDs are shaded in black.
Short target site duplications (TSDs) with or without allowing nucleotide substitutions
| L1 | |||||
|---|---|---|---|---|---|
| Full | Trun | Inv | Full | Trun | |
| 5-bp TSD in relaxed criterion*/ | 0/3 | 36/56 | 0/4 | 13/51 | 5/18 |
| 6-bp TSD in relaxed criterion*/ | 0/4 | 51/64 | 0/6 | 14/69 | 6/26 |
*TSD in relaxed criterion allows 1-bp substitution sandwiched by ≥2-bp homologous nucleotides, but TSD in strict criterion does not.
Full, full-length; Trun, 5'-truncated; Inv, 5'-inverted.
Chi-square or Fisher's exact test for the frequency of ≥1-bp microhomologies (MHs)
| Comparison | |
|---|---|
| 5'-inverted L1 > 5'-truncated L1 | 5.9e-21*** |
| 5'-truncated L1 > Full-length L1 | 1.1e-43*** |
| 5'-truncated L1 > 5'-truncated | 9.8e-46*** |
| 5' junction of 5'-inverted L1 > Inversion junction of 5'-inverted L1 | 3.6e-11*** |
| Inversion junction of 5'-inverted L1 > 5' junction of full-length L1 | 2.3e-51*** |
| 5'-truncated L1 > 5'-truncated SVA | 2.3e-5*** |
| 5'-truncated L1 > 5'-truncated mRNA retrocopies | 3.7e-6*** |
*** P-value < 0.001