| Literature DB >> 16034026 |
Kyudong Han1, Shurjo K Sen, Jianxin Wang, Pauline A Callinan, Jungnam Lee, Richard Cordaux, Ping Liang, Mark A Batzer.
Abstract
Long INterspersed Elements (LINE-1s or L1s) are abundant non-LTR retrotransposons in mammalian genomes that are capable of insertional mutagenesis. They have been associated with target site deletions upon insertion in cell culture studies of retrotransposition. Here, we report 50 deletion events in the human and chimpanzee genomes directly linked to the insertion of L1 elements, resulting in the loss of approximately 18 kb of sequence from the human genome and approximately 15 kb from the chimpanzee genome. Our data suggest that during the primate radiation, L1 insertions may have deleted up to 7.5 Mb of target genomic sequences. While the results of our in vivo analysis differ from those of previous cell culture assays of L1 insertion-mediated deletions in terms of the size and rate of sequence deletion, evolutionary factors can reconcile the differences. We report a pattern of genomic deletion sizes similar to those created during the retrotransposition of Alu elements. Our study provides support for the existence of different mechanisms for small and large L1-mediated deletions, and we present a model for the correlation of L1 element size and the corresponding deletion size. In addition, we show that internal rearrangements can modify L1 structure during retrotransposition events associated with large deletions.Entities:
Mesh:
Year: 2005 PMID: 16034026 PMCID: PMC1179734 DOI: 10.1093/nar/gki718
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1L1IMD in the human genome. (A) Gel chromatograph of PCR products from a phylogenetic analysis of the human-specific L1IMD. The DNA template used in each lane is shown at top. The product sizes for filled and empty alleles are indicated at the left. (B) Schematic diagram depicting the insertion of the L1 element (orange boxes) and the deletion of genomic DNA (blue boxes). Flanking unique DNA sequences are shown as light blue boxes.
Figure 2EN cleavage site preferences for the L1IMDs. The number of differences from the consensus L1 endonuclease cleavage site (TTTT/A) are shown after down-weighting transitions. The data are analyzed for (A) The L1-Ta subfamily elements identified in Morrish et al. (17); (B) Human lineage-specific L1 insertions; (C) Chimpanzee lineage-specific L1 insertions.
Figure 3Median-joining network of the L1 elements associated with L1IMD. Empty circles denote human-specific L1 elements. Filled circles denote chimpanzee-specific L1 elements. The size of circles indicates the number of L1 loci with that sequence type. The lines denote substitution steps, with a one-step distance indicated in the top-left corner. The subfamily-specific diagnostic sequence positions (corresponding to positions 5930–5932 and 6015 in the 3′-UTR of the full-length L1 consensus sequence) are specified below each relevant node.
Structural summary of L1IMD
| Feature | Human | Chimpanzee |
|---|---|---|
| Full-length L1 insertions | 0 | 0 |
| 5′ truncated L1 insertions | 24 | 26 |
| Internal rearrangements | 4 | 2 |
| Non-inverted | 4 | 0 |
| 5′ truncation/inversions | 0 | 2 |
| With TSDs of any length | 0 | 0 |
| Total L1 size (bp) | 31 617 | 25 031 |
| Mean of L1 size (bp) | 1322 | 963 |
| Total deletion size (bp) | 17 671 | 14 923 |
| Mean of deletion size (bp) | 736 | 574 |
| Median of deletion size (bp) | 21 | 73 |
Figure 4Size distribution of the L1IMDs. The size distribution of all the L1IMD events identified in the human and chimpanzee lineages is displayed in 500 bp intervals or bins.
L1IMD frequency and polymorphism levels within the human and chimpanzee lineages
| Human | Chimpanzee | Human to chimpanzee ratio | |
|---|---|---|---|
| Total observed L1IMDs | 24 | 26 | 0.92 |
| PCR amplified | 23 | 26 | – |
| Fixed present | 18 | 22 | – |
| Polymorphic loci | 5 | 4 | – |
| Polymorphic fraction | 0.22 | 0.15 | 1.41 |
| Adjusted polymorphic loci | 10 | 8 | |
| Adjusted number of L1IMDs | 29 | 30 |
Figure 5Models for the creation of L1IMDs and formation of deletion associated inverted L1 elements. (A) Formation of small deletions. 5′ overhangs created by inexact cleavage of the top strand by the L1 EN are subject to 5′–3′ exonuclease activity that removes small single-stranded stretches from both the plus and minus strands (dotted light blue lines), which would otherwise have been the templates for the formation of TSDs. Subsequent ligation of the L1 cDNA to the upstream minus-strand sequence and plus-strand sequence synthesis by cellular enzymes results in the creation of small deletions and an L1 insertion without TSDs. (B) Formation of large deletions. For any preexisting double-strand break that has a 3′ overhang (red) for base pairing of the L1 cDNA (blue), a longer cDNA transcript is more likely to contain a stretch of sequence that has adequate complementary bases for annealing (pink) than a shorter one. Subsequent recombinational repair would remove a large segment of the target sequence, extending downstream to the original integration site (dotted black line) and resulting in a L1 insertion without TSDs. (C) Formation of a non-inverted atypical L1 insertion resulting in a large deletion. The L1 mRNA (green) forms a loop, with microhomology stretches within its sequence annealing to each other. The resulting L1 cDNA (blue) has an internal breakpoint (orange) where a stretch of the consensus sequence (complementary to the loop) is missing. Arrows show the orientation of the two parts of the L1 insertion. (D) Formation of a 5′ truncation/inversion resulting in a large deletion. Annealing of the L1 mRNA (green) to a complementary sequence in the 3′ overhang of a preexisting double-strand break leads to the transcription of a second stretch (purple) apart from the original cDNA (blue). Subsequently, both dissociate from the mRNA and form an ‘inversion junction’ (circled in red). Recombinational repair removes the stretch of DNA between the double-strand break and the original site of integration. Plus-strand synthesis results in a 5′ truncated L1 with the inverted portion being reverse complementary to the consensus sequence. Arrows show the orientation of the L1 segments in the inversion.