| Literature DB >> 27158268 |
Dustin C Hancks1, Haig H Kazazian2.
Abstract
Over evolutionary time, the dynamic nature of a genome is driven, in part, by the activity of transposable elements (TE) such as retrotransposons. On a shorter time scale it has been established that new TE insertions can result in single-gene disease in an individual. In humans, the non-LTR retrotransposon Long INterspersed Element-1 (LINE-1 or L1) is the only active autonomous TE. In addition to mobilizing its own RNA to new genomic locations via a "copy-and-paste" mechanism, LINE-1 is able to retrotranspose other RNAs including Alu, SVA, and occasionally cellular RNAs. To date in humans, 124 LINE-1-mediated insertions which result in genetic diseases have been reported. Disease causing LINE-1 insertions have provided a wealth of insight and the foundation for valuable tools to study these genomic parasites. In this review, we provide an overview of LINE-1 biology followed by highlights from new reports of LINE-1-mediated genetic disease in humans.Entities:
Keywords: Alu; Autoimmunity; Cancer; Disease; LINE-1; Retrotransposition; Retrotransposon; SVA
Year: 2016 PMID: 27158268 PMCID: PMC4859970 DOI: 10.1186/s13100-016-0065-9
Source DB: PubMed Journal: Mob DNA
Fig. 1Retrotransposons active in humans. a An autonomous active LINE-1. A full-length LINE-1 ~ 6 kb in length is shown [36, 41, 239]. LINE-1 encodes three proteins, two of which (ORF1p and ORF2p) are absolutely required for retrotransposition in cis [42, 146]. Currently, the role for ORF0p is unclear [60]; interestingly, it may form fusion proteins with downstream coding sequences by utilizing internal splice donor sites (SD) [60]. LINE-1 transcription is driven from its own promoter (big black bent arrow) [53, 54] located in the 5′-UTR. The 5′-UTR also encodes a weaker antisense promoter (ASP, small black bent arrow) [59]. It has been postulated that the LINE-1 ASP in conjunction, with splice acceptors located on the antisense strand of LINE-1, may contribute to new gene formation via a mechanism termed “gene-breaking [240].” Termination of LINE-1 transcription is mediated by a polyA signal (AATAAA) located in the 3′-UTR. Occasionally, transcription proceeds past the internal polyA signal and terminates at a downstream one [139, 241]. Such chimeric transcripts, if retrotransposed, may result in 3′-transductions [42, 62–64, 176]. Majority of insertions end in a polyA tail (AAAn) of variable length [37]. In addition, most insertions are characterized by flanking target-site duplications (4-20 bp in length, black horizontal arrows) [35]. CC-coiled coiled domain [47], RRM-RNA recognition motif [44], CTD-C-terminal domain, EN-endonuclease [51], Z domain [242], RT-reverse transcriptase [52], C-cysteine-rich. AA-amino acid. b The Alu SINE. Alus are small Pol III transcribed RNAs derived from 7SL RNA [243]. An Alu element consists of a left and right monomer, which are derived from an ancient duplication event, separated by an internal A-rich sequence. Alus contain their own transcriptional signals, an A and B box located in the left monomer. Efficient Alu transcription requires a strong enhancer element in the upstream flanking sequence [103, 104]. Transcription termination of an Alu typically occurs at a Pol III terminator (TTTT) located in the downstream flanking sequence [244]. Similar to LINE-1, Alu insertions end in a polyA tail and are flanked by a target-site duplication. c A canonical SINE-VNTR-Alu (SVA) element consisting of its primary domains: CCCTCT hexamer, Alu-like, VNTR, SINE-R derived from the env gene and right LTR from a HERV-K is shown [126]. SVA transcription can initiate upstream (black bent arrow) or in the CCCTCT hexamer (black bent arrow) [126, 127]. Like LINE-1, SVA transcription typically terminates at its own [127] or a downstream polyA signal [24, 65]. d A processed pseudogene (PP) is shown. Note the lack of introns and the presence of a target-site duplication and a 3′-polyA tail similar to LINE-1, Alu, and SVA. e U6 chimera insertion. A U6 snRNA fused with the 3′-end of an LINE-1 sequence formed by “template-switching” [84, 140, 144] is shown. Although the site where ORF2p switches templates varies across the U6 chimera insertions, the junction where the two sequences are joined is typically T-rich [144]
Fig. 2Anatomy of retrotransposon insertions. A variety of structures for retrotransposon insertions (a-k) identified by genomic studies, cell culture retrotransposition assays, and disease-causing insertions that have been reported is shown. Reported frequencies, either from genomic analysis or cell-culture retrotransposition assays, for each structure is located in the upper right hand corner of each panel. If no frequency data has been reported only the element’s name is shown. These structures have provided key insights into the mechanism of target-primed reverse transcription [77], retrotransposon transcript structure [127], and the mechanism by which LINE-1-mediated retrotransposition events contribute to genome evolution [62]. A) LINE-1 target-site. Most insertions occur at asymmetric AT-rich sequences [6, 37, 51, 86]. The first step of TPRT is cleavage of the bottom-strand by ORF2p endonuclease activity at a motif resembling 5′-TTTT/AA-3′ [245]. The nuclease responsible for top-strand cleavage is currently unknown. The nature of the staggered cleavage events generates a target-site duplication (TSD, sequence in bold). a TSD (black horizontal arrows) is used to define the boundaries of an insertion and considered a hallmark of LINE-1-mediated retrotransposition events. b Full-length insertion. It is generally accepted that in order for an element to be retrotransposition-competent it must be full-length. c 5′-truncated insertions. Most LINE-1 s in the human genome are grossly truncated at their 5′-end [6, 36, 37]. In contrast, most Alus [243] and SVA elements are full-length [123, 127]. To date, no consensus sequence has been identified in LINE-1 or SVA insertions regarding the mechanism of 5′-truncation. However, a new report implicates stem-loop structures as a factor driving 5′-truncation in recent Alu insertions [114]. d 3′-transduction. Although the first report of a 3′-transduction was an LINE-1 insertion into the dystrophin gene resulting in Duchenne’s muscular dystrophy in 1994 [176], it would be several years before the significance of this chimeric insertion was uncovered. Several years later, as one of the first insights gained from insertions recovered from cell-culture retrotransposition assays, it was reported that LINE-1 frequently bypassed its own polyA signal (AATAAA) in favor of a downstream one (AATAAA) [42]. Subsequently, elegant experimental analysis revealed that utilizing a downstream polyA signal could result in LINE-1-mediated exon-shuffling [62]. An insertion containing a 3′-transduction will typically contain two homopolymer stretches (AAAn) and contain the 3′-TSD from the source locus (gray horizontal arrow) as part of the transduced sequence. Notably, insertions containing serial 3′-transductions have been reported and can be used to track the evolutionary history of an element [246]. e 5′-end inversions. Another hallmark of LINE-1-mediated retrotransposition events is the inversion of the 5′-end (gray horizontal arrow) of the retrotransposon sequence [35]. Small indels are typically identified at the inversion breakpoint [88]. Inversions have only been reported for LINE-1 s, SVAs, and processed pseudogenes [196]. 5′-end inversion is presumed not to occur for Alus due to their short length. It has been hypothesized that a phenomenon referred to as twin-priming may account for the frequent inversions associated with LINE-1-mediated retrotransposition events [88]. f 5′-transduction. In some instances, LINE-1 [82] or SVA transcription [126, 127] may initiate upstream of the internal promoter generating a chimeric transcript. Retrotransposition of this sequence results in duplication of the sequence 5′- of the source locus at a new genomic location. It has been speculated that 5′-transductions are relatively common for SVA elements due to their weaker internal promoter compared to LINE-1, which has a very strong internal promoter, where only a handful of 5′-transductions have been reported [82]. g Internal priming. Occasionally following bottom-strand cleavage, internal A-rich sequences upstream in the retrotransposon RNA may basepair with the T-rich overhang at the target-site instead of the 3′-polyA tail, followed by first-strand cDNA synthesis by ORF2p [247–249]. These insertions can be deemed a type of 3′-truncation. h Exon-trapping. Retrotransposons are dispersed throughout the genome including intronic sequence. LINE-1, Alu, SVA all have been reported to contain numerous splice sites and be incorporated into the transcriptome [105, 127, 128, 134, 250]. Interestingly, LINE-1 internal splicing can generate a transcript lacking ORF1 but maintaining a functional ORF2 [251]. In some instances, at least for SVA, retrotransposition of chimeric transcripts containing upstream exons may occur [127, 128, 132]. Notably, SVA itself is thought to have originated from alternative splicing from genomic repeats [126] and SVA-related elements (e.g. LAVA, PVA) appear to have acquired distinct 3′-domains via splicing in gibbons [125, 135, 136, 138]. I) 3′-truncation. Premature polyadenylation using either canonical or non-canonical polyadenylation sites results in LINE-1 or SVA RNAs lacking 3′-sequence [127, 252]. If this RNA is retrotransposed, it will result in a 3′-truncated insertion. Consistent with the dispensability of SVA domains [130], 3′-truncations may be more frequent for SVA compared to LINE-1. In principle, 3′-truncated LINE-1 RNAs containing ORF1 coding sequence might be actively retrotransposed as in the case of ORF1 mNEOi in cell culture [144] and the presence of half-LINE-1 (HAL1) insertions in mammalian genomes [253]. j Target-site deletion. Another surprise from cell culture retrotransposition assays was the discovery of large deletions associated with new retrotransposition events [82, 83]. Genomic deletions up to 1 MB have been associated with LINE-1 mediated retrotransposition events in vivo [153]. These insertions occur at a LINE-1 EN cleavage site, are generated by ORF2 reverse-transcriptase activity, and end in a 3-polyA tail. Currently, the mechanism driving 5-targe-site deletions is unclear; yet, it is tempting to speculate that chromatin looping along with cleavage by LINE-1 or another nuclease may play important roles [82, 83]. k Endonuclease-independent (ENi) insertion. Eni insertions were discovered by the Moran lab when carrying out retrotransposition assays in different Chinese Hamster Ovary (CHO) cell lines lacking key DNA repair factors [213]. Frequent retrotransposition was observed for an engineered LINE-1 element construct, with a catalytically inactive EN, in these cells but not HeLa cells. Characterization of recovered insertions revealed LINE-1 integration at genomic sites not resembling the LINE-1 EN consensus cleavage site. In addition, the insertions were typically truncated at both the 5′-and 3′-ends [213]. These data suggest that LINE-1 can serve as a “molecular band-aid” [254] at double-stranded DNA breaks [213–215] and that LINE-1 s lacking a functional EN domain may be able to retrotranspose in certain contexts. Building on these studies it was later reported that LINE-1 s can also integrate at dysfunctional telomeres in an endonuclease-independent manner [216]
Retrotransposition events associated with human disease
| Insertion | Gene | CHR | Reference | Disease | Subfamily | Size | polyA tail length | Truncation | Transduction (bp) | Strand | Exon/Intron/Mechanism | Target-site duplication (TSD) | L1 EN site (5′-TTTT/AA-3′) | Note | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Alu |
| X | Kutsche et al. 2002 [ | ALD | AluYb9 | 98 | 20 | Y/5′TR | N | S | 4.7 kb Deletion | No TSD | ATTT/GT | |
| 2 | Alu |
| X | Gu et al. 2007 [ | Menkes Disease | AluYa5a2 | 282 | 89 | N | N | AS | E | AAAAAGGACAGC | TTTT/AT | |
| 3 | Alu |
| X | Lester et al. 1997 [ | XLA | AluY | N/A | N/A | N/A | N | AS | E | N/A | N/A | |
| 4 | Alu |
| X | Conley et al. 2005 [ | XLA | AluY | 281 | 74 | N | N | S | E | AGAAATGTATGAGTAAGT | TTCT/AT | Same insertion site Conley et al. SVA |
| 5 | Alu |
| X | Apoil et al. 2007 [ | HIGM | AluYb8 | 292 | 8 | N | N | AS | E | AAAAATTTTC | TTTT/AT | |
| 6 | Alu |
| X | Claverie-Martin et. al. 2003 [ | Dent’s Disease | AluYa5 | 281 | 50 | N | N | S | E | AGAAAATGCTCGAAAGA | TTCT/AT | |
| 7 | Alu |
| 1 | Masson et. al. 2013 [ | Chronic pancreatitis | Alu | 31 | 11 | Y/5′TR | N | AS | 53.9 kb Deletion | N/A | TCTT/AT | Deletes entire CTRC and ELA2A genes |
| 8 | Alu |
| 1 | Lesmana et. al. 2015 [ | Severe Hereditary Nonspherocytic Hemolytic Anemia | Yb8 | 288 | 70 | N | N | S | E | AAGATCATCAGCAAA | TCTT/GA | consanguinity, consensus Yb8 |
| 9 | Alu |
| X | Sukarova et. al. 2001 [ | Hemophilia A | AluYb8 | 290 | 47 | N | N | AS | 3 nt Deletion | No TSD | TTTC/AT | |
| 10 | Alu |
| X | Ganguly et. al. 2003 [ | Hemophilia A | AluYb9 | 288 | 37 | N | N | AS | I/Splicing | AAAAACCAACAGG | TTTT/AT | Consensus Yb9 |
| 11 | Alu |
| X | Green et. al. 2008 [ | Hemophilia A | AluYb8 | FL | N/A | N | N | AS | E | N/A | ||
| 12 | Alu |
| X | Vidaud et al. 1993 [ | Hemophilia B | AluYa5a2 | 244 | 78 | Y/5′TR | N | S | E | AAGAATGGCAGATGCGA | TCTT/AA | Same insertion site as Wulff et al. Alu |
| 13 | Alu |
| X | Wulff et al. 2000 [ | Hemophilia B | AluYa5a2 | 237 | 39 | Y/5′TR | N | S | E | AAGAATGGCAGATGC | TCTT/AA | Same insertion site as Vidaud et al. Alu |
| 14 | Alu |
| X | Li et al. 2001 [ | Hemophilia B | AluY | 279 | 40 | Y/5′TR | N | AS | E | AAGAAACTGGTCCC | TCTT/AA | |
| 15 | Alu |
| X | Zhang et al. 2000 [ | GKD | AluYc1 | 241 | 74 | Y/5′TR | N | AS | I | AAAAAATAAG | TTTT/AA | |
| 16 | Alu |
| X | Lester et al. 1997 [ | XSCID | AluYa5 | N/A | N/A | N/A | N | AS | I | N/A | N/A | |
| 17 | Alu |
| 1 | den Hollander et al. 1999 [ | RP | AluY | 244 | 70 | Y/5′TR | N | AS | E | AAGAGTAAAGATGA | TCTT/GA | |
| 18 | Alu |
| 1 | Beauchamp et al. 2000 [ | Type 1 ATP | Alu | 6 | 40 | Y/5′TR | N | AS | 1.4 kb Deletion | N/A | TTCT/AT | Shortest Alu insertion |
| 19 | Alu |
| 2 | Taşkesen et al. 2012 [ | Alström syndrome | AluYa5 | 257 | 76 | Y/5′TR | N | S | E | AAAAGCCTAGAGAA | TTTT/AA | |
| 20 | Alu |
| 2 | Kloor et al. 2004 [ | HNPCC | AluJ | 85 | 40 | Y/5′TR | N | S | E | N/A | N/A | Contains extra 99 nt 3′-of Alu, may be transduction or recombination |
| 21 | Alu |
| 2 | Qian et al. 2015 [ | Hereditary Cancer | N/A | N/A | N/A | N/A | N/A | N/A | E | N/A | N/A | Pan-cancer panel testing |
| 22 | Alu |
| 2 | Ishihara et al. 2004 [ | MWS | AluYa5 | 281 | 93 | N | N | S | E | AAAATTAAAACA | TTTT/AA | |
| 23 | Alu |
| 3 | Muratani et al. 1991 [ | Cholinesterase deficiency | AluYb9 | 289 | 38 | N | N | S | E | AAAAATATTTTTTCC | TTTT/AA | |
| 24 | Alu |
| 3 | Janicic et al. 1995 [ | FHH and NSHPT | AluYa5 | 280 | 93 | N | N | AS | E | GAAAGCGTGAGCTGC | TTTC/AA | |
| 25 | Alu |
| 3 | Sobrier et al. 2005 [ | Anterior Pituitary Aplasia | AluYb8 | 288 | 30 | N | N | S | E | AGAAAATGTCTTTAGA | TTCT/AA | |
| 26 | Alu |
| 3 | Gallus et al. 2010 [ | ADOA | AluYb8 | 289 | 25 | N | N | AS | I/Splicing | AAAAATTTTAAAAAGTT | TTTT/AC | |
| 27 | Alu |
| 5 | Economou-Pachnis and Tsichlis 1985 [ | Associated with leukemia | AluYa5 | 280 | 26 | N | N | AS | I | GAAAATGT | TTTC/AT | |
| 28 | Alu |
| 5 | Halling et al. 1999 [ | Hereditary desmoid disease | AluYb8 | 278 | 40 | Y/5′TR | N | S | E | AAGAATAATG | TCTT/AA | Same insertion site as Miki et al. L1 |
| 29 | Alu |
| 5 | Su et al. 2000 [ | FAP | AluYb9 | 93 | 60 | Y/5′TR | N | AS | I/Splicing | No TSD | TTTT/AA | 1.6 kb intronic deletion |
| 30 | Alu |
| 5 | Qian et al. 2015 [ | Hereditary Cancer | N/A | N/A | N/A | N/A | N/A | N/A | E | N/A | N/A | Pan-cancer panel testing |
| 31 | Alu |
| 6 | Tucker et al. 2011 [ | RP | AluYb8 | 281 | 57 | N | N | AS | E | AAAGAAAAAA | CTTT/AA | Identified by exome resequence |
| 32 | Alu |
| 7 | Manco et al. 2006 [ | Chronic hemolytic anemia | Alu Ya5 | 281 | 36 | N | N | S | E | AAGAATGGCAGATGG | TCTT/AA | |
| 33 | Alu |
| 7 | Chen et al. 2008 [ | Cystic Fibrosis | AluY | 46 | 57 | Y/5′TR | N | AS | E | AAGAATCCCACCTATAAT | TCTT/AA | |
| 34 | Alu |
| 7 | Chen et al. 2008 [ | Cystic Fibrosis | AluYa5 | 281 | 56 | N | N | S | E | AATAGAAATGATTTTTGTC | TCTC/AT | 3′-Processing of (5′-CTC-3′) |
| 35 | Alu |
| 8 | Abdelhak et al. 1997 [ | BOR syndrome | AluYa5 | n/a | 97,31 | N/A | N | AS | E | AAAAAATAAATGTGTG | TTTT/AA | PolyA tail shortening between generations |
| 36 | Alu |
| 8 | Okubo et al. 2007 [ | LPL deficiency | AluYb9 | 150 | 60 | Y/5′TR | N | AS | 2.2 kb Deletion | No TSD | TTTT/AA | |
| 37 | Alu |
| 8 | Udaka et al. 2007 [ | CHARGE syndrome | AluYa5/8 | 75 | 100 | Y/5′TR | N | S | 10 kb Deletion | No TSD | ATTT/AA | |
| 38 | Alu |
| 9 | Bouchet et al. 2007 [ | Walker Warburg syndrome | AluYa5 | 290 | 53 | N | N | AS | E | AAAAAGAGATGTACTG | TTTT/AC | |
| 39 | Alu |
| 10 | Oldridge et al. 1999 [ | Apert syndrome | AluYa5 | 283 | 69 | N | N | AS | I/Splicing | AGAAAACAAGGGAAGCA | TTCT/AG | |
| 40 | Alu |
| 10 | Oldridge et al. 1999 [ | Apert syndrome | AluYb8 | 288 | 47 | N | N | AS | E | AGAATTACCCGCCAAG | TTCT/AT | |
| 41 | Alu |
| 10 | Bochukova et al. 2009 [ | Apert syndrome | AluYk13 | 214 | 12 | Y/5′TR | N | AS | E | AAAAGTTACATTCCG | TTTT/GA | |
| 42 | Alu |
| 10 | Tighe et al. 2002 [ | ALPS | AluYa5 | 281 | 33 | N | N | AS | I | AGAATATTCTAAATGTG | TTCT/AA | |
| 43 | Alu |
| 11 | Stoppa-Lyonnet et al. 1990 [ | HAE | AluYc1 | 285 | 42 | N | N | S | I | AAAAATACAAAAATTAG | TTTT/AG | |
| 44 | Alu |
| 11 | Mustajoki et al. 1999 [ | AIP | AluYa5 | 279 | 39 | N | N | AS | E | AAGAATCTTGTCCC | TCTT/GA | |
| 45 | Alu |
| 11 | Qian et al. 2015 [ | Hereditary Cancer | N/A | N/A | N/A | N/A | N/A | N/A | E | N/A | N/A | Pan-cancer panel testing |
| 46 | Alu |
| 12 | Tappino et al. 2008 [ | ML II | AluYa5 | 279 | 17 | N | N | AS | E | AAAAACAACAACTGAG | TTTT/GA | |
| 47 | Alu |
| 13 | Miki et al. 1996 [ | Breast Cancer | AluYc1 | 281 | 62 | N | N | S | E | AATCACAGGC | GATT/AT | |
| 48 | Alu |
| 13 | Teugels et al. 2005 [ | Breast Cancer | AluYa5 | 285 | N/A | N | N | S | E | AAGAATCTGAACAT | TTCT/GC | 3′ Processing 2 nt (5′-CT-3′) |
| 49 | Alu |
| 13 | Qian et al. 2015 [ | Hereditary Cancer | N/A | N/A | N/A | N/A | N/A | N/A | E | N/A | N/A | Pan-cancer panel testing |
| 50 | Alu |
| 13 | Qian et al. 2015 [ | Hereditary Cancer | N/A | N/A | N/A | N/A | N/A | N/A | E | N/A | N/A | Pan-cancer panel testing |
| 51 | Alu |
| 13 | Qian et al. 2015 [ | Hereditary Cancer | N/A | N/A | N/A | N/A | N/A | N/A | E | N/A | N/A | Pan-cancer panel testing |
| 52 | Alu |
| 13 | Qian et al. 2015 [ | Hereditary Cancer | N/A | N/A | N/A | N/A | N/A | N/A | E | N/A | N/A | Pan-cancer panel testing |
| 53 | Alu |
| 13 | Qian et al. 2015 [ | Hereditary Cancer | N/A | N/A | N/A | N/A | N/A | N/A | E | N/A | N/A | Pan-cancer panel testing |
| 54 | Alu |
| 13 | Qian et al. 2015 [ | Hereditary Cancer | N/A | N/A | N/A | N/A | N/A | N/A | E | N/A | N/A | Pan-cancer panel testing |
| 55 | Alu |
| 13 | Qian et al. 2015 [ | Hereditary Cancer | N/A | N/A | N/A | N/A | N/A | N/A | E | N/A | N/A | Pan-cancer panel testing |
| 56 | Alu |
| 13 | Qian et al. 2015 [ | Hereditary Cancer | N/A | N/A | N/A | N/A | N/A | N/A | E | N/A | N/A | Pan-cancer panel testing |
| 57 | Alu |
| 16 | Schollen et al. 2007 [ | CDG-Ia | AluYb8 | 263 | 10 | Y/5′TR | N | AS | 28 kb Deletion | No TSD | TTTT/AA | |
| 58 | Alu |
| 16 | Qian et al. 2015 [ | Hereditary Cancer | N/A | N/A | N/A | N/A | N/A | N/A | E | N/A | N/A | Pan-cancer panel testing |
| 59 | Alu |
| 17 | Peixoto et al. 2013 [ | Breast and Ovarian Cancer Family | AluYc | 191 | 60 | Y/5′TR | N | AS | 23.3 kb Deletion | No TSD | CTTT/AG | |
| 60 | Alu |
| 17 | Teugels et al. 2005 [ | Breast Cancer | AluS | 286 | N/A | N | N | S | E | GAAAAAGAATCTGCTTT | TTTC/GA | |
| 61 | Alu |
| 17 | Qian et al. 2015 [ | Hereditary Cancer | N/A | N/A | N/A | N/A | N/A | N/A | E | N/A | N/A | Pan-cancer panel testing |
| 62 | Alu |
| 17 | Wallace et al. 1991 [ | NF1 | AluYa5 | 282 | 40 | N | N | AS | I/Splicing | AAAAAAAAAAACAT | TTTT/AA | First report of de novo Alu insertion |
| 63 | Alu |
| 17 | Wimmer et al. 2011 [ | NF1 | AluY | 280 | N/A | N | N | S | I | AAAAAATTCAG | TTTT/AA | Same insertion site as Wimmer et al.a |
| 64 | Alu |
| 17 | Wimmer et al. 2011 [ | NF1 | AluY | 281 | N/A | N/A | N | AS | I | N/A | ||
| 65 | Alu |
| 17 | Wimmer et al. 2011 [ | NF1 | AluYa5 | 282 | 60 | N | N | S | E | ATAAATAGCCTGGA | TTAT/AA | |
| 66 | Alu |
| 17 | Wimmer et al. 2011 [ | NF1 | AluYa5 | 284 | 120 | N | N | AS | E | AAAAAACTTGCT | TTTT/GA | Same insertion site as Wimmer et al.c |
| 67 | Alu |
| 17 | Wimmer et al. 2011 [ | NF1 | AluYa5 | 281 | N/A | N | N | AS | E | AAAAAACTTGCTGATGG | TTTT/GA | Same insertion site as Wimmer et al.c |
| 68 | Alu |
| 17 | Wimmer et al. 2011 [ | NF1 | AluYa5 | 284 | 110 | N | N | AS | E | AATAAAACCTAAAGA | TATT/GA | |
| 69 | Alu |
| 17 | Wimmer et al. 2011 [ | NF1 | AluYa5 | 279 | N/A | N | N | S | E | AAAAGAAGAACATAT | TTTT/GT | Same insertion site as Wimmer et al.b |
| 70 | Alu |
| 17 | Wimmer et al. 2011 [ | NF1 | AluYa5 | 264 | 60-85 | Y/5′TR | N | AS | E | AAGAAGTGCGGTACCT | TCTT/GA | |
| 71 | Alu |
| 17 | Wimmer et al. 2011 [ | NF1 | AluYb8 | 249 | 121 | Y/5′TR | N | S | E | AAAGCAGTGC | CTTT/AT | |
| 72 | Alu |
| 17 | Wimmer et al. 2011 [ | NF1 | AluYb8 | 288 | N/A | N | N | AS | I | AAAAAAGAGAAAGACAA | TTTT/AA | Same insertion site as Wimmer et al.a |
| 73 | Alu |
| 17 | Wimmer et al. 2011 [ | NF1 | AluYb8 | 289 | 120 | N | N | AS | E | AACAATGGTCTT | TGTT/AA | |
| 74 | Alu |
| 17 | Wimmer et al. 2011 [ | NF1 | AluYb8 | 288 | 78-178 | N | N | S | E | AAACAATGATGTTA | TTTC/AA | 3′ Processing of 1 nt (C) |
| 75 | Alu |
| 17 | Wimmer et al. 2011 [ | NF1 | AluYb8 | 288 | 118 | N | N | S | E | AAAAGAAGAACATAT | TTTT/GT | Same insertion site as Wimmer et al.b |
| 76 | Alu |
| 17 | Wimmer et al. 2011 [ | NF1 | AluYb8 | 268 | 121 | Y/5′TR | N | AS | I | AAAAAACAAACAAACA | TTTT/GT | |
| 77 | L1 |
| X | Meischl et al. 1998 [ | CGD | L1 Ta | 1722 | 101 | Y/5′TR | Y (280) | S | E | AA | TGTT/GA | Maternal Meiosis I |
| 78 | L1 |
| X | Meischl et al. 2000 [ | CGD | L1 Ta | 836 | 69 | Y/5′TR/INV | N | S | I/Splicing | AGAAATAACTATTTAA | TTCT/AA | |
| 79 | L1 |
| X | van den Hurk et al. 2003 [ | Choroideremia | L1 Ta | 6017 | 71 | FL | Y (119/406) | AS | E | AGAAGATCAATTAG | TTCT/AA | Insertion in Early Development |
| 80 | L1 |
| X | Musova et al. 2006 [ | DMD | L1 Ta | 452 | 41 | Y/5′TR/INV | N | AS | E | AAATATCTTTATATCA | ATTT/AA | |
| 81 | L1 |
| X | Narita et al. 1993 [ | DMD | L1 Ta | 608 | 16 | Y/5′TR | N | AS | E | No TSD | TCTT/AA | 2 nt deletion |
| 82 | L1 |
| X | Holmes et al. 1994 [ | DMD | L1 Ta | 1400 | 38 | Y/5′TR/INV | Y(489) | S | E | AAATCATCTGCTGCT | ATTT/AA | First Report of L1 3′TR |
| 83 | L1 |
| X | Yoshida et al. 1998 [ | XLDCM | L1 Ta | 530 | 73 | Y/5′TR | N | AS | 5′-UTR/Loss of mRNA | AAAAAAAACCTGGTAAA | TTTT/AT | Tissue specific loss of mRNA |
| 84 | L1 |
| X | E Bakker & G van Omenn, personal communication | DMD | N/A | 878 | N/A | Y/5′TR | N | S | N/A | N/A | N/A | |
| 85 | L1 |
| X | Awano et al. 2010 [ | DMD | L1 Ta | 212 | 118 | Y/5′TR | Y (212) | AS | E | GAA | TTTC/AA | Orphan 3′-transduction |
| 86 | L1 |
| X | Kazazian et al. 1988 [ | Hemophilia A | L1 Ta | 3800 | 54 | Y/5′TR | N | S | E | AAAGACAAACAAAAC | CTTT/AA | First report of de novo L1 insertion |
| 87 | L1 |
| X | Kazazian et al. 1988 [ | Hemophilia A | L1 preTa | 2300 | 77 | Y/5′TR/INV | N | AS | E | AATGTTTCCTTCTTTTC | CATT/AA | |
| 88 | L1 |
| X | Li et al. 2001 [ | Hemophilia B | L1 Ta | 463 | 68 | Y/5′TR | N | S | E | AAAAATAGTGCTGATA | TTTT/AC | |
| 89 | L1 |
| X | Mukherjee et al. 2004 [ | Hemophilia B | L1 Ta | 163 | 125 | Y/5′TR | N | S | E | GAAAAATGGATTGT | TTTC/AT | |
| 90 | L1 |
| X | Schwahn et al. 1998 [ | XLRP | L1 Ta | 6000 | 64 | FL | N | S | I/Loss of mRNA | AAGACTGTAAGGTG | TCTT/AA | Interrupted polyA |
| 91 | L1 |
| X | Martinez-Garay et al. 2003 [ | Coffin-Lowry syndrome | L1 Hs | 2800 | Yes | Y/5′TR/INV | N | AS | E | AAGAAAACCTGCATTT | TCTT/AG | |
| 92 | L1 |
| 3 | Samuelov et al. 2011 [ | CDS | N/A | FL | N/A | N | N/A | N | I/Splicing | N/A | N/A | |
| 93 | L1 |
| 3 | Qian et al. 2015 [ | Hereditary Cancer | N/A | N/A | N/A | N/A | N/A | N/A | E | N/A | N/A | Pan-cancer panel testing |
| 94 | L1 |
| 3 | Qian et al. 2015 [ | Hereditary Cancer | N/A | N/A | N/A | N/A | N/A | N/A | E | N/A | N/A | Pan-cancer panel testing |
| 95 | L1 |
| 5 | Miki et al. 1992 [ | Colon cancer | L1Ta | 520 | 222 | Y/5′TR/INV | N | S | E | AAGAATAATG | TCTT/AA | Somatic Insertion/same insertion site as Halling et al. Alu |
| 96 | L1 |
| 8 | Morisada et al. 2010 [ | BOR syndrome | L1 Hs | 3756 | None | Y/3′TR | N | AS | 17 kb Deletion | No TSD | TCTC/AG | Internal Priming |
| 97 | L1 |
| 9 | Kondo-Iida et al. 1999 [ | FCMD | L1Ta | 1200 | 59 | Y/5′TR | N | S | I/Splicing/6 nt Deletion | No TSD | TTTT/AA | |
| 98 | L1 |
| 9 | Bernard et al. 2009 [ | AOA2 | L1 Hs | 1300 | 42 | Y/5′TR/INV | N | S | E | GGAAGAATGTGAACTGGCTA | TTCC/AG | 3′-processing 2 nt (5′-CC-3′) |
| 99 | L1 |
| 10 | Helman et al. 2014 [ | endometrial carcinoma | L1 Hs | 90 | 22 | Y/5′TR | N | S | E | AAAGAATCATCTGGATTATAG | CTTT/AA | Somatic Insertion |
| 100 | L1 |
| 11 | Divoky et al. 1996 [ | β-thalassemia | L1 Ta | 6000 | 107 | FL | N | AS | I | AAAATAAAAGCAGA | TTTT/AT | |
| 101 | L1 |
| 11 | Mine et al. 2007 [ | PDHc deficiency | L1 Hs | 6086 | 67 | FL | N | S | 46 kb Deletion | No TSD | TTTT/AT | |
| 102 | L1 |
| 12 | Kagawa et al. 2015 [ | Rotor syndrome | L1 Ta-1d | 5989 | 100 | Near FL | N | S | I/Splicing | AAGAATTAATAGTGACAGT | TCTT/AC | 0.054 Japanese Allele Frequency, may be “Hot L1” |
| 103 | L1 |
| 13 | Rodriguez-Martin et al.2016 [ | Familial Retinoblastoma | L1 Ta-1d | 6044 | 33 | FL | N | S | I/Splicing | AAATTATCTGTTTC | ATTT/AA | N/A |
| 104 | L1 |
| 17 | Wimmer et al. 2011 [ | NF1 | L1 preTa | 1800 | N/A | Y/5′TR | N | S | E | AAAAACGAAACTGTGT | TTTT/AT | Untemplated 3′- T? |
| 105 | L1 |
| 17 | Wimmer et al. 2011 [ | NF1 | L1 Ta | 6000 | N/A | FL | N | S | E | AAAAATCGAGGG | TTTT/AA | Untemplated 3′- T? |
| 106 | L1 |
| 17 | Wimmer et al. 2011 [ | NF1 | N/A | 2200 | N/A | Y/5′TR/INV | N | AS | I/Splicing | AAGAAAATGGT | TCTT/AA | |
| 107 | SVA |
| X | Rohrer et al. 1999 [ | XLA | N/A | 251 | 92 | Y/5′TR | N | S | E | AGAAATGTATGAGTAA | TTCT/AT | Same insertion site as Conley et. al. Alu |
| 108 | SVA |
| X | Makino et al. 2007 [ | XDP | F | 2627 | 62 | FL | N | AS | I | AAAAAAAAAAAATGAAATAG | TCCT/AT | 3′-Processing 3 nt (5′-CCT-3′) |
| 109 | SVA |
| X | Nakamura et.al. 2015 [ | Hemophilia B | F | 2524 | 28 | FL | N | AS | E | AAATGGCACTAGAA | TTCC/AT | 3′-Processing 1 nt (5′-C-3′) |
| 110 | SVA |
| 1 | Wilund et al. 2002 [ | ARH | E | 2600 | 57 | FL | N | S | I/Splicing | GAAACCTGTTTTCTC | TTTC/AA | |
| 111 | SVA |
| 1 | Hassoun et al. 1994 [ | HE and HPP | E | 632 | 50 | Y/5′TR/INV | Y (183/599) | S | E | GAAATTTGAAGACTTCCAAGT | TTTC/AA | Orphan 3′-transduction |
| 112 | SVA |
| 2 | Stacey et al. 2016 [ | Breast Cancer Susceptibility | E | 2782 | N/A | FL | N | AS | I/Decreased RNA | AAGAATTTGA | TCTT/AT | Protective against prostate cancer; active locus? |
| 113 | SVA |
| 3 | Nazaryan et al. 2015 [ | Chromothripsis | E | 502 | None | Y/5′TR (VNTR) | N | AS | I | N/A | TTTT/GA | First report of large scale rearrangement and an insertion. Implicates retrotransposition in germline chromothripsis. |
| 114 | SVA |
| 6 | Takasu et al. 2007 [ | Leukemia | F1 | 2000 | 45 | FL | N/A | AS | 14 kb Deletion | N/A | CCTT/AG | Novel SVA subfamily (F1) |
| 115 | SVA |
| 7 | van der Klift et al. 2012 [ | Lynch syndrome | F | 2200 | 64 | Y/5′TR (VNTR) | N | S | I/Splicing | AAGAATGTGCCATGTGA | TCTT/AA | SVA exonization |
| 116 | SVA |
| 9 | Kobayashi et al. 1998 [ | FCMD | E | 3023 | 32 | FL | N | S | 3′UTR/Splicing | AAGAAAAAAAAAATTGT | TCTT/AA | |
| 117 | SVA |
| 11 | Akman et al. 2010 [ | NLSDM | E | 1800 | 44 | Y/5′TR | N | S | E | AAAGAGGCCCGG | CTTT/AG | |
| 118 | SVA |
| 17 | Vogt et al. 2014 [ | NF1 | F1 | 1700 | 23 | Y/5′TR (VNTR) | Y (282/160) | AS | I/Deletion of NF1 | N/A | TTTT/AC | Largest reported insertion associated deletion (~1 Mb), somatic |
| 119 | SVA |
| 17 | Vogt et al. 2014 [ | NF1 | F | 1300 | 40 | Y/5′TR (VNTR) | N | AS | I/Deletion of NF1 | N/A | CTTT/AC | 867 kb deletion, somatic |
| 120 | Processed Pseudogene |
| X | de Boer et al. 2014 [ | CGD | N/A | 5739 | 100 | FL | No | AS | I/Splicing | AAAACTCAAAGACTC | TTTT/AA | First reported de novo processed pseudogene (TMF1) |
| 121 | pA |
| X | Segal et al. 1999 [ | Alport syndrome | N/A | N/A | 70 | N/A | N/A | AS | 13.4 kb Deletion | No TSD | TTCT/AT | |
| 122 | pA |
| 4 | Jalanko et al. 1995 [ | AGU | N/A | N/A | 37 | N/A | N/A | AS | 2 kb Deletion | No TSD | TTCT/AA | |
| 123 | pA |
| 13 | Wang et al. 2001 [ | Breast Cancer | N/A | N/A | 35 | N/A | N/A | S | 6.2 kb Deletion | No TSD | TTCT/AA | |
| 124 | pA |
| 17 | Wimmer et al. 2011 [ | NF1 | N/A | 130 | 120 | N/A | N/A | AS | E | AAGAAA | TCTTNAA |
Data for this table were compiled from the primary references listed and reports prior to 2009 are reviewed in the following: Ostertag and Kazazian 2001 [35], Chen et al. 2006 [150], Belancio et al. 2008 [151], Hancks and Kazazian 2012 [86]
A few insertions were left off the list as they were common polymorphisms or did not cause disease. The following websites and databases were used in the analysis: http://www.repeatmasker.org/, Repbase (http://www.girinst.org/), http://dbrip.brocku.ca/, The following symbols, a,b,c, indicate same insertion site in Wimmer et al. [296]
Abbreviations: TR truncation, INV inversion, E exon, FL full-length, I intron
Disease acronyms: ADOA Autosomal dominant optic atrophy, AGU Aspartylglucosaminuria, AIP Acute intermittent porphyria, ALD Adrenoleukodystrophy, ALPS Autoimmune lymphoproliferative syndrome, AOA2 Ataxia with oculomotor apraxia 2, ARH Autosomal recessive hypercholesterolemia, BOR Branchio-oto-renal syndrome, CDG-Ia Congenital disorders of glycosylation type Ia, CDS Chanarin-Dorfman syndrome, CGD Chronic granulomatous disease, DMD Duchenne muscular dystrophy, FAP Familial adenomatous polyposis, FCMD Fukuyama-type congenital muscular dystrophy, FHH and NSHPT Familial hypocalciuric hypercalcemia and neonatal severe hyperparathyroidism, GKD Glycerol kinase deficiency, HAE Hereditary form of angioedema, HE and HPP Hereditary elliptocytosis and hereditary pyropoikilocytosis, HIGM Hyper-immunoglobulin M syndrome, HNPCC Hereditary non-polyposis colorectal cancer syndrome, LPL Lipoprotein lipase, MLII Mucolipidosis Type II, MWS Mowat-Wilson syndrome, NF1 Neurofibromatosis Type I, PDHc Pyruvate dehydrogenase complex deficiency, NLSDM Neutral lipid storage disease with subclinical myopathy, RP Retinitis pigmentosa, Type 1 ATP Type 1 antithrombin deficiency, XDP X-linked dystonia-parkinsonism, XLA X-linked agammaglobulinemia, XLDCM X-linked dilated cardiomyopathy, XLRP X-linked retinitis pigmentosa, XSCID X-linked severe combined immunodeficiency