| Literature DB >> 22125493 |
Katharina Wimmer1, Tom Callens, Annekatrin Wernstedt, Ludwine Messiaen.
Abstract
Long interspersed (L1) and Alu elements are actively amplified in the human genome through retrotransposition of their RNA intermediates by the -100 still retrotranspositionally fully competent L1 elements. Retrotransposition can cause inherited disease if such an element is inserted near or within a functional gene. Using direct cDNA sequencing as the primary assay for comprehensive NF1 mutation analysis, we uncovered in 18 unrelated index patients splicing alterations not readily explained at the genomic level by an underlying point-mutation or deletion. Improved PCR protocols avoiding allelic drop-out of the mutant alleles uncovered insertions of fourteen Alu elements, three L1 elements, and one poly(T) stretch to cause these splicing defects. Taken together, the 18 pathogenic L1 endonuclease-mediated de novo insertions represent the largest number of this type of mutations characterized in a single human gene. Our findings show that retrotransposon insertions account for as many as -0.4% of all NF1 mutations. Since altered splicing was the main effect of the inserted elements, the current finding was facilitated by the use of RNA-based mutation analysis protocols, resulting in improved detection compared to gDNA-based approaches. Six different insertions clustered in a relatively small 1.5-kb region (NF1 exons 21(16)-23(18)) within the 280-kb NF1 gene. Furthermore, three different specific integration sites, one of them located in this cluster region, were each used twice, i.e. NM_000267.3(NF1):c.1642-1_1642 in intron 14(10c), NM_000267.3(NF1):c.2835_2836 in exon 21(16), and NM_000267.3(NF1):c.4319_4320 in exon 33(25). Identification of three loci that each served twice as integration site for independent retrotransposition events as well as 1.5-kb cluster region harboring six independent insertions supports the notion of non-random insertion of retrotransposons in the human genome. Currently, little is known about which features make sites particularly vulnerable to L1 EN-mediated insertions. The here identified integration sites may serve to elucidate these features in future studies.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22125493 PMCID: PMC3219598 DOI: 10.1371/journal.pgen.1002371
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
List of all Alu, L1 and poly(T) insertions uncovered in the NF1 gene.
| No. | Patient no. | Location of insertion | Inserted element ( |
| Orientation of the insert with regard to NF1 reading frame | length of poly (A) tail (bp) | Mutation according HGVS nomenclature | Effect on mRNA | splice type as depicted in |
| 1 | UAB-R10408 | IVS 14 (10c) |
| 280/N | S | no estimate | c.1642-1_1642insAluY,1642-11_1642-1dup | r.1642_1721delIF skip E15 (E11) | 1 |
| 2 | UAB-R119201 | IVS 10 (8) |
| 281/N | AS | no estimate | c.1186-86_1186-16delinsAluY | r.1186_1260del IF skip E11 (9) | 1 |
| 3 | MUI-1 | E6 (4c) |
| 282/N | S | ∼60 bp | c.650_651dupinsAluYa5, 637_650dup | r.587_654delIF skip E6 (E4c) | 2 |
| 4 | UAB-R37305 | E21 (16) |
| 284/N | AS | 120 bp | c.2835_2836insAluYa5, 2824_2835dup | r.2618_2850delskip last 233 nt E21 (E16) | 3 |
| 5 | UAB-R37616 | E21 (16) |
| 281/N | AS | no estimate | c.2835_2836insAluYa5, 2819_2835dup | r.2618_2850delskip last 233 nt E21 (E16) | 3 |
| 6 | UAB-R30609 | E22 (17) |
| 284/N | AS | 110 bp | c.2858_2859insAluYa5,2851-7_2858dup | r.2851_2990delOOF skip E22 (E17) | 2 |
| 7 | UAB-R81017 | E33 (25) |
| 279/N | S | no estimate | c.4319_4320insAluYa5,4305_4319dup | r.4270_4367delOOF skip E33 (E25) | 2 |
| 8 | MUI-2 | E47 (38) |
| 264/Y | AS | 60–85 bp | c.6951_6952ins AluYa5,6936_6951dup | r.[6859_6999del;6938_6999del]IF skip E47 (E38); skip last 62 nt of E47 (E38) | 2&3 |
| 9 | UAB-R869001 | E12 (10a) |
| 249/Y | S | 121 bp | c.1354_1355insAluYb8, dup1345_1354dup | r.1355_1392delins61skip of the last 38 nt E12 (E10a) & insertion of first 61 nt of the inserted Alu element | 5 |
| 10 | UAB-R164201 | IVS 14 (10c) |
| 288/N | AS | no estimate | c.1642-12_1642-11insAluYb8,1642-28_1642-12dup | r.1642_1721del80OOF skip E15 (11) | 1 |
| 11 | UAB-R39428 | E21 (16) |
| 289/N | AS | 120 bp | c.2439_2440insAluYb8, 2428_2439dup | r.2410_2638delskip first 229 nt of E21 (E16) | 4 |
| 12 | UAB-R50109 | E22 (17) |
| 288/N | S | 78–178 bp | c.2979_2980insAluYb8, 2966_2979dup | r.2851_2990delOOF skip E22 (E17) | 2 |
| 13 | UAB-R340101 | E33 (25) |
| 288/N | S | 118 bp | c.4319_4320insAluYb8, 4305_4319dup | r.4270_4367delOOF skip E33 (E25) | 2 |
| 14 | UAB-R07118 | IVS48 (39) |
| 268/Y | AS | 121 bp | c.7127-5_7127_4insAluYb8,7127-20_7127-5dup | r.7127_7258delIF skip E49 (40) | 1 |
| 15 | UAB-R75103 | E25 (19b) | poly(T) strech | ∼130/Y | AS | 120 bp | c.3312_3313insT(n∼120),3307_3312dup | r.3275_3314delskip last 40 nt of E25 (E19b) | 3 |
| 16 | UAB-R316001 | E23 (18) | L1(preTa) | ∼1800/Y | S | no estimate | c.3048_3049insTGTGAATTinsL1(Ta)_0rf2, 3033_3048dup | r.2991_3113delIF skip E23 (E18) | 2 |
| 17 | UAB-R01429 | E39 (30) | L1(Ta) | ∼6000/N | S | no estimate | c.5606_5607insL1, 5594_5606dup | r.5607_5749delins96 skip of the last 143 nt of E39 (30) & insertion of first 96 nt of the inserted L1 element | 5 |
| 18 | UAB-R91409 | IVS9 (7) | n.k. | ∼2200/Y | AS | no estimate | c.1062+195_1062+196insL1, 1062+185_1062+195dup | r.889_1062delins130OOF skip E9 (7) & ins 130 nts of L1 element | 6 |
Abbreviations: IF = is in frame; OOF = out of frame; E = exon; IVS = intron; Y = yes; N = no; S = sense; AS = antisense; n.k. = not known; nt = nucleotides.
Figure 1Example of de novo Alu insertion detection by improved PCR conditions.
A) Agarose gel with PCR products of exon 6 (4c) generated with two different primer pairs and PCR conditions generating a 403-bp (short) and a 904-bp (long) wild type PCR product (see Table S1). PCR products were generated from a control individual (C) and the patient (P) harboring an AluYa5 insertion in exon 6 (4c) (W = water control). Amplification of the large PCR product from the patient revealed the presence of extra bands of increased size, i.e. a ∼1200-bp band representing the homoduplex of the PCR product from the mutant allele and two larger bands probably derived from heteroduplexes of this PCR product with the PCR product from the wild type allele, which are neither present in the control nor in the shorter amplified PCR products. (B) Sequences of the short and long PCR products from the control individual and the patient; the sequence of the long PCR product but not of the short PCR product of the patient shows starting at nucleotide position c.651 (vertical dotted line) a background sequence coming from the inserted AluYa5 element (see scheme below the sequences). The sequence of a cloned long PCR product clearly shows the presence of the AluYa5 element within the NF1 exon 6 (4c).
Figure 2Amplification of the mutant allele containing a full-length L1 element inserted into exon 39 (30).
(A) Agarose gel with PCR products of exon 39 (30) generated with two different PCR conditions. PCR products were generated from a control individual (C) and the patient (P) harboring a full-length L1 element in exon 39 (30) (W = water control). The PCR conditions (short) used in the diagnostic laboratory generate only the 359-bp product from the wild type allele present in the control and the patient, whereas a >6000-bp PCR product derived from the mutant allele was amplified along with the wild type 359-bp PCR product using the Expand Long Template PCR system kit (Roche), buffer system 2 and 3 (long) (see Table S1). (B) Sequences of the PCR products from the patient generated with the PCR conditions “short” and “long”; the sequence of the long PCR product but not of the short PCR product of the patient shows starting at nucleotide position c.5607 (vertical dotted line), besides the main wild type sequence, minor traces of a sequence coming from the inserted L1 element (see scheme below the sequences). The presence of the L1 element within the NF1 exon 39 (30) was confirmed using a L1-specific PCR (primers: 39f and L1-5_39r). The entire 6021-bp sequence of this L1 (Ta-1d) element is deposited in Figure S5.
Sequences at the integration sites aligned to the L1 endonuclease cleavage consensus site.
| Integration sites | TSD | Orientation |
|
|
|
| 11 | sense | Y | I 14 (10c) |
|
| - | antisense | Y | I 10 (8) |
|
| 14 | sense | Ya5 | E 6 (4c) |
|
| 12 | antisense | Ya5 | E 21 (16) |
|
| 17 | antisense | Ya5 | E 21 (16) |
|
| 13 | antisense | Ya5 | E 22 (17) |
|
| 15 | sense | Ya5 | E 33 (25) |
|
| 16 | antisense | Ya5 | E 47 (38) |
|
| 10 | sense | Yb8 | E 12 (10a) |
|
| 17 | antisense | Yb8 | I 14 (10c) |
|
| 12 | antisense | Yb8 | E 21 (16) |
|
| 14 | sense | Yb8 | E 22 (17) |
|
| 15 | sense | Yb8 | E 33 (25) |
|
| 16 | antisense | Yb8 | I 48 (39) |
|
| 16 | sense | L1P1_orf2 | E 23 (18) |
|
| 13 | sense | LINE 1 | E 39 (30) |
|
| 116 | antisenseantisense | L1T(n∼120) | I 9 (7)E 25 (19b) |
AA-TTTT L1 endonuclease cleavage consensus site.
*This integration site cannot be unequivocally determined due to an associated 71-bp deletion and the lack of a TSD. The given integration site assumes either a model where the second nick occurred downstream instead of upstream to the first nick that affected the NF1 sense strand or a model where the reverse transcribed Alu cDNA strand (termed minus strand) invaded a double-strand break 71 bp downstream of the given integration site. Integration sites indicated with the same symbole §, # or ‡ are identical.
Figure 3Schematic presentation of the splicing effects caused by the identified Alu and L1 insertions.
The affected exon is indicated with a purple bar and the two flanking exons with blue bars. Intronic sequences are indicated by black lines. Red triangles denote the inserted Alu and L1 sequences and cryptic splice sites within an exon or the retrotransposon are indicated by a dotted line. The genomic sequence and the ensuing mRNA transcripts are depicted at the left and right hand site, respectively.