| Literature DB >> 29109480 |
Barun Pradhan1,2, Tatiana Cajuso1,3, Riku Katainen1,3, Päivi Sulo1,3, Tomas Tanskanen1,3, Outi Kilpivaara1,3, Esa Pitkänen1,3, Lauri A Aaltonen1,3,4, Liisa Kauppi5,6, Kimmo Palin7,8.
Abstract
Long interspersed nuclear elements-1 (L1s) are a large family of retrotransposons. Retrotransposons are repetitive sequences that are capable of autonomous mobility via a copy-and-paste mechanism. In most copy events, only the L1 sequence is inserted, however, they can also mobilize the flanking non-repetitive region by a process known as 3' transduction. L1 insertions can contribute to genome plasticity and cause potentially tumorigenic genomic instability. However, detecting the activity of a particular source L1 and identifying new insertions stemming from it is a challenging task with current methodological approaches. We developed a long-distance inverse PCR (LDI-PCR) based approach to monitor the mobility of active L1 elements based on their 3' transduction activity. LDI-PCR requires no prior knowledge of the insertion target region. By applying LDI-PCR in conjunction with Nanopore sequencing (Oxford Nanopore Technologies) on one L1 reported to be particularly active in human cancer genomes, we detected 14 out of 15 3' transductions previously identified by whole genome sequencing in two different colorectal tumour samples. In addition we discovered 25 novel highly subclonal insertions. Furthermore, the long sequencing reads produced by LDI-PCR/Nanopore sequencing enabled the identification of both the 5' and 3' junctions and revealed detailed insertion sequence information.Entities:
Mesh:
Year: 2017 PMID: 29109480 PMCID: PMC5673974 DOI: 10.1038/s41598-017-15076-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1LDI-PCR based method to detect the activity of a hot L1. (a) Schematic showing mobility of hot L1 from the TTC28 locus upon activation. (b) LDI-PCR to detect 3′ transduction arising from the L1 at TTC28: Schematic representation of a hypothetical TTC28 specific L1 retrotransposition including transduction of 3′ flanking region or the “unique tag” (=region between the canonical polyadenylation signal and an alternative polyadenylation signal downstream), into an unknown target locus. NsiI produces restriction fragments of two different sizes that are self-ligated to form a circular template. Upon LDI-PCR, an inverse primer pair directed at the unique tag produces a native product and an insertion-specific target product. In addition to NsiI, two further restriction enzymes (PstI and SacI) and primer pairs (not depicted here) were used; see Materials and Methods for details.
Figure 2Agarose gel image of the LDI-PCR products. LDI-PCR using three different inverse primer pairs and restriction enzymes on two colorectal tumours (c985T and c368T) and their matching normal (c985N and c368N) DNA samples. The “native” PCR product of expected size and several additional tumour-specific products representing different putative 3′ transduction targets were detected. (Sizes corresponding to native LDI-PCR products: PstI~6.3 kb, NsiI~10.2 kb, SacI~5.6 kb.) Digested/self-ligated blood genomic DNA sample of an unrelated individual was run in the lane labelled “b” and PCR without any template in “−”.
Insertions detected by LDI-PCR/Nanopore sequencing. Note that one insertion was predicted by WGS but not by LDI-PCR/Nanopore and not included in the table. TSM = target-site modification; TP = twin-priming; del. = deletion; dup. = duplication; Read count = Number of LDI-PCR.py reads supporting each insertion, if one insertion was detected by more than one enzyme or primer pair, only the reaction with higher number of reads was included; *includes internal duplication; **some sequence was missing.
| Sample | Target cordinates | L1 insertion targets | TSM | Size of TSM |
| Strand | TP | Insertion (bp) | Read count | Validated | WGS detected | WGS read count | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 5′ junction | 3′junction | start | end | |||||||||||
| c985T | 1:195769724 | 1:195769709 | intergenic | dup. | 16 | 29065826 | 29066121 | + | Present | 295 | 3984 | yes | yes | 15 |
| 4:93280482 | 4:93280454 |
| dup. | 28 | 29065650 | 29065893 | + | Absent | 243 | 4658 | no | yes | 20 | |
| 4:155900401 | 4:155900387 | intergenic | dup. | 14 | 29064889 | 29065912 | + | Present | 1015 | 1705 | no | yes | 9 | |
| 4:183051382 | 4:183051401 |
| del. | 18 | 29065306 | 29065448 | − | Absent | 142 | 10103 | no | yes | 8 | |
| 7:146783241 | 7:146783223 |
| dup. | 18 | 29065369 | 29065912 | − | Present | 536 | 19889 | no | yes | 29 | |
| 7:152661949 | 7:152661940 | intergenic | dup. | 9 | 29065283 | 29065887 | − | Present | 455 | 20107 | no | yes | 19 | |
| 12:33708291 | 12:33708277 | intergenic | dup. | 14 | 29065722 | 29066121 | − | Present | 377 | 16909 | no | yes | 20 | |
| 2:78612537 | 2:78612530 | intergenic | dup. | 8 | 29065453 | 29066118 | − | Present | 665 | 5 | no | no | 0 | |
| 3:99147126 | 3:99147111 | intergenic | dup. | 16 | 29065138 | 29066126 | − | Present | 984 | 47 | no | no | 0 | |
| 4:90987152 | 4:90987149 | intergenic | dup. | 4 | 29065287 | 29066121 | + | Absent | 834 | 99 | no | no | 0 | |
| 6:74978206 | 6:74978187 |
| dup. | 20 | 29065521 | 29066118 | + | Absent | 597 | 9 | no | no | 0 | |
| 8:111856478 | 8:111856457 | intergenic | dup. | 22 | 29064270 | 29066083 | − | Present | 857 | 6 | no | no | 0 | |
| 14:99070525 | 14:99070523 | intergenic | dup. | 3 | 29065558 | 29065912 | − | Absent | 354 | 55 | yes | no | 1 | |
| 16:26220799 | 16:26220798 | intergenic | dup. | 2 | 29065976 | 29066118 | + | Absent | 142 | 182 | no | no | 0 | |
| 16:5902138 | 16:5902158 |
| del. | 19 | 29065747 | 29065912 | − | Absent | 165 | 72 | no | no | 1 | |
| c368T | 1:115147190 | 1:115147187 |
| dup. | 4 | 29065647 | 29065893 | − | Absent | 246 | 12753 | no | yes | 18 |
| 2:182004540 | 2:182004515 |
| dup. | 26 | 29065285 | 29065886 | + | Present | 783* | 20217 | no | yes | 12 | |
| 2:229159082 | 2:229159075 | intergenic | dup. | 8 | 29065578 | 29065908 | − | Absent | 330 | 704 | no | yes | 16 | |
| 6:70787202 | 6:70787188 |
| dup. | 15 | 29065808 | 29066032** | NA | Present | >223 | 17007 | no | yes | 21 | |
| 6:133527459 | 6:133527443 | intergenic | dup. | 17 | 29065683 | 29065887 | − | Absent | 204 | 98 | no | yes | 9 | |
| 8:88681299 | 8:88681304 |
| del. | 4 | 29065471 | 29065887 | + | Absent | 416 | 1095 | no | yes | 6 | |
| 12:128116403 | 12:128116405 |
| del. | 1 | 29065730 | 29066121 | − | Absent | 391 | 15099 | no | yes | 10 | |
| 2:50947578 | 2:50947612 |
| del. | 33 | 29065849 | 29066032** | NA | NA | >183 | 98 | no | no | 0 | |
| 2:129889238 | 2:129889240 | intergenic | del. | 1 | 29065945 | 29066121 | − | Absent | 176 | 193 | no | no | 0 | |
| 4:44621421 | 4:44621515 | intergenic | del. | 93 | 29065305 | 29065912 | − | Absent | 607 | 33 | no | no | 0 | |
| 5:8665955 | 5:8665942 | intergenic | dup. | 14 | 29065849 | 29066091 | + | Absent | 242 | 451 | yes | no | 0 | |
| 5:83347372 | 5:83347360 |
| dup. | 13 | 29065762 | 29066121 | + | Present | 340 | 78 | no | no | 0 | |
| 5:119565858 | 5:119565843 | intergenic | dup. | 16 | 29065419 | 29065899 | + | Present | 455 | 14 | no | no | 0 | |
| 6:112763097 | 6:112763084 | intergenic | dup. | 14 | 29065288 | 29066118 | + | Present | 671 | 38 | no | no | 0 | |
| 7:152870668 | 7:152870685 | intergenic | del. | 16 | 29065491 | 29065912 | − | Present | 306 | 457 | yes | no | 0 | |
| 8:114925191 | 8:114925178 | intergenic | dup. | 14 | 29064763 | 29065887 | − | Absent | 1124 | 1020 | yes | no | 0 | |
| 8:107979180 | 8:107979169 | intergenic | dup. | 12 | 29064270 | 29066121 | − | Present | 1055 | 28 | no | no | 0 | |
| 10:101386670 | 10:101386662 | intergenic | dup. | 9 | 29065621 | 29065782** | NA | NA | >161 | 36 | no | no | 0 | |
| 10:107557372 | 10:107557435** | intergenic | NA | NA | 29065912 | 29066032** | NA | NA | >120 | 86 | no | no | 1 | |
| 12:33100097 | 12:33100084 | intergenic | dup. | 14 | 29065445 | 29066118 | − | Present | 670 | 248 | yes | no | 0 | |
| 14:79638932 | 14:79638931 |
| dup. | 2 | 29065974 | 29066121 | − | Absent | 147 | 172 | yes | no | 1 | |
| 18:1233989 | 18:1233975 | intergenic | dup. | 15 | 29065387 | 29066121 | − | Present | 735 | 117 | no | no | 0 | |
| X:108351909 | X:108351907 | intergenic | dup. | 3 | 29065437 | 29065893 | − | Absent | 456 | 119 | no | no | 0 | |
| Y:15633117 | Y:15633103 | intergenic | dup. | 15 | 29065877 | 29066121 | − | Absent | 244 | 1084 | yes | no | 1 | |
Figure 3Read counts for insertions called by LDI-PCR.py. On the left, boxplots of read counts for insertions that were either detected (n = 14) or undetected (n = 25) by WGS. On the right, boxplots of read counts for insertions undetected by WGS that were either validated (n = 7) or unvalidated (n = 18). For better visualization, data are presented on a base-10 log scale.
Figure 4Polyadenylation signal of choice for TTC28 L1 mediated 3′ transduction. 8 polyadenylation signals following the L1 canonical polyadenylation signal (no. 2–9) were present in the unique tag assessed for TTC28 L1 3′ transduction. Most of the 3′ transductions identified in this study terminated utilizing the 6th and 9th polyadenylation signal instead of the L1’s canonical polyadenylation signal (no. 1). The height of the orange bars indicates the frequency of L1 3′ transductions terminating at each polyadenylation signal; number of termination events is given above the bars.
Figure 5Extrapolated mechanism of L1 insertion at (a) GRID2 locus and (b) CNTNAP2 locus In both (a) and (b) (i) shows locations of staggered double-stranded cleavage in the target region with PolyT primer in the 3′ overhang and an internal primer (when used) in the 5′ overhang (b). (ii) Extrapolated L1 mRNA with the TTC28 3′ unique tag. Regions complementary to the polyT primer and internal primer at target site is highlighted by the same colour scheme. (iii) Schematic representation of the TPRT (a) and twin-priming (b) mechanism of TTC28 specific 3′ transduction in the GRID2 (a) and CNTNAP2 (b) locus. Figures are not drawn to scale.