Yaa-Jyuhn J Meir, Matthew T Weirauch, Herng-Shing Yang, Pei-Cheng Chung, Robert K Yu, Sareina C-Y Wu.
Abstract
BACKGROUND: DNA transposons have emerged as indispensable tools for manipulating vertebrate genomes, with applications ranging from insertional mutagenesis and transgenesis to gene therapy. To fully explore the potential of two highly active DNA transposons, piggyBac and Tol2, as mammalian genetic tools, we have conducted a side-by-side comparison of the two transposon systems in the same setting to evaluate their advantages and disadvantages for use in gene therapy and gene discovery.
Year: 2011 PMID: 21447194 PMCID: PMC3078864 DOI: 10.1186/1472-6750-11-28
Source DB: PubMed Journal: BMC Biotechnol ISSN: 1472-6750 Impact factor: 2.563
Figure 1. Comparison of transposition activity between the long and short versions of piggyBac and Tol2. A. Donor and helper constructs of piggyBac and Tol2 used for the comparison. The activator sequence with enhancer activity in D. melanogaster is underlined. B. Transposition activity of the long vs. short version of piggyBac and Tol2 in HEK 293.
Figure 2. Transposition activity of various engineered piggyBac and Tol2. A. A schematic representation of tagged piggyBac and Tol2 transposases. B. The activity comparison between the wild-type and various epitope-tagged Tol2 and piggyBac transposases. C. The enzymatic activity of Myc-piggyBac, measured with a fixed amount of pB-cassette3short (donor at 100 ng) co-transfected with increasing amounts of pCMV-Myc-piggyBac helpers (expressing the Myc-tagged piggyBac transposase) into HEK 293 cells. The lower panel depicts a Western blot indicating the expression level of the piggyBac transposase (detected by Myc antibody) and α-actin (detected by α-actin antibody) in the corresponding transfected cells. Western blotting was performed on protein extracts isolated from the remaining transfected cells of the same triplicate samples used for the colony formation assay shown above.
The data sets of piggyBac and Tol2 genome-wide target profiling in HEK 293
| Transposon | # of individual | successful rate in | # of individual | # of targets with a | # of targets mapped |
|---|---|---|---|---|---|
| | 164 | 86% (142/164) | 371 | 315 | 207 |
| | 114 | 91% (104/114) | 264 | 149 | 107 |
*: only the target sequences sharing ≥95% sequence identity with the corresponding human genome were selected
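The two calculations behind this table can be sketched in a few lines of Python. This is a minimal illustration, not the authors' pipeline: the function names are hypothetical, and percent identity is computed here over an ungapped, equal-length comparison, which is a simplifying assumption (the record does not say how the alignments were scored).

```python
def success_rate(recovered: int, total: int) -> float:
    """Percentage of clones that yielded targets, e.g. 142 of 164."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * recovered / total


def percent_identity(target: str, genome_hit: str) -> float:
    """Percent of matching positions between two equal-length sequences.

    Assumption: ungapped comparison; a real alignment tool would also
    handle insertions and deletions.
    """
    if len(target) != len(genome_hit) or not target:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(target.upper(), genome_hit.upper()))
    return 100.0 * matches / len(target)


def passes_identity_filter(target: str, genome_hit: str,
                           threshold: float = 95.0) -> bool:
    """Keep a target only if it shares at least `threshold` % identity."""
    return percent_identity(target, genome_hit) >= threshold


if __name__ == "__main__":
    # The table reports whole percentages; truncating reproduces them.
    print(int(success_rate(142, 164)), int(success_rate(104, 114)))
```

Truncating the two ratios gives the whole-number percentages listed in the table (142/164 and 104/114).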
Figure 3. Chromosome ideogram of target sites of piggyBac and Tol2. The red and green stars label the sites of piggyBac and Tol2 hotspots, respectively. Clusters (A-D) of Tol2 targets are circled in purple.
Figure 4. Preferential target sites of piggyBac and Tol2. A. The genome context of Tol2 and piggyBac target sites. *: both transposons tend to insert near CpG islands more often than expected by chance (Tol2, P < 10⁻⁹; piggyBac, P < 10⁻¹¹; see Methods). B. The gene density around Tol2 and piggyBac target sites. C. Distributions of piggyBac and Tol2 target sequences in various types of repeats. Every bar represents the percentage of target sites found in the type of repeats indicated. The bottom part of each bar represents the percentage of targets located within at least 100 bp to the 3' end of the repeats targeted.
Figure 5. In-depth analyses of piggyBac and Tol2. A. The sequence logos of piggyBac and Tol2 target sites. B. Two representative sequences flanking the sites repeatedly targeted by piggyBac. The TTAA tetranucleotides are underlined. C. A sequence alignment of four sequences on chromosome 16 that share 100% sequence identity with the first 100 bp of the piggyBac target B89-4. The residue that is different from the other three sequences at a given position is indicated in red. Dots represent positions at which all four sequences are identical. The numbers on the top indicate the relative position of residues. D. The sequence logo of 184 sequences that share at least 97% sequence identity with the piggyBac target B87-4. Note: The chromosomal sequences 5' and 3' to the target site are in lowercase and uppercase, respectively. The piggyBac transposon is inserted at the position between -1 and +1.
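As a rough illustration of what a sequence logo summarizes, the sketch below counts bases at each aligned position and converts the counts to information content in bits. It is an assumption-laden example rather than the method used for the figure: the function names are hypothetical, the input is only the five piggyBac hotspot sequences listed in the hotspot table of this record, and no small-sample correction is applied.

```python
import math
from collections import Counter

# piggyBac hotspot sequences copied from the hotspot table (position +1 onward).
PIGGYBAC_HOTSPOTS = [
    "TTAAATAAAGATAATAATACTAACCATGGCA",
    "TTAAAGACCCTGTCTCTTAAAAAAAAAAAAA",
    "TTAAATAAAAAAGAACAGATATTTGAAATTG",
    "TTAAATTCCAGGTTTCTCAAAGAAAGCTTGT",
    "TTAAAGAAACAAGTTAACACCGAAGCCAGAG",
]


def position_counts(seqs):
    """Count bases at each aligned position, up to the shortest sequence."""
    length = min(len(s) for s in seqs)
    return [Counter(s[i] for s in seqs) for i in range(length)]


def information_bits(counts):
    """Information content of one DNA position: 2 bits minus Shannon entropy."""
    total = sum(counts.values())
    entropy = -sum((n / total) * math.log2(n / total) for n in counts.values())
    return 2.0 - entropy


if __name__ == "__main__":
    columns = position_counts(PIGGYBAC_HOTSPOTS)
    for pos, counts in enumerate(columns[:6], start=1):
        print(pos, dict(counts), round(information_bits(counts), 2))
```

For these five sequences the first four positions are invariant (T, T, A, A), so each carries the maximum 2 bits, which is how the TTAA tetranucleotide would stand out in a logo.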
The piggyBac and Tol2 hotspots in the HEK 293 genome
| Transposon | Target ID | Targeted sequence | Times | Position | Gene context | Targeted gene | Near gene (distance bp) | Far gene (distance bp) |
|---|---|---|---|---|---|---|---|---|
| piggyBac | B87-5/B89-3 | TTAAATAAAGATAATAATACTAACCATGGCA | 2 | 3p14.3 | INTRONIC | FLNB | | |
| piggyBac | B89-4/B77-4 | TTAAAGACCCTGTCTCTTAAAAAAAAAAAAA | 2 | 16p11.2 | 3'UTR | MLAS | | |
| piggyBac | B38-4/B102-2 | TTAAATAAAAAAGAACAGATATTTGAAATTG | 2 | 14q23.3 | INTRONIC | GPHN | | |
| piggyBac | B71-1/B109-3 | TTAAATTCCAGGTTTCTCAAAGAAAGCTTGT | 2 | 20p12.3 | INTERGENIC | | BC043288 (219296) | BMP2 (347908) |
| piggyBac | B75-4/B92-1 | TTAAAGAAACAAGTTAACACCGAAGCCAGAG | 2 | 17q24.1 | INTERGENIC | | FLJ32065 (2686) | LRRC37A3 (101620) |
| Tol2 | T47-2/T48-3 | TGTTCCGCTCCTGGTGCGGGCCGAGACCCGG | 2 | 1q21.2 | INTRONIC | ZNF687 | | |
| Tol2 | 111-2/T119-1 | TATGTGTAATAATGGAGGTATGTACAACAT | 2 | 3p24.3 | INTERGENIC | | SGOL1 (385347) | HPX-42 (834010) |
| Tol2 | T14-3/T17-1 | AGAATAGGTATTTCTTTTTTTCTTCTTATC | 2 | 5q33.2 | INTERGENIC | | C5orf3 (87465) | GRIA1 (92385) |
| Tol2 | T2-1/T3-2 | TCCACCACAGCATGAGTTAAACCAAAGTCT | 2 | 7q11.23 | INTERGENIC | | POMZP3 (2680) | hPMSR6 (350693) |
| Tol2 | T1-3/T4-1 | CCTGCCCAGCTCGTAAAAGGATGCTCACCT | 2 | 8q22.3 | INTERGENIC | | GRHL2 (694) | NACAP1 (122197) |
| Tol2 | TB7-3/T113-1/T115-4/T120-2 | AATTTATCCATTTCTTCTAGATTTTCTAGT | 2 | 20p13 | INTRONIC | SIRPD | | |
The piggyBac and Tol2 targets located within the repetitive sequences of the HEK 293 genome
| Transposon | Representative target | Sequence (+1 ~ +30) | Position | Sequence identity: 100% | Sequence identity: 99.9%~97.0% | Sequence identity: total | Type of repeat | Times targeted |
|---|---|---|---|---|---|---|---|---|
| piggyBac | B89-4 | TTAAAGACCCTGTCTCTTAAAAAAAAAAAA | chr16(30140496) | 4 | 0 | 4 | NF | 2 |
| piggyBac | B87-4 | TTAAGAATGTTGAATATTGGCCCCCACTCT | chr3(55059477) | 1 | 510 | 511 | NF | 1 |
| piggyBac | B75-4 | TTAAAGAAACAAGTTAACACCGAAGCCAGA | chr17(60391785) | 1 | 1 | 2 | NF | 2 |
| piggyBac | B85-4 | TTAAAAAAGGCATTATTTTCGCAGCTATCT | chr6(125167100) | 0 | 2 | 2 | NF | 1 |
| piggyBac | B42-3 | TTAATTTACTTAAGATAATGGCCTCCACAC | chr22(15414966) | 5 | 3 | 8 | NF | 1 |
| piggyBac | B100-1 | TTAAGAAAGGAGTTGAATTAAGCTCAGGTT | chr1(120900969) | 0 | 2 | 2 | NF | 1 |
| piggyBac | B90-1 | TTAATATCCCACCTTTGCACAGTAGACAAT | chr3(137392502) | 2 | 0 | 2 | NF | 1 |
| piggyBac | B92-2 | TTAAACACACACTTAGAGGGAAATAATTCAT | chr18(11877795) | 0 | 2 | 2 | NF | 1 |
| piggyBac | B82-3 | TAAAGAATATAAGGCCAAGCACAGTGGCT | chr11(27402368) | 1 | 1 | 2 | NF | 1 |
| piggyBac | B89-3 | TTAAATAAAGATAATAATACTAACCATGGC | chr3(57972969) | 1 | 1 | 2 | NF | 1 |
| piggyBac | B92-1 | TTAAAGAAACAAGTTAACACCGAAGCCAGA | chr17(60391785) | 2 | 2 | 4 | NF | 1 |
| piggyBac | B77-4 | TTAAAGACCCTGTCTCTTAAAAAAAAAAAA | chr16(29401156) | 0 | 4 | 4 | NF | 1 |
| piggyBac | B79-2 | TTAAGGGGGGAAAACAGTTCAGGGCCAACA | chr14(55711916) | 1 | 1 | 2 | NF | 1 |
| piggyBac | B84-1 | TTAATGTTAAATTACAAACACTGTTTTATC | chr18(29434150) | 0 | 2 | 2 | NF | 1 |
| piggyBac | B85-1 | TTAAGCACAGTATCAGTGATAAAAATAGCT | chr1(201276407) | 1 | 1 | 2 | NF | 1 |
| piggyBac | B82-1 | TTAAGCTGAATCTGTTTTTCCCAGTGCCCC | chr2(231461932) | 0 | 2 | 2 | NF | 1 |
| Tol2 | T111-3 | TTTAAGAATGTTGAGTATTGGCCCCCACTC | chr8(9712455) | 1 | 8 | 9 | LINE | 1 |
| Tol2 | T147-1 | ATCCTGAGCAGCCGAATCTGCAATCATCTT | chr7(154665283) | 1 | 1 | 2 | LTR | 1 |
| Tol2 | T3-2 | TCCACCACAGCATGAGTTAAACCAAAGTCT | chr7(76097237) | 2 | 0 | 2 | LINE | 1 |
| Tol2 | T88-3 | GTCTGTACTGCTGCAAAGCTTCACAGACAG | chr10(98485313) | 1 | 2 | 3 | NF | 1 |
| Tol2 | T115-4 | AATTTATCCATTTCTTCTAGATTTTCTAGT | chr20(1472025) | 0 | 3 | 3 | LINE | 4 |
| Tol2 | T103-2 | GGCGCCCGCCACTACGCCTGGCTAATTTTT | chr21(13847935) | 1 | 3 | 4 | SINE | 1 |
| Tol2 | T162-3 | CCAGAGACCTTTGTTCACTTGTTTATCTGC | chr20(33206406) | 202 | ND | > 202 | Low_complexity | 1 |
| Tol2 | T157-1 | CTCGTACGTAAGTTTTAGTGTGAACATATA | chr4(48956277) | 4 | 3 | 7 | LINE | 1 |
| Tol2 | T157-2 | GTTAACAGTGACCTATTTGGGAGAAGGGGA | chr7(66290199) | 1 | 1 | 2 | LINE | 1 |
| Tol2 | T107-2 | AATATATGAGTAGCTAAACAACTCTATAAG | chr2(97613844) | 1 | 1 | 2 | LINE | 1 |
| Tol2 | T104-1 | GAACACATGGACACAGGAAGGGGAACATCA | chr3(134171282) | 1 | 52 | 53 | LINE | 1 |
| Tol2 | T25-3 | ACCCCATCTCTACTAAAAATACAAAAAATT | chr6(166733349) | 1 | 1 | 2 | SINE | 1 |
Figure 6. The activity of genes which are close to or located at the sites repeatedly targeted by piggyBac and Tol2. The upper panel is a histogram showing the ratio of gene expression level between the housekeeping gene GAPDH and the gene of interest, which is either repeatedly targeted by piggyBac or Tol2 or located within a 10-kb interval of a piggyBac or Tol2 hotspot, as measured by Q-RT-PCR. A set of neural genes (MK-1, NRGN, and SYGR4) whose expression in HEK 293 cells ranges from high to none serves as a reference. The lower panel is a DNA-agarose gel image of a representative Q-RT-PCR reaction showing the PCR products at the end of the 30th cycle. Genes targeted repeatedly by piggyBac: FLNB, GPHN, and MLAS. Genes near the piggyBac hotspot: FLJ32065. Genes targeted repeatedly by Tol2: SIRPD and ZNF687. Genes near the Tol2 hotspot: POMZP3 and GRHL2.
Figure 7. A risk evaluation of piggyBac and Tol2. The histogram shows the percentage of piggyBac or Tol2 targets located within, or within a defined distance of, cancer-related genes.
A list of cancer-related genes targeted by piggyBac or Tol2
| Transposon | Target ID | Sequence | Targeted cancer gene | Annotation |
|---|---|---|---|---|
| piggyBac | B102-2 | TTAAATAAAAAAGAACAGATATTTGAAATTGGCTGTTG | GPHN | gephyrin |
| piggyBac | B107-3 | TTAATGATTCTTTCCATTTCTTTTATTCTTTTCCTAGC | HHEX | hematopoietically expressed homeobox |
| piggyBac | B27-3 | TTAATAGAAAGGAAGGGACCATGTTTAACATAAATGCT | POU6F2 | POU class 6 homeobox 2 |
| piggyBac | B38-4 | TTAAATAAAAAAGAACAGATATTTGAAATTGGCTGTTG | GPHN | gephyrin |
| piggyBac | B63-1 | TTAAGTTTTCAGTGGCTGAAAGTTGGCAGTCTGAAAAA | ARF4 | ADP-ribosylation factor 4 |
| piggyBac | B81-1 | TTAAGTGCTTTTGGCTGTTTTCCCAAACATCCAGACAT | SMAD5 | SMAD family member 5 |
| Tol2 | T12-2 | GGTAGGAGTTATCTGAGTCAGGCCTGCCCTTGGCTTGG | SPECC1 | cytospin B |
| Tol2 | T121-1 | CTCCTGGGTGACCCTCGCCTGAGCCTCCTGGCCCTTCC | RAB40B | RAB40B, member RAS oncogene family |
| Tol2 | T102-4 | GACACAAACACACACATGCTATACCTTTGTATTACACT | TCF4 | transcription factor 4 |
| Tol2 | T124-4 | GCGGCTGTCCTCCAGCAACAGGTGCACATTCCCGGGCT | TNK1 | tyrosine kinase, non-receptor |
| Tol2 | T130-2 | CAAATAAATGAATGTTATGAATTTTTGAGGGTAGGAAA | SEMA3C | sema domain, immunoglobulin domain (Ig), (semaphorin) 3C precursor |
| Tol2 | T137-3 | AAAGAGAGGCCCAATCCTGTGGAGTGAGTCACTGGGGG | ALPL | alkaline phosphatase, liver/bone/kidney |
| Tol2 | T137-4 | ATTTTCTGTCTGCTCTTTGGTCACTTCCCATTCTTTTT | PARD3 | par-3 partitioning defective 3 homolog (C. elegans) |
| Tol2 | T157-4 | TTTATTTGTCCTGCACTTATGAAGCATAGTTTGGCAGG | PGR | progesterone receptor |
| Tol2 | T165-1 | GAAACCGGCGAAAAGGTTAGCTGTCGCTGGCTAGTATT | RASSF3 | Ras association (RalGDS/AF-6) domain family member 3 |
| Tol2 | T22-1 | TTCCTAAGCTACAATAAACCACATATGAAAAACTAAAG | HD | huntingtin |
| Tol2 | T26-1 | AGCCTGAGTAAAATAGTGAGACTCTGTTTCTGCAAAAC | LRP1B | low density lipoprotein-related protein 1B |
| Tol2 | T43-3 | TCTGGAAGGTGAGGCAGACGTGCCCACCGCCTCCATGC | HLXB9 | motor neuron and pancreas homeobox 1 |
| Tol2 | T91-4 | TTCAGGGGGTGTGTTGGAGGGGAATCGCCGGCCTGCCT | IHH | Indian hedgehog homolog (Drosophila) |
| Tol2 | TB7-5 | AATTTATCCATTTCTTCTAGATTTTCTAGTTTATTCGC | FKBP1A | FK506 binding protein 1A, 12kDa |
| Tol2 | TB70-1 | GTGCACACACTCACTCTCTCTTTCTCCTTCAGATAATA | FOXP1 | forkhead box P1 |
| Tol2 | TB77-2 | CCCCTCACCCTCGGACCCTTCACCGCGACCCCCGCGCC | RANBP9 | RAN binding protein 9 |
| Tol2 | TB81-1 | TGCAGTACAGTGCGGGGGGAAAAAAACAACAGCAAAAG | EGF | epidermal growth factor (beta-urogastrone) |