| Literature DB >> 23961915 |
Yi Cao1, Bing Rui, Dianne L Wellems, Mingxing Li, Biaobang Chen, Dongmei Zhang, Weiqing Pan.
Abstract
BACKGROUND: The piggyBac transposon system provides a powerful forward genetics tool to study gene function in Plasmodium parasites via random insertion mutagenesis and phenotypic screening. The identification of genotype of piggyBac mutants in the Plasmodium genome is thus an indispensable step in forward genetic analysis. Several PCR-based approaches have been used to identify the piggyBac insertion sites in Plasmodium falciparum and Plasmodium berghei, but all are tedious and inefficient. Next generation sequencing can produce large amounts of sequence data and is particularly suitable for genome-wide association studies. In this study, the Next generation sequencing technology was employed to efficiently identify piggyBac insertion sites in the genome of P. berghei.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23961915 PMCID: PMC3765144 DOI: 10.1186/1475-2875-12-287
Source DB: PubMed Journal: Malar J ISSN: 1475-2875 Impact factor: 2.979
Figure 1The helper plasmid and donor plasmids for co-transfection of parasites. The helper plasmid (left) contains the transposase gene under control of the constitutive eef1aa 5′UTR and the dhfr/ts 3′UTR for transient transposase expression. The two donor plasmids (right) contain the Pb dhfr/ts selectable marker and gfp fusion gene expression cassette under the control of constitutive dhfr/ts 5′UTR or eef1aa 5′UTR. The expression cassette is flanked by the piggyBac inverted terminal repeats (ITR).
Figure 2Diagram depicting the two methods of detecting insertion sites in genome with Illumina sequencing data. (A) The ‘BLAST method’ is based on BLAST searching with the 13-bp end sequences of piggyBac ITRs in the Illumina reads. All the Illumina sequencing reads were regarded as single-end reads. Each of single reads can be located in P. berghei chromosomes, or in the piggyBac transposable fragments, or at the junctions between the piggyBac inserts and the flanking genomic sequences. Only the reads overlapping the junctions can be used to identify the exact sites of piggyBac insertions. The example read comes from piggyBac insertion at 52719 of chromosome 1. The workflow of ‘BLAST method’ is shown at the right panel. (B) The ‘SOAP method’ is based on aligning the Illumina sequencing reads respectively to the piggyBac transposable fragments and P. berghei reference sequences using the SOAP software. The Illumina reads were sequenced with the paired-end module. Both of the paired-end reads can be located in the P. berghei chromosomes or in the piggyBac transposable fragments, or alternatively, one of the paired-end reads can lie in the piggyBac transposable fragments while the other of the paired-end reads can lie in the flanking genomic sequences. The latter can be used to identify the sites of piggyBac insertions. The workflow of ‘SOAP method’ is shown at the right panel.
The results of mapping the Illumina sequencing reads to the reference genome using the SOAPaligner program
| 9,771,546 | 879,439,140 | 82.83 | 9,363,770 | 407,776 | |
| 1,343,876 | 120,948,840 | 11.39 | 1,163,314 | 180,562 |
ANKA genome coverage by chromosome
| 1 | 475,058 | 474,159 | 99.81 | 46.5 |
| 2 | 636,358 | 634,527 | 99.71 | 51.4 |
| 3 | 587,073 | 586,764 | 99.95 | 44.9 |
| 4 | 724,231 | 721,086 | 99.57 | 46.1 |
| 5 | 917,344 | 916,957 | 99.96 | 50.2 |
| 6 | 918,417 | 917,782 | 99.93 | 48.5 |
| 7 | 810,351 | 801,856 | 98.95 | 46.4 |
| 8 | 1,354,067 | 1,353,940 | 99.99 | 46.7 |
| 9 | 1,632,101 | 1,629,300 | 99.83 | 47.9 |
| 10 | 1,579,605 | 1,579,521 | 99.99 | 47.5 |
| 11 | 1,719,070 | 1,717,503 | 99.91 | 47.2 |
| 12 | 1,764,385 | 1,763,706 | 99.96 | 46.8 |
| 13 | 2,492,531 | 2,492,086 | 99.98 | 46.5 |
| 14 | 2,448,164 | 2,447,394 | 99.97 | 46.9 |
| berg_bin | 339,395 | 327,701 | 96.55 | 76.6 |
| genome level | 18,398,150 | 18,364,282 | 99.82 | 47.8 |
* % of nucleotides sequenced by ≥1 reads.
Figure 3Distribution of sequence coverage across the available reference genome of ANKA. (A) The percentage of the available genome plotted against the depth of genome base coverage. (B) The accumulated percentage of the available genome plotted against the depth of genome base coverage.
The 29 genomic insertion sites identified by the ‘SOAP method’
| berg03 | 141655 ± (200 ~ 400 bp) | 1 |
| berg04 | 138106 ± (200 ~ 400 bp) | 1 |
| berg04 | 353862 ± (200 ~ 400 bp) | 1 |
| berg04 | 374523 ± (200 ~ 400 bp) | 1 |
| berg04 | 679800 ± (200 ~ 400 bp) | 1 |
| berg05 | 272425 ± (200 ~ 400 bp) | 1 |
| berg04 | 858235 ± (200 ~ 400 bp) | 1 |
| berg06 | 534191 ± (200 ~ 400 bp) | 1 |
| berg07 | 247428–247493 (65 bp) | 6 |
| berg07 | 653302–653595 (293 bp) | 20 |
| berg08 | 21190 ± (200 ~ 400 bp) | 1 |
| berg08 | 1143631 ± (200 ~ 400 bp) | 1 |
| berg08 | 1229311 ± (200 ~ 400 bp) | 1 |
| berg09 | 1385754 ± (200 ~ 400 bp) | 1 |
| berg10 | 859354 ± (200 ~ 400 bp) | 1 |
| berg11 | 396596–396660 (64 bp) | 3 |
| berg11 | 1132095 ± (200 ~ 400 bp) | 1 |
| berg11 | 1236263–1236275 (12 bp) | 22 |
| berg12 | 137149–137213 (64 bp) | 4 |
| berg12 | 1663662 ± (200 ~ 400 bp) | 2 |
| berg13 | 307464 ± (200 ~ 400 bp) | 2 |
| berg13 | 1325560 ± (200 ~ 400 bp) | 1 |
| berg13 | 1819639 ± (200 ~ 400 bp) | 1 |
| berg13 | 2109829 ± (200 ~ 400 bp) | 1 |
| berg13 | 2135273 ± (200 ~ 400 bp) | 1 |
| berg14 | 906574–906725 (151 bp) | 3 |
| berg14 | 190567 ± (200 ~ 400 bp) | 1 |
| berg14 | 1963049 ± (200 ~ 400 bp) | 1 |
| berg14 | 2223862 ± (200 ~ 400 bp) | 1 |
Figure 4PCR amplification of all insertion sites identified by the ‘BLAST method’, the ‘SOAP method’ and the ‘iPCR method’. Half-nested PCR products were amplified by each of site-specific primers and a pair of primers located in either of the piggyBac ITR arms using the genomic DNA of post-transfection parasites as templates. The number of each lane coincides with the number shown in Additional file 1; L = DNA ladder.
Distribution of the identified SNPs in the ANKA genome
| berg01 | 12 | 11 | 23 |
| berg02 | 5 | 196 | 201 |
| berg03 | 1 | 22 | 23 |
| berg04 | 9 | 141 | 150 |
| berg05 | 11 | 36 | 47 |
| berg06 | 1 | 66 | 67 |
| berg07 | 17 | 114 | 131 |
| berg08 | 3 | 107 | 110 |
| berg09 | 10 | 73 | 83 |
| berg10 | 3 | 71 | 74 |
| berg11 | 6 | 151 | 157 |
| berg12 | 4 | 156 | 160 |
| berg13 | 8 | 91 | 99 |
| berg14 | 5 | 186 | 191 |
| berg_bin | 46 | 288 | 334 |
| SNPs in 14 chrs | 95 | 1421 | 1516 |
| All SNPs: | 141 | 1709 | 1850 |