| Literature DB >> 23613931 |
Shigeharu Sato1, Abdul K Sesay, Anthony A Holder.
Abstract
The apicoplast, a non-photosynthetic plastid of apicomplexan species, has an extremely reduced but highly conserved genome. Here, the apicoplast genome of the rodent malaria parasite Plasmodium chabaudi chabaudi (Pcc) isolate CB was characterized. Although the set of genes in the genome is identical, the copy number of some tRNA genes differs between Pcc and other Plasmodium species because the Pcc DNA has only one rRNA/tRNA gene cluster, which is normally duplicated in other species. The location of the duplicated trnR(ACG) and trnM implies that one of the duplicated clusters in the ancestral molecule has been lost due to an intramolecular recombination event. The Pcc DNA occurs in two isoforms with an internal inversion between them. The presence of a unique variant in the duplicated trnT gene suggests that the two isoforms are interconvertible. This is the first report of the complete nucleotide sequence of a Plasmodium apicoplast DNA.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23613931 PMCID: PMC3627918 DOI: 10.1371/journal.pone.0061778
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Codon usage in the plastid genome of Plasmodium chabaudi chabaudi isolates and Plasmodium falciparum.
| Number of codons | ||||||
| Amino acid | 1 letter code | Codon | P.c.c. AS | P. c. c. CB | P. f. C10 | Decoding tRNA |
| – | – | TAA | 23 | 22 | 24 | – |
| – | – | TAG | 0 | 1 | 0 | – |
| – | – | TGA | 8 | 8 | 7 | – |
| Ala | A | GCA | 21 | 21 | 16 | tRNA-Ala(ACC) |
| Ala | A | GCC | 1 | 1 | 1 | tRNA-Ala(ACC) |
| Ala | A | GCG | 4 | 4 | 0 | tRNA-Ala(ACC) |
| Ala | A | GCT | 37 | 37 | 41 | tRNA-Ala(ACC) |
| Cys | C | TGC | 2 | 5 | 1 | tRNA-Cys(GCA) |
| Cys | C | TGT | 84 | 81 | 82 | tRNA-Cys(GCA) |
| Asp | D | GAC | 11 | 10 | 3 | tRNA-Asp(GUC) |
| Asp | D | GAT | 126 | 125 | 127 | tRNA-Asp(GUC) |
| Glu | E | GAA | 134 | 134 | 145 | tRNA-Glu(UUC) |
| Glu | E | GAG | 11 | 11 | 8 | tRNA-Glu(UUC) |
| Phe | F | TTC | 4 | 4 | 5 | tRNA-Phe(GAA) |
| Phe | F | TTT | 429 | 428 | 457 | tRNA-Phe(GAA) |
| Gly | G | GGA | 83 | 84 | 85 | tRNA-Gly(UCC) |
| Gly | G | GGC | 0 | 0 | 0 | tRNA-Gly(ACC) |
| Gly | G | GGG | 17 | 16 | 13 | tRNA-Gly(UCC) |
| Gly | G | GGT | 116 | 116 | 120 | tRNA-Gly(ACC) |
| His | H | CAC | 3 | 3 | 0 | tRNA-His(GUG) |
| His | H | CAT | 58 | 58 | 70 | tRNA-His(GUG) |
| Ile | I | ATA | 662 | 660 | 758 | tRNA-Ile(GAU) |
| Ile | I | ATC | 8 | 9 | 8 | tRNA-Ile(GAU) |
| Ile | I | ATT | 584 | 585 | 520 | tRNA-Ile(GAU) |
| Lys | K | AAA | 888 | 883 | 916 | tRNA-Lys(UUU) |
| Lys | K | AAG | 47 | 48 | 53 | tRNA-Lys(UUU) |
| Leu | L | CTA | 7 | 6 | 11 | tRNA-Leu(UAG) |
| Leu | L | CTC | 0 | 0 | 0 | tRNA-Leu(UAG) |
| Leu | L | CTG | 0 | 0 | 0 | tRNA-Leu(UAG) |
| Leu | L | CTT | 9 | 9 | 4 | tRNA-Leu(UAG) |
| Leu | L | TTA | 898 | 900 | 886 | tRNA-Leu(UAA) |
| Leu | L | TTG | 31 | 29 | 19 | tRNA-Leu(UAA) |
| Met (initial) | M | ATG | 31 | 31 | 31 | tRNA-fMet(CAU) |
| Met (internal) | M | ATG | 47 | 46 | 55 | tRNA-Met(CAU) |
| Asn | N | AAC | 20 | 21 | 7 | tRNA-Asn(GUU) |
| Asn | N | AAT | 1071 | 1066 | 1116 | tRNA-Asn(GUU) |
| Pro | P | CCA | 26 | 26 | 27 | tRNA-Pro(UGG) |
| Pro | P | CCC | 1 | 1 | 1 | tRNA-Pro(UGG) |
| Pro | P | CCG | 0 | 0 | 1 | tRNA-Pro(UGG) |
| Pro | P | CCT | 77 | 77 | 76 | tRNA-Pro(UGG) |
| Gln | Q | CAA | 113 | 113 | 118 | tRNA-Gln(UUG) |
| Gln | Q | CAG | 6 | 6 | 9 | tRNA-Gln(UUG) |
| Arg | R | AGA | 91 | 91 | 85 | tRNA-Arg(UCU) |
| Arg | R | AGG | 3 | 3 | 3 | tRNA-Arg(UCU) |
| Arg | R | CGA | 0 | 0 | 0 | tRNA-Arg(ACG) |
| Arg | R | CGC | 0 | 0 | 0 | tRNA-Arg(ACG) |
| Arg | R | CGG | 0 | 0 | 0 | tRNA-Arg(ACG) |
| Arg | R | CGT | 12 | 12 | 14 | tRNA-Arg(ACG) |
| Ser | S | AGC | 5 | 5 | 0 | tRNA-Ser(GCU) |
| Ser | S | AGT | 107 | 107 | 107 | tRNA-Ser(GCU) |
| Ser | S | TCA | 103 | 102 | 98 | tRNA-Ser(UGA) |
| Ser | S | TCC | 7 | 7 | 4 | tRNA-Ser(UGA) |
| Ser | S | TCG | 1 | 2 | 3 | tRNA-Ser(UGA) |
| Ser | S | TCT | 140 | 140 | 132 | tRNA-Ser(UGA) |
| Thr | T | ACA | 111 | 111 | 101 | tRNA-Thr(UGU) |
| Thr | T | ACC | 0 | 0 | 1 | tRNA-Thr(UGU) |
| Thr | T | ACG | 3 | 2 | 2 | tRNA-Thr(UGU) |
| Thr | T | ACT | 109 | 109 | 106 | tRNA-Thr(UGU) |
| Val | V | GTA | 79 | 78 | 79 | tRNA-Val(UAC) |
| Val | V | GTC | 1 | 1 | 0 | tRNA-Val(UAC) |
| Val | V | GTG | 7 | 7 | 1 | tRNA-Val(UAC) |
| Val | V | GTT | 68 | 68 | 48 | tRNA-Val(UAC) |
| Trp | W | TGG | 30 | 30 | 29 | tRNA-Trp(CCA) |
| Tyr | Y | TAC | 9 | 9 | 8 | tRNA-Tyr(GUA) |
| Tyr | Y | TAT | 807 | 806 | 777 | tRNA-Tyr(GUA) |
| Sum | 7391 | 7375 | 7419 | |||
| Total nt | 22173 | 22125 | 22257 | |||
The sum for each codon species present in all the protein coding sequences annotated in the apicoplast DNA.
AB649423.1 [7].
HF563595/HF563596 (this work).
X95275.2/X95276.2 [6].
Proposed tRNA species decoding the codon [10].
Termination codon.
It is not certain if tRNA-Ile(GAU) can decode ATC codons to Ile.
Alignment of HTS reads of Plasmodium chabaudi chabaudi isolate CB on available sequence data of P. c. c. isolate AS and a mouse single-copy gene.
| Kind | Reference sequence | Length (bp) | Read count | Average coverage (x) | Ratio |
|
| |||||
| chab01 | 581723 | 140320 | 23.88 | 1.00 | |
| chab02 | 498758 | 121750 | 24.18 | 1.01 | |
| chab03 | 624158 | 145688 | 23.07 | 0.97 | |
| chab04 | 794008 | 182819 | 22.78 | 0.95 | |
| chab05 | 935796 | 227947 | 24.11 | 1.01 | |
| chab06 | 916956 | 218714 | 23.60 | 0.99 | |
| chab07 | 1169068 | 285453 | 24.16 | 1.01 | |
| chab08 | 1360204 | 329661 | 24.00 | 1.00 | |
| chab09 | 1345096 | 328190 | 24.16 | 1.01 | |
| chab10 | 1634725 | 397118 | 24.07 | 1.01 | |
| chab11 | 1720574 | 415441 | 23.92 | 1.00 | |
| chab12 | 1682238 | 411364 | 24.22 | 1.01 | |
| chab13 | 2616835 | 632816 | 23.95 | 1.00 | |
| chab14 | 2514833 | 615891 | 24.27 | 1.02 | |
| PCHAS_MIT | 5949 | 12531 | 209.02 | 8.75 | |
|
| |||||
| AB649423 | 29198 | 14949 | 50.97 | 2.13 | |
| Mouse single-copy gene | |||||
| Cela2a | 11041 | 32 | 0.28 | 0.01 | |
Ratio of the average coverage to the average of 14 chromosomal contigs (chab01-14).
These preliminary genomic contigs (dated December 2012) were produced by the Wellcome Trust Sanger Institute and can be obtained from ftp://ftp.sanger.ac.uk/pub/pathogens/Plasmodium/chabaudi.
Plasmodium chabaudi apicoplast DNA, partial sequence [7].
The genomic locus specifying chymotrypsin-like elastase family, member 2A. GRCm38.p1 C57BL/6J Chromosome: 4; NC_000070.6 (141814963.141826003, complement).
Variations identified in the plastid DNA sequence between Plasmodium chabaudi chabaudi isolates CB and AS.
| Reference | CB Consensus | Reads | Gene | ||||||
| Position | Residue | Residue | Type | Coverage | Counts | Frequencies (%) | Name | Nucleotide | Amino acid |
| 28 | C | C/T | (SNV) | 20 | 18/2 | 90.0/10.0 | trnT(UGU) | (C28T) | – |
| 2121 | A | T | SNV | 54 | 54 | 100.0 | rpl4 | A324T | (Ile108; synomymous) |
| 2206 | T | A | SNV | 36 | 35 | 97.2 | rpl4 | T409A | Tyr137Asn |
| 2256 | T | C | SNV | 38 | 37 | 97.4 | rpl4 | T459C | (Asn153; synomymous) |
| 2584 | C | A | SNV | 44 | 44 | 100.0 | rpl23 | C180A | Asp60Glu |
| 3039 | A | G | SNV | 29 | 29 | 100.0 | rpl2 | A423G | (Ser141; synomymous) |
| 3751 | C | T | SNV | 38 | 38 | 100.0 | rps3 | C97T | (L33; synomymous) |
| 5516 | C | T | SNV | 27 | 27 | 100.0 | rps8 | C225T | (Asn75; synomymous) |
| 6535.6537 | AAT | – | Deletion | 43 | 43 | 100.0 | rps5 | AAT(301.303) deletion | Asn101 deletion |
| 6741 | T | C | SNV | 51 | 51 | 100.0 | rps5 | T507C | (Tyr169; synomymous) |
| 6822 | T | C | SNV | 47 | 47 | 100.0 | rps5 | T588C | (Cys196; synomymous) |
| 7209 | A | – | Deletion | 56 | 55 | 98.2 | ORF91 | A241 deletion | Ile81-Met92 deletion |
| 8829 | T | C | SNV | 55 | 55 | 100.0 | tufA | T246C | (Cys82;synomymous) |
| 8835 | G | A | SNV | 55 | 55 | 100.0 | tufA | G252A | (Gly84; synomymous) |
| (10214) | – | T | Insertion | 54 | 54 | 100.0 | trnG(ACC) | TT(11.12)TTT | – |
| 10489 | G | A | SNV | 46 | 46 | 100.0 | ORF129 | G108A | (L36; synomymous) |
| 11639 | T | C | SNV | 51 | 51 | 100.0 | clpC | T876C | (Ile292; synomymous) |
| 12566 | T | C | SNV | 59 | 59 | 100.0 | clpC | T1803C | (Cys601; synomymous) |
| 12932 | C | T | SNV | 53 | 53 | 100.0 | clpC | C2169T | (Asn723; synomymous) |
| 14009 | T | C | SNV | 56 | 56 | 100.0 | rps2 | A474G | (Lys158;synomymous) |
| 15593 | G | A | SNV | 38 | 38 | 100.0 | rpoC2B | C183T | (Tyr61; synomymous) |
| 15692 | C | A | SNV | 28 | 28 | 100.0 | rpoC2B | G84T | Leu28Phe |
| 15719.15727 | TATTTTATT | – | Deletion | 26 | 26 | 100.0 | rpoC2B | AATAAAATA(49.57) deletion | Asn17-Ile19 deletion |
| 16721 | C | T | SNV | 62 | 62 | 100.0 | rpoC2A | G673A | Val225Ile |
| 16908 | A | G | SNV | 40 | 40 | 100.0 | rpoC2A | T486C | (Asn162;synomymous) |
| 17277 | A | G | SNV | 48 | 47 | 97.9 | rpoC2A | T117C | (Phe39; synomymous) |
| 17534 | T | A | SNV | 48 | 48 | 100.0 | rpoC1 | A1064T | Asn535Ile |
| 18892 | A | G | SNV | 40 | 40 | 100.0 | rpoC1 | T246C | (Asn82; synomymous) |
| 22427 | A | G | SNV | 32 | 32 | 100.0 | ORF101 | T84C | (Tyr13; synomymous) |
| 24166 | G | A/G | (SNV) | 99 | 56/43 | 56.6/43.4 | trnT(UGU) | (C28T) | – |
| 26918 | T | T/G | (SNV) | 58 | 41/17 | 70.1/29.3 | (trnM>rrl) | – | – |
| 26920 | T | T/− | (Deletion) | 64 | 46/18 | 71.9/28.1 | (trnM>rrl) | – | – |
| 26928 | A | A/T | (SNV) | 61 | 43/18 | 70.5/29.5 | (trnM>rrl) | – | – |
| 26931 | T | T/A | (SNV) | 63 | 45/18 | 71.4/28.6 | (trnM>rrl) | – | – |
| 27032 | A | A/G | (SNV) | 72 | 55/17 | 76.4/23.6 | (trnM<>trnV) | – | – |
| 27279 | A | G | SNV | 41 | 39 | 95.1 | trnL(UAG) | A26G | – |
| 27873 | T | – | Deletion | 63 | 63 | 100.0 | rrs | T314 deletion | – |
| 28345 | T | A | SNV | 42 | 42 | 100.0 | rrs | T786A | – |
| 28346 | A | T | SNV | 42 | 42 | 100.0 | rrs | A787T | – |
| 28347 | A | T | SNV | 41 | 41 | 100.0 | rrs | A788T | – |
| 28348 | T | A | SNV | 39 | 39 | 100.0 | rrs | T789A | – |
| 29013 | T | C | SNV | 47 | 47 | 100.0 | (rrs>trnI) | – | – |
AB649423.1 [7].
–: gap. For variation that was not unique, the two most major variations are given at each side of/.
SNV: single nucleotide variation. Variation that was not unique is shown in parenthesis.
Name of the gene containing each variation. For variation in the non-coding region, nearest genes are shown in parenthesis.
Position in the coding sequence. Position is that in the AS sequence, and those which were not unique are given in parenthesis. –: a variation in a non-coding region.
Change in the amino acid sequence of the CB version caused by each variation. Position is that in the AS sequence. Synonymous variations are shown in parenthesis. –: a variation outside a protein coding sequence.
Figure 1The rRNA/tRNA gene cluster of Plasmodium chabaudi chabaudi CB plastid DNA.
(A) Part of the Pcc CB plastid DNA was amplified by PCR from total parasite DNA with the primers 1095 and 1096 (see Table 4 for details of each primer). The product was fractionated on an agarose gel along with DNA size markers (M). DNA that appeared as a single band (*) was collected and analyzed further. (B) Alignment of trace data obtained for the 6 kb PCR product by PCR-direct sequencing with primers that anneal at an end (1095, 1096) or an internal position (1005, 853). Three parts of the alignment are presented in boxes with arrows representing the direction of the sequencing reaction starting from each primer; the name of an encoded gene is given with a horizontal arrow representing the direction of transcription. Non-coding sequence and ambiguous sequencing data due to the presence of two different sequences are highlighted with blue and red, respectively. Because rps4 and sufB share an identical sequence from the 1st to the 8th nucleotide of their coding sequence, the highlighted region in red starts at the 9th residue in the sequencing data from 1005 and 853. The C/T transition at position 28 of trnT, which is clearly identifiable in the sequencing data analyzed from inside (1005/853) but not in those from outside (1095/1096), is indicated with a vertical arrow. (C and D) Schematic representation of the PCR product. Selected genes in the region including the gene of the unusual tRNA-Arg(ACG) (trnR(ACG)*) and the 5' truncated rrl ('rrl) are indicated with color-coded thick arrows. The PCR product (horizontal thick black bar) amplified with primers 1095 and 1096 (red arrows) was a mixture of two DNA species. Because of the coexistence of two different types of molecule, the quality of sequencing data obtained (blue arrows) abruptly dropped at the end of the short IR sequence (highlighted with pink background) and gave a mixture of two sequences (dotted arrow). The sequence of two trnT genes (trnT-1 and -2) of Pcc CB is almost the same except for the variation at their 28th residue (T and C; circled). Each trnT is linked with its upstream gene but not with its downstream gene. Therefore the 28th residue of the gene was a mixture of C and T when the PCR product was sequenced from the outside toward the inside (C), but the residue was uniquely identified as either C or T when the same sample was sequenced from inside to outside (D). The length of each gene is not to scale with the others in this figure.
Oligonucleotides used for PCR amplification and sequencing.
| Name | Nucleotide sequence (5' >3') | Target | Strand |
| 853 |
|
| sense |
| 1005 |
|
| sense |
| 1095 |
|
| antisense |
| 1096 |
|
| antisense |
Gene containing the annealing site.
Of the target gene.
Figure 2Unique forms of the plastid DNA of Plasmodium chabaudi chabaudi CB and their proposed origin.
Schematic diagram of the two forms of the plastid DNA of Pcc CB. Each gene is represented with a box color-coded in white (one specifying a protein), blue (rRNA gene), yellow (tRNA gene) or red (a pseudogene). The colour of the name of each gene represents the transcription direction (black, clockwise; red, counter-clockwise). The bulge connecting two parts of the trnL(UAA) coding sequence indicates the intron. The nucleotide sequences of the complete Pcc CB plastid DNA in forms A (A) and B (B) were deposited to the DDBJ/EMBL/GenBank databases with the accession numbers HF563595 and HF563596, respectively.
Figure 3Proposed origin of the unique Pcc CB plastid DNA.
An intramolecular recombination event probably between trnI and trnV caused partial deletion of one of the two IR units in the ancestral DNA molecule (thick black arrow). The chimeric trnI/trnV in the intermediate molecule (1) was lost probably because it was a pseudogene. Subsequent truncation of the 5' region of rrl in the affected IR unit (thick blue arrow) and degradation of the trnR(ACG) in the other IR unit (thick red arrow) results in the Pcc plastid DNA. The order of these two events as well as which intermediate molecule (2) was generated in this process, are unknown. Finally, differentiation between trnM-1 and -2 as well as between trnT-1 and -2 occurred and these resulted in the Pcc CB plastid DNA. Switching between forms A and B could have happened at any point of this process, and may happen frequently as suggested by the fact that the apparent molar ratio between the two forms is 1∶1. Genes are color-coded as in panels A and B, and those changing/changed are indicated with a green circle.