| Literature DB >> 24301182 |
Takayuki Imura1, Shigeharu Sato, Yukita Sato, Daichi Sakamoto, Takashi Isobe, Koichi Murata, Anthony A Holder, Masayoshi Yukawa.
Abstract
Leucocytozoon caulleryi, a haemosporidian parasite of the chicken (Gallus gallus domesticus), can be highly pathogenic and often fatal. Although this parasite is extremely relevant to veterinary science, knowledge of its genomic features is limited. To gain information applicable to developing novel control methods for the parasite, we analyzed the apicoplast genome of L. caulleryi. This extranuclear organellar DNA of 85.1% A + T and a unit of 34,779 bp was found to encode almost the same set of genes as the plastid genome of Plasmodium falciparum, including 16 tRNA and 30 protein coding genes, and except for one open reading frame, ORF91 absent in L. caulleryi. As in P. falciparum, the L. caulleryi apicoplast DNA contains two sets of a unique inverted repeat (IR), each one 5,253 bp and encoding genes specifying one large and one small rRNA subunit and nine tRNAs but no protein, and separated by a unique 13 bp sequence. Studies of several haemosporidian apicoplast DNA sequences have identified a corresponding IR region; however, none of these studies has looked at the complete sequence, even for well-studied species such as P. falciparum. Phylogenetic studies using a concatenated amino acid sequence based on the open reading frames confirmed the close relationship between L. caulleryi and Plasmodium spp. In this study, we determined the nucleotide sequence of the entire L. caulleryi apicoplast genome, including the region connecting the two IR units. This is the first report of the complete nucleotide sequence of a haemosporidian apicoplast DNA with a canonical IR.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24301182 PMCID: PMC3932168 DOI: 10.1007/s00436-013-3712-9
Source DB: PubMed Journal: Parasitol Res ISSN: 0932-0113 Impact factor: 2.289
Comparison of apicoplast genomes
| Species |
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| Complete/partial | Complete | Partial | Complete | Complete | Complete | Complete | Complete |
| Size (bp) | 34,779 | 34,682 | 29,623 | 34,996 | 34,750 | 39,579 | 33,351 |
| A + T(%) | 85.1 | 86.9 | 86.3 | 78.6 | 79.4 | 80.5 | 78.2 |
| IRz | +a | + | − | + | + | − | − |
| Proteinb,c | 30 | 31 | 31 | 29 | 29 | 32d | 30e |
|
| 1 | 1 | 1 | 1 | 1 | 2 | 2 |
|
| + | + | + | + | + | − | − |
|
| + | + | + | − | − | − | − |
|
| − | + | + | − | − | − | − |
| rRNAb | 2f | 2f | 2 | 2f | 2f | 2 | 2 |
| tRNAb | 25g | 25g | 26h | 24g | 24g | 24i | 24 |
|
| + | + | + | − | − | − | − |
| Intron in | + | + | + | + | − | + | + |
| In-frame stop codonj | − | − | − | + | + | − | − |
| Coding strand | Both | Both | Both | Both | Both | One | One |
| Accession number | AP013071 | X95275, X95276 | HF563595 | U87145 | AY217738 | NC_007758 | NC_011395 |
zInverted repeat
a + present, − absent
bNumber of different products encoded
c rpoC2 considered two separate proteins (rpoC2A and rpoC2B)
dExcluding those specified by 12 repetitive genes predicted between rpoC1 and rpoC2A
eExcluding repetitive gene products from BBOV_V000300, BBOV_V000310, BBOV_V000320, BBOV_V000180, BBOV_V000200, and BBOV_V000210
fThe gene for each is present in IR and duplicated
gGenes specifying 9 tRNA species (tRNA-Ala(UGC), Arg(ACG), Arg(UCU), Asn(GUU), Ile(GAU), Leu(UAG), Met(CAU), Thr(UGU), and Val(UAC)) are present in IR and duplicated
hTwo genes encode different versions of tRNA-Thr. The gene for tRNA-Met is duplicated
iFive genes encode the same tRNA-Met(CAU)
jExcluding the hypothetical codon connecting rpoC2A and rpoC2B
Fig. 1Map of the L. caulleryi plastid genome. Concentric circles represent the circular double-stranded DNA genome, and each gene annotated in the DNA sequence is illustrated on either of the two circles depending on the direction of transcription (outer circle clockwise; inner circle anticlockwise). Yellow boxes represent genes specifying either a protein or an rRNA, whereas lines and gray boxes indicate tRNA-coding genes or hypothetical open reading frames, respectively. Each inverted repeat (IR) unit is indicated by the outermost dotted arcs, and the asymmetric 13-bp sequence separating the two IR units is marked with an asterisk
Fig. 2Phylogeny of the apicoplast genomes. Amino acid sequences specified by the 10 selected plastid-encoded genes were concatenated for each species, and the phylogeny was analyzed by the maximum likelihood method with the JTT + F + Γ model to obtain the dendrogram. Numbers at nodes indicate bootstrap values from 1,000 replicates. The scale bar indicates 0.2 substitutions per site. The accession numbers of the nucleotide sequence data, from which the amino acid sequences used in the analysis were deduced, are given in parentheses after the name of each taxon