| Literature DB >> 21857981 |
Hong-Wei Tan1, Guo-Hua Liu, Xia Dong, Rui-Qing Lin, Hui-Qun Song, Si-Yang Huang, Zi-Guo Yuan, Guang-Hui Zhao, Xing-Quan Zhu.
Abstract
In the present study, we determined the complete mitochondrial DNA (mtDNA) sequence of Apis cerana, the Asiatic cavity-nesting honeybee. We present here an analysis of features of its gene content and genome organization in comparison with Apis mellifera to assess the variation within the genus Apis and among main groups of Hymenoptera. The size of the entire mt genome of A. cerana is 15,895 bp, containing 2 ribosomal RNA genes, 13 protein-coding genes, 22 transfer RNA (tRNA) genes and one control region. These genes are transcribed from both strands and have a nucleotide composition high in A and T. The contents of A+T of the complete genomes are 83.96% for A. cerana. The AT bias had a significant effect on both the codon usage pattern and amino acid composition of proteins. There are a total of 3672 codons in all 13 protein-coding genes, excluding termination codons. The most frequently used amino acid is Leu (15.52%), followed by Ile (12.85%), Phe (10.10%), Ser (9.15%) and Met (8.96%). Intergenic regions in the mt genome of A. cerana are 705 bp in total. The order and orientation of the gene arrangement pattern is identical to that of A. mellifera, except for the position of the tRNA-Ser(AGN) gene. Phylogenetic analyses using concatenated amino acid sequences of 13 protein-coding genes, with three different computational algorithms (NJ, MP and ML), all revealed two distinct groups with high statistical support, indicating that A. cerana and A. mellifera are two separate species, consistent with results of previous morphological and molecular studies. The complete mtDNA sequence of A. cerana provides additional genetic markers for studying population genetics, systematics and phylogeographics of honeybees.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21857981 PMCID: PMC3155526 DOI: 10.1371/journal.pone.0023008
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Mitochondrial genome sequences of Hymenoptera sequenced completely or nearly completely prior to the present study and used for phylogenetic analysis.
| Family | Taxon | Accession number | Reference |
| Apidae |
| L06178 | Crozier and Crozier 1993 |
|
| DQ870926 | Cha et al. 2007 | |
|
| EU401918 | Unpublished | |
|
| AF466146 | Silvestre et al. 2008 | |
| Chrysididae |
| AH015389 | Castro et al. 2006 |
| Pergidae |
| AY787816 | Castro and Dowton 2005 |
| Vanhorniidae |
| DQ302100 | Castro et al. 2006 |
| Vespidae |
| EU302588 | Cameron et al. 2008 |
|
| EU024653 | Cameron et al. 2008 | |
| Pteromalidae |
| EU746609 | Oliveira et al. 2008 |
|
| EU746610 | Oliveira et al. 2008 | |
|
| EU746611 | Oliveira et al. 2008 | |
|
| EU746612 | Oliveira et al. 2008 | |
| Cephidae |
| FJ478173 | Dowton et al. 2009 |
| Orussidae |
| FJ478174 | Dowton et al. 2009 |
| Ichneumonidae |
| FJ478176 | Dowton et al. 2009 |
|
| FJ478177 | Dowton et al. 2009 | |
|
| EU871947 | Wei et al. 2009 | |
| Evaniidae |
| FJ593187 | Wei et al. 2009 |
| Stephanidae |
| FJ478175 | Dowton et al. 2009 |
| Braconidae |
| FJ154897 | Wei et al. 2010 |
|
| FJ387020 | Wei et al. 2010 | |
|
| GU097654 | Wei et al. 2010 | |
|
| GU097655 | Wei et al. 2010 | |
|
| GU097656 | Wei et al. 2010 | |
|
| GU097657 | Wei et al. 2010 | |
|
| GU097658 | Wei et al. 2010 |
Figure 1The mitochondrial genome of Apis cerana.
Protein-coding genes are transcribed in a clockwise direction, except for those underlined. The two ribosomal RNA genes were encoded by the L strand. Transfer RNA genes encoded by H and L strands are shown outside and inside the circular gene map, respectively. Transfer RNA genes are designated by single-letter amino acid codes, except those encoding leucine and serine, which are labeled L1, L2, S1, S2, representing tRNA-Leu(CUN), tRNA-Leu(UUR), tRNA-Ser(AGN) and tRNA-Ser(UCN), respectively.
Positions and nucleotide sequence lengths of mitochondrial genomes of Apis cerana, and start and stop codons for protein-coding genes as well as their tRNA gene anticodons (starting from tRNA-S1).
| Gene/Region | Position | Size (bp) | Strand | Anticodon | Codon | Intergenic nucleotides | ||
| Form | To | Start | Stop | |||||
|
| 1 | 60 | 60 | H | TCT | 3 | ||
|
| 64 | 129 | 66 | H | TTC | 34 | ||
|
| 164 | 229 | 66 | H | CAT | 231 | ||
|
| 461 | 522 | 62 | H | TTG | 0 | ||
|
| 523 | 588 | 66 | H | TGC | 18 | ||
|
| 607 | 672 | 66 | H | GAT | 0 | ||
|
| 673 | 1668 | 996 | H | ATT | TAA | −1 | |
|
| 1668 | 1733 | 66 | L | GCA | 5 | ||
|
| 1739 | 1807 | 69 | L | GTA | 16 | ||
|
| 1824 | 1892 | 69 | H | TCA | 0 | ||
|
| 1893 | 3458 | 1566 | H | ATT | TAA | −5 | |
|
| 3454 | 3523 | 70 | H | TAA | 89 | ||
|
| 3613 | 4293 | 681 | H | ATT | TAA | −2 | |
|
| 4292 | 4359 | 68 | H | GTC | 6 | ||
|
| 4366 | 4437 | 72 | H | TTT | 6 | ||
|
| 4444 | 4605 | 162 | H | ATC | TAA | −19 | |
|
| 4587 | 5264 | 678 | H | ATG | TAA | 17 | |
|
| 5282 | 6061 | 780 | H | ATG | TAA | 66 | |
|
| 6128 | 6194 | 67 | H | TCC | 0 | ||
|
| 6195 | 6548 | 354 | H | ATT | TAA | 19 | |
|
| 6568 | 6633 | 66 | L | TCG | 19 | ||
|
| 6653 | 6720 | 68 | H | GTT | 18 | ||
|
| 6739 | 6809 | 71 | L | GAA | 6 | ||
|
| 6816 | 8486 | 1671 | L | ATT | TAA | 0 | |
|
| 8487 | 8552 | 66 | L | GTG | 17 | ||
|
| 8570 | 9898 | 1329 | L | ATA | TAA | 0 | |
|
| 9899 | 10162 | 264 | L | ATT | TAA | 23 | |
|
| 10186 | 10252 | 67 | H | TGT | 15 | ||
|
| 10268 | 10345 | 78 | L | TGG | 50 | ||
|
| 10396 | 10905 | 510 | H | ATT | TAA | 12 | |
|
| 10918 | 12066 | 1149 | H | ATG | TAA | 23 | |
|
| 12090 | 12156 | 67 | H | TGA | 12 | ||
|
| 12169 | 13083 | 915 | L | ATT | TAA | 0 | |
|
| 13084 | 13152 | 69 | L | TAG | 0 | ||
|
| 13153 | 14480 | 1328 | L | 0 | |||
|
| 14481 | 14547 | 67 | L | TAC | 0 | ||
|
| 14548 | 15333 | 786 | L | 0 | |||
| AT-rich region (nCR) | 15334 | 15895 | 562 | 0 | ||||
indicates gap nucleotides (positive value) or overlapped nucleotides (negative value) between two adjacent genes.
Figure 2Comparison of the mitochondrial gene arrangement among A. mellifera, A. cerana and ancestral pancrustacean.
tRNA genes are labeled by one-letter symbol on the bar when transcription starts from right to left direction, or under the bar when transcription starts from left to right direction. Underlined PCGs or rRNA genes are transcribed from right to left and PCGs that are not underlined are transcribed from left to right.
Nucleotide composition and skews of Apis cerana mitochondrial protein-coding and ribosomal RNA genes and comparison with Apis mellifera.
| Gene | Nucleotide frequency | %A+T | AT-skew | GC-skew | AT-skew | GC-skew | |||
| A | G | T | C |
| |||||
|
| 0.366 | 0.056 | 0.472 | 0.106 | 83.8 | −0.127 | −0.309 | −0.099 | −0.269 |
|
| 0.475 | 0.043 | 0.395 | 0.086 | 87.0 | 0.092 | −0.333 | 0.042 | −0.413 |
|
| 0.347 | 0.110 | 0.413 | 0.130 | 76.0 | −0.086 | −0.085 | −0.087 | −0.093 |
|
| 0.385 | 0.087 | 0.404 | 0.125 | 78.9 | −0.024 | −0.181 | −0.029 | −0.167 |
|
| 0.360 | 0.089 | 0.444 | 0.108 | 80.4 | −0.104 | −0.098 | −0.113 | −0.188 |
|
| 0.368 | 0.084 | 0.442 | 0.105 | 81.0 | −0.091 | −0.110 | −0.087 | −0.148 |
|
| 0.355 | 0.109 | 0.480 | 0.056 | 83.5 | −0.149 | 0.325 | −0.184 | 0.359 |
|
| 0.395 | 0.051 | 0.469 | 0.085 | 86.4 | −0.086 | −0.250 | −0.091 | −0.304 |
|
| 0.373 | 0.051 | 0.475 | 0.102 | 84.8 | −0.120 | −0.334 | −0.105 | −0.375 |
|
| 0.360 | 0.093 | 0.491 | 0.056 | 85.1 | −0.153 | 0.252 | −0.161 | 0.337 |
|
| 0.345 | 0.095 | 0.527 | 0.034 | 87.2 | −0.209 | 0.470 | −0.234 | 0.676 |
|
| 0.379 | 0.095 | 0.470 | 0.056 | 84.9 | −0.107 | 0.257 | −0.115 | 0.297 |
|
| 0.431 | 0.051 | 0.433 | 0.084 | 86.4 | −0.002 | −0.246 | −0.014 | −0.545 |
|
| 0.439 | 0.125 | 0.377 | 0.060 | 81.6 | 0.076 | 0.352 | 0.028 | 0.370 |
|
| 0.404 | 0.111 | 0.428 | 0.058 | 83.1 | −0.029 | 0.312 | −0.043 | 0.375 |
| Total | 0.423 | 0.063 | 0.416 | 0.098 | 83.9 | 0.008 | −0.217 | 0.018 | −0.272 |
AT skew = (A%−T%)/(A%+T%); GC skew = (G%−C%)/( G%+C%).
Protein-coding gene assignments and similarity between A. cerana and A. mellifera.
| Protein-coding gene | Number of amino acids | Identity (%) | |
|
|
|
| |
|
| 53 | 52 | 73.1 |
|
| 225 | 226 | 89.4 |
|
| 304 | 305 | 88.2 |
|
| 331 | 333 | 70.1 |
|
| 117 | 117 | 82.1 |
|
| 439 | 447 | 82.7 |
|
| 87 | 87 | 85.1 |
|
| 556 | 554 | 82.7 |
|
| 169 | 167 | 51.5 |
|
| 521 | 521 | 97.5 |
|
| 226 | 225 | 95.6 |
|
| 259 | 259 | 78.8 |
|
| 382 | 383 | 91.9 |
Codon usage in 13 protein-coding genes of the Apis cerana mitochondrial genome.
| Aa | Codon | N | RSCU | Aa | Codon | N | RSCU | Aa | Codon | N | RSCU | Aa | Codon | N | RSCU |
| Phe | UUU | 336 | 1.81 | Ser | UCU | 55 | 1.31 | Tyr | UAU | 202 | 1.84 | Cys | UGU | 26 | 2.00 |
| UUC | 35 | 0.19 | UCC | 9 | 0.21 | UAC | 17 | 0.16 | UGC | 0 | 0 | ||||
| Leu | UUA | 483 | 5.08 | UCA | 167 | 3.98 | End | UAA | 0 | 0 | Trp | UGA | 84 | 2.00 | |
| UUG | 10 | 0.11 | UCG | 0 | 0 | UAG | 0 | 0 | UGG | 0 | 0 | ||||
| Leu | CUU | 40 | 0.42 | Pro | CCU | 34 | 1.30 | His | CAU | 55 | 1.80 | Arg | CGU | 9 | 0.95 |
| CUC | 1 | 0.01 | CCC | 4 | 0.15 | CAC | 6 | 0.20 | CGC | 1 | 0.11 | ||||
| CUA | 34 | 0.36 | CCA | 67 | 2.55 | Gln | CAA | 39 | 2.00 | CGA | 28 | 2.95 | |||
| CUG | 2 | 0.02 | CCG | 0 | 0 | CAG | 0 | 0 | CGG | 0 | 0 | ||||
| lle | AUU | 452 | 1.92 | Thr | ACU | 38 | 1.26 | Asn | AAU | 223 | 1.88 | Ser | AGU | 15 | 0.36 |
| AUC | 20 | 0.08 | ACC | 1 | 0.03 | AAC | 14 | 0.12 | AGC | 0 | 0 | ||||
| Met | AUA | 312 | 1.90 | ACA | 82 | 2.71 | Lys | AAA | 156 | 1.91 | AGA | 85 | 2.02 | ||
| AUG | 17 | 0.10 | ACG | 0 | 0 | AAG | 7 | 0.09 | AGG | 5 | 0.12 | ||||
| Val | GUU | 83 | 2.21 | Ala | GCU | 29 | 1.63 | Asp | GAU | 60 | 1.94 | Gly | GGU | 44 | 1.30 |
| GUC | 7 | 0.19 | GCC | 4 | 0.23 | GAC | 2 | 0.06 | GGC | 3 | 0.09 | ||||
| GUA | 59 | 1.57 | GCA | 36 | 2.03 | Glu | GAA | 77 | 1.86 | GGA | 82 | 2.43 | |||
| GUG | 1 | 0.03 | GCG | 2 | 0.11 | GAG | 6 | 0.14 | GGG | 6 | 0.18 |
A total of 3672 codons for A. cerana were analyzed, excluding the termination codons.
Aa = Amino acid; N = frequency of each codon; RSCU = relative synonymous codon usage.
Comparison of A+T content (%) of the AT region, protein-coding and rRNA genes of mitochondrial genomes of Hymenopteran.
| Gene/Region | AE | AC | AM | BH | BI | CC | CV | DS | ESP | EA | MB | OO | PSP | PH | SA | VE |
| AT-region | 89.9 | 95.9 | 96.0 | / | 96.0 | 88.3 | / | 96.4 | 88.1 | 85.6 | / | 79.7 | 78.9 | / | / | / |
|
| 86.1 | 87.0 | 89.3 | 92.6 | 92.6 | 82.8 | 93.0 | 89.5 | 91.7 | 69.1 | 91.1 | 82.4 | 86.5 | 93.7 | 90.1 | 82.7 |
|
| 80.4 | 83.8 | 84.7 | 85.0 | 85.4 | 77.3 | 86.4 | 84.1 | 84.6 | 73.9 | 86.6 | 74.8 | 77.6 | 83.1 | 84.3 | 76.3 |
|
| 80.6 | 83.5 | 83.0 | 82.7 | 84.7 | 79.8 | 85.3 | 85.2 | 82.3 | 74.6 | 86.4 | 75.0 | 75.4 | 84.7 | 79.0 | 79.0 |
|
| 80.9 | 86.4 | 86.5 | 89.0 | 89.8 | 85.8 | 91.2 | 89.2 | 88.9 | 80.5 | 91.2 | 75.3 | 80.4 | 89.9 | 88.8 | 83.7 |
|
| 80.5 | 84.8 | 86.4 | 88.3 | 90.0 | 78.3 | 88.7 | 85.0 | 87.3 | 74.1 | 89.0 | 75.0 | 78.7 | 84.5 | 83.5 | 79.9 |
|
| 82.1 | 85.1 | 86.5 | 85.5 | 85.9 | 81.3 | 87.5 | 85.8 | 86.0 | 77.8 | 89.2 | 77.0 | 77.4 | 84.1 | 83.7 | 80.1 |
|
| 83.5 | 87.1 | 85.6 | 86.1 | 88.0 | 85.0 | 88.7 | 89.1 | 88.1 | 77.5 | 90.7 | 82.5 | 81.3 | 89.1 | 87.6 | 85.0 |
|
| 80.7 | 84.9 | 85.7 | 86.9 | 87.4 | 80.5 | 88.8 | 86.0 | 86.3 | 76.3 | 88.6 | 77.4 | 77.9 | 85.7 | 84.2 | 80.6 |
|
| 84.5 | 86.5 | 86.9 | 92.3 | 92.1 | 85.1 | 92.3 | 90.2 | 90.1 | 78.7 | 93.0 | 77.3 | 82.5 | 90.3 | 88.8 | 82.7 |
|
| 80.4 | 81.6 | 81.4 | 85.4 | 86.6 | 80.5 | 91.2 | 88.5 | 89.4 | 75.8 | 82.8 | 81.1 | 77.1 | 84.0 | 89.3 | 80.8 |
|
| 82.9 | 83.1 | 84.5 | 85.3 | 85.5 | 84.6 | 89.7 | 88.0 | 87.2 | 79.7 | 86.8 | 80.1 | 79.4 | 87.3 | 88.8 | 82.9 |
|
| 72.0 | 76.0 | 75.9 | 76.4 | 77.6 | 72.7 | 77.2 | 76.1 | 77.6 | 68.8 | 79.2 | 75.3 | 70.3 | 75.3 | 75.8 | 71.7 |
|
| 75.5 | 78.9 | 80.5 | 81.0 | 82.0 | 74.9 | 83.3 | 81.3 | 82.6 | 72.0 | 83.0 | 72.1 | 73.7 | 80.5 | 80.4 | 74.0 |
|
| 75.4 | 80.4 | 83.0 | 82.6 | 84.8 | 75.6 | 83.6 | 80.2 | 80.2 | 73.9 | 84.7 | 72.1 | 72.3 | 80.5 | 77.2 | 77.0 |
|
| 75.6 | 81.0 | 80.6 | 81.0 | 82.8 | 77.0 | 82.3 | 80.1 | 81.0 | 73.3 | 82.6 | 71.1 | 73.0 | 81.2 | 78.9 | 75.7 |
| EmtG | 80.6 | 84.0 | 84.9 | 85.3 | 86.8 | 82.0 | 87.2 | 87.4 | 85.2 | 77.8 | 86.7 | 76.2 | 77.0 | 84.7 | 84.0 | 80.1 |
AE: Abispa ephippium, AC: Apis cerana, AM: Apis mellifera, BH: Bombus hypocrita, BI: Bombus ignitus, CC: Cephus cinctus, CV: Cotesia vestalis, DS: Diadegma semiclausum, ESP: Enicospilus sp, EA: Evania appendigaster, MB: Melipona bicolor, OO: Orussus occidentalis, PSP: Philaenus spumarius, PH: Polistes humilis, SA: Spathius agrili, VE: Vanhornia eucnemidarum, EmtG: entire mitochondrial genome.
Figure 3Inferred phylogenetic relationship among the Hymenoptera species.
The analyses were conducted using maximum parsimony (MP), maximum likelihood (ML) and neighbour joining (NJ) of amino acid sequences of 13 protein-coding genes, using Triatoma dimidiata as outgroup. The numbers along branches indicate bootstrap values resulting from different analyses in the order: MP/ML/NJ.
Sequences of primers used to amplify PCR fragments from the Apis cerana mitochondrial genome.
| Primers | Sequence (5′-3′) | Source | Estimated size of PCR products |
| AC- |
| This study | 680 bp |
| AC- |
| This study | |
| AC- |
| This study | 370 bp |
| AC- |
| This study | |
| AC- |
| This study | 450 bp |
| AC- |
| This study | |
| AC-L |
| This study | 7.1 kb |
| AC-L |
| This study | |
| AC-L |
| This study | 4.8 kb |
| AC-L |
| This study | |
| AC-L |
| This study | 5.6 kb |
| AC-L |
| This study |