Literature DB >> 21857981

The complete mitochondrial genome of the Asiatic cavity-nesting honeybee Apis cerana (Hymenoptera: Apidae).

Hong-Wei Tan1, Guo-Hua Liu, Xia Dong, Rui-Qing Lin, Hui-Qun Song, Si-Yang Huang, Zi-Guo Yuan, Guang-Hui Zhao, Xing-Quan Zhu.   

Abstract

In the present study, we determined the complete mitochondrial DNA (mtDNA) sequence of Apis cerana, the Asiatic cavity-nesting honeybee. We present here an analysis of features of its gene content and genome organization in comparison with Apis mellifera to assess the variation within the genus Apis and among main groups of Hymenoptera. The size of the entire mt genome of A. cerana is 15,895 bp, containing 2 ribosomal RNA genes, 13 protein-coding genes, 22 transfer RNA (tRNA) genes and one control region. These genes are transcribed from both strands and have a nucleotide composition high in A and T. The contents of A+T of the complete genomes are 83.96% for A. cerana. The AT bias had a significant effect on both the codon usage pattern and amino acid composition of proteins. There are a total of 3672 codons in all 13 protein-coding genes, excluding termination codons. The most frequently used amino acid is Leu (15.52%), followed by Ile (12.85%), Phe (10.10%), Ser (9.15%) and Met (8.96%). Intergenic regions in the mt genome of A. cerana are 705 bp in total. The order and orientation of the gene arrangement pattern is identical to that of A. mellifera, except for the position of the tRNA-Ser(AGN) gene. Phylogenetic analyses using concatenated amino acid sequences of 13 protein-coding genes, with three different computational algorithms (NJ, MP and ML), all revealed two distinct groups with high statistical support, indicating that A. cerana and A. mellifera are two separate species, consistent with results of previous morphological and molecular studies. The complete mtDNA sequence of A. cerana provides additional genetic markers for studying population genetics, systematics and phylogeographics of honeybees.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21857981      PMCID: PMC3155526          DOI: 10.1371/journal.pone.0023008

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

The Asiatic honeybee, Apis cerana Fabricius (Hymenoptera: Apidae), is an important honeybee species in Asian countries. For a long time A. cerana was considered a sub-species of A. mellifera. However, on the basis of several genetic, morphological, and behavioural characteristics, it has been established that A. cerana and A. mellifera are two completely isolated species, and they were considered most recently derived and sister taxa [1]–[4]. Honeybee has been the object of considerable scientific curiosity and has served as the classical model organism in diverse fields such as navigation, face recognition and sensory biology for almost a century [5]–[7]. Honeybee is regarded as the premier pollinator of major fruit crops. It has a long history of association with mankind because of its cavity-nesting lifestyle and it is used for producing honey, bee pollen, wax, and royal jelly [8]. Most metazoan species possess a compact, circular mitochondrial (mt) genome, which varies in size from 14 to 19 kb that contain 36–37 genes, including 12–13 protein-coding genes, two ribosomal RNAs (rRNA) genes and 22 transfer RNAs (tRNA) genes necessary for translation of the proteins encoded by the mtDNA [9]–[11]. mtDNA has been extensively used for studying phylogenetic and evolutionary relationships among animal species, due to its maternal inheritance, rapid evolutionary rate, and lack of genetic recombination [12]–[16]. mtDNA has proved to be an important tool in intra-specific and inter-specific phylogenetic studies of honeybees [17]–[21], and one region of the Apis mt genome, a non-coding sequence located between cytochrome oxidase I (cox1) and cytochrome oxidase II (cox2) has proven to be particularly informative for intra-specific studies [22], [23]. The Hymenoptera are one of the largest insect orders belonging to hexapods, comprising over 100,000 described species [24]. In 1993, Crozier reported the complete sequence of the A. mellifera mt genome, this was the first identified mt genome of Hymenopteran [25]. Until recently, only 9 complete and 18 nearly complete mt genome sequences have been determined in Hymenoptera [25]–[35] (Table 1), and in the genus Apis, only the A. mellifera mt genome has been determined. The lack of knowledge of mt genomics for Hymenoptera is a major limitation for population genetic and phylogenetic studies of the Hymenoptera including the species in the Apidae.
Table 1

Mitochondrial genome sequences of Hymenoptera sequenced completely or nearly completely prior to the present study and used for phylogenetic analysis.

FamilyTaxonAccession numberReference
Apidae Apis mellifera L06178Crozier and Crozier 1993
Bombus ignitus DQ870926Cha et al. 2007
Bombus hypocrita EU401918Unpublished
Melipona bicolor AF466146Silvestre et al. 2008
Chrysididae Primeuchroeus sppAH015389Castro et al. 2006
Pergidae Perga condei AY787816Castro and Dowton 2005
Vanhorniidae Vanhornia eucnemidarum DQ302100Castro et al. 2006
Vespidae Abispa ephippium EU302588Cameron et al. 2008
Polistes humilis synoecus EU024653Cameron et al. 2008
Pteromalidae Nasonia vitripennis AsymCEU746609Oliveira et al. 2008
Nasonia vitripennis HiCD12EU746610Oliveira et al. 2008
Nasonia giraulti RV2EU746611Oliveira et al. 2008
Nasonia longicornis IV7EU746612Oliveira et al. 2008
Cephidae Cephus cinctus FJ478173Dowton et al. 2009
Orussidae Orussus occidentalis FJ478174Dowton et al. 2009
Ichneumonidae Venturia canescens FJ478176Dowton et al. 2009
Enicospilus spFJ478177Dowton et al. 2009
Diadegma semiclausum EU871947Wei et al. 2009
Evaniidae Evania appendigaster FJ593187Wei et al. 2009
Stephanidae Schlettererius cinctipes FJ478175Dowton et al. 2009
Braconidae Cotesia vestalis FJ154897Wei et al. 2010
Spathius agrili FJ387020Wei et al. 2010
Phanerotoma flava GU097654Wei et al. 2010
Diachasmimorpha longicaudata GU097655Wei et al. 2010
Macrocentrus camphoraphilus GU097656Wei et al. 2010
Meteorus pulchricornis GU097657Wei et al. 2010
Aphidius gifuensis GU097658Wei et al. 2010
In the present study, we determined the complete nucleotide sequence of the A. cerana mt genome, we performed phylogenetic analyses using selected Hymenoptera species. The new sequence may provide useful information on both genomics and the evolution of Hymenoptera, because there are only a few complete (or nearly complete) mtDNA sequences available from these animals.

Results and Discussion

General features of the mt genome of A. cerana

The length of the complete mt genome of A. cerana was 15,895 bp (Figure 1), and the mtDNA sequence was deposited in the GenBank under the accession number GQ162109. The mt genome of A. cerana contains 13 protein-coding genes (cox1-3, nad1-6, nad4L, atp6, atp8 and cob), a small subunit ribosomal RNA gene (rrnS), a large subunit ribosomal RNA gene (rrnL), and 22 transfer RNA genes (Table 2). The A. cerana mt genome shows a novel arrangement within Hymenoptera compared with the ancestral pancrustacean mt genome organization [36]. All rearranged genes are tRNAs, tRNA gene rearrangements can be classified as translocations, local inversions, or shuffling. A translocation is a movement of a gene to another position across a protein-coding gene. A rearrangement is classified as an inversion when the tRNA is found on the opposite strand and as shuffling when the tRNA gene is on the same mt stand but in a different position compared with the ancestral organization (without movement across a protein-coding gene) [37].
Figure 1

The mitochondrial genome of Apis cerana.

Protein-coding genes are transcribed in a clockwise direction, except for those underlined. The two ribosomal RNA genes were encoded by the L strand. Transfer RNA genes encoded by H and L strands are shown outside and inside the circular gene map, respectively. Transfer RNA genes are designated by single-letter amino acid codes, except those encoding leucine and serine, which are labeled L1, L2, S1, S2, representing tRNA-Leu(CUN), tRNA-Leu(UUR), tRNA-Ser(AGN) and tRNA-Ser(UCN), respectively.

Table 2

Positions and nucleotide sequence lengths of mitochondrial genomes of Apis cerana, and start and stop codons for protein-coding genes as well as their tRNA gene anticodons (starting from tRNA-S1).

Gene/RegionPositionSize (bp)StrandAnticodonCodonIntergenic nucleotidesa
FormToStartStop
tRNA-Ser(AGN) (S1)16060HTCT3
tRNA-Glu (E)6412966HTTC34
tRNA-Met (M)16422966HCAT231
tRNA-Gln (Q)46152262HTTG0
tRNA-Ala (A)52358866HTGC18
tRNA-Ile (I)60767266HGAT0
nad26731668996HATTTAA−1
tRNA-Cys (C)1668173366LGCA5
tRNA-Tyr (Y)1739180769LGTA16
tRNA-Trp (W)1824189269HTCA0
cox1189334581566HATTTAA−5
tRNA-Leu(UUR) (L2)3454352370HTAA89
cox236134293681HATTTAA−2
tRNA-Asp (D)4292435968HGTC6
tRNA-Lys (K)4366443772HTTT6
atp844444605162HATCTAA−19
atp645875264678HATGTAA17
cox352826061780HATGTAA66
tRNA-Gly (G)6128619467HTCC0
nad361956548354HATTTAA19
tRNA-Arg (R)6568663366LTCG19
tRNA-Asn (N)6653672068HGTT18
tRNA-Phe (F)6739680971LGAA6
nad5681684861671LATTTAA0
tRNA-His (H)8487855266LGTG17
nad4857098981329LATATAA0
nad4L989910162264LATTTAA23
tRNA-Thr (T)101861025267HTGT15
tRNA-Pro (P)102681034578LTGG50
nad61039610905510HATTTAA12
cob10918120661149HATGTAA23
tRNA-Ser(UCN) (S2)120901215667HTGA12
nad11216913083915LATTTAA0
tRNA-Leu(CUN) (L1)130841315269LTAG0
rrnL13153144801328L0
tRNA-Val (V)144811454767LTAC0
rrnS1454815333786L0
AT-rich region (nCR)15334158955620

indicates gap nucleotides (positive value) or overlapped nucleotides (negative value) between two adjacent genes.

The mitochondrial genome of Apis cerana.

Protein-coding genes are transcribed in a clockwise direction, except for those underlined. The two ribosomal RNA genes were encoded by the L strand. Transfer RNA genes encoded by H and L strands are shown outside and inside the circular gene map, respectively. Transfer RNA genes are designated by single-letter amino acid codes, except those encoding leucine and serine, which are labeled L1, L2, S1, S2, representing tRNA-Leu(CUN), tRNA-Leu(UUR), tRNA-Ser(AGN) and tRNA-Ser(UCN), respectively. indicates gap nucleotides (positive value) or overlapped nucleotides (negative value) between two adjacent genes. As shown in Figure 2, we identified eight rearrangements shared between A. cerana and A. mellifera. tRNA-Ala, tRNA-Ser(AGN) and tRNA-Glu move to tRNA-Ile—tRNA-Gln—tRNA-Met cluster, and tRNA-Ala changes position relative to tRNA-Ser(AGN) and tRNA-Glu. tRNA-Trp moves across two tRNAs (tRNA-Cys and tRNA-Tyr) boundaries. The order of tRNA-Ile—tRNA-Gln—tRNA-Met becomes tRNA-Met—tRNA-Gln—tRNA-Ile, and two of tRNAs (tRNA-Gln and tRNA-Arg) arrangements are inversions. Finally, the tRNA-Lys and tRNA-Asp swap positions. The orientation and gene order of the A. cerana mt genome were identical to that of A. mellifera except for the tRNA-Ser(AGN). In A. cerana, tRNA-Ser(AGN) gene was only translocated from nad3-nad5 junction to the junction of nad2 and AT-rich region compared with the ancestral pancrustacean mt genome organization. In A. mellifera, tRNA-Ser(AGN) gene was not only translocated but also shuffled, which swapped positions with tRNA-Glu gene. The duplication/random loss model, and the intramitochondrial genome recombination and duplication/nonrandom loss model are possible mechanisms to explain translocation and shuffling. Of these, intramitochondrial genome recombination is presumed to be more common [37].
Figure 2

Comparison of the mitochondrial gene arrangement among A. mellifera, A. cerana and ancestral pancrustacean.

tRNA genes are labeled by one-letter symbol on the bar when transcription starts from right to left direction, or under the bar when transcription starts from left to right direction. Underlined PCGs or rRNA genes are transcribed from right to left and PCGs that are not underlined are transcribed from left to right.

Comparison of the mitochondrial gene arrangement among A. mellifera, A. cerana and ancestral pancrustacean.

tRNA genes are labeled by one-letter symbol on the bar when transcription starts from right to left direction, or under the bar when transcription starts from left to right direction. Underlined PCGs or rRNA genes are transcribed from right to left and PCGs that are not underlined are transcribed from left to right. The genes atp6, atp8, cox1, cox2, cox3, cob, nad2, nad3 and nad6 are transcribed from one strand, while nad1, nad4, nad4L, and nad5 are transcribed from the other strand. The nucleotide compositions of the entire mtDNA sequences for A. cerana are biased toward A and T, with T being the most favored nucleotide and G the least favored, in accordance with the mt genome of A. mellifera. The content of A+T is 83.9% for A. cerana (42.3% A, 41.6% T, 6.3% G and 9.8% C) (Table 3), 84.9% for A. mellifera (43.2% A, 41.7% T, 5.5% G and 9.6% C), respectively. The bias of the base composition of an individual strand can be described by skewness [38], which is calculated as (A%−T%)/(A%+T%) and (G%−C%)/(C%+G%), respectively. AT-skews and GC-skews of each of the 13 protein-coding genes and the 2 ribosomal RNA genes and the whole mt genome were calculated for A. cerana and A. mellifera (Table 3). In A. cerana, except for the atp8 gene which has a relatively weak skew of A vs T (AT skew = 0.092), other twelve PCGs have a skew of T vs A (AT skew between −0.002 and −0.209). The PCGs of H-strand have a strong skew of C vs G (GC skew between −0.098 and −0.309), and the nad1, nad4, nad4L and nad5 genes, which coded on the L-strand have a strong skew of G vs C (GC skew between 0.252 and 0.470). In the two rRNA genes, the GC skew displayed the same pattern (GC skew = 0.352 and 0.312 for the rrnS and rrnL genes, respectively). The AT skew displayed an opposite pattern (AT skew = 0.076 and −0.029 for rrnS and rrnL genes, respectively). By comparing the skew values of 13 protein-coding genes and two rRNA genes between A. cerana and A. mellifera, we found that GC skews of different genes coded on the same strand are all positive or negative, whereas the AT skews of different genes coded on the same strand are either positive or negative. This is congruent with the pattern of skew values in other insects [39]. Wei et al. (2010) found that the criterion for detecting reversal of strand asymmetry on mt genomes should be the sign of GC skew values, not the AT skew values.
Table 3

Nucleotide composition and skews of Apis cerana mitochondrial protein-coding and ribosomal RNA genes and comparison with Apis mellifera.

GeneNucleotide frequency%A+TAT-skewGC-skewAT-skewGC-skew
AGTC A. mellifera
atp60.3660.0560.4720.10683.8−0.127−0.309−0.099−0.269
atp80.4750.0430.3950.08687.00.092−0.3330.042−0.413
cox10.3470.1100.4130.13076.0−0.086−0.085−0.087−0.093
cox20.3850.0870.4040.12578.9−0.024−0.181−0.029−0.167
cox30.3600.0890.4440.10880.4−0.104−0.098−0.113−0.188
cob0.3680.0840.4420.10581.0−0.091−0.110−0.087−0.148
nad10.3550.1090.4800.05683.5−0.1490.325−0.1840.359
nad20.3950.0510.4690.08586.4−0.086−0.250−0.091−0.304
nad30.3730.0510.4750.10284.8−0.120−0.334−0.105−0.375
nad40.3600.0930.4910.05685.1−0.1530.252−0.1610.337
nad4L0.3450.0950.5270.03487.2−0.2090.470−0.2340.676
nad50.3790.0950.4700.05684.9−0.1070.257−0.1150.297
nad60.4310.0510.4330.08486.4−0.002−0.246−0.014−0.545
rrnS0.4390.1250.3770.06081.60.0760.3520.0280.370
rrnL 0.4040.1110.4280.05883.1−0.0290.312−0.0430.375
Total0.4230.0630.4160.09883.90.008−0.2170.018−0.272

AT skew = (A%−T%)/(A%+T%); GC skew = (G%−C%)/( G%+C%).

AT skew = (A%−T%)/(A%+T%); GC skew = (G%−C%)/( G%+C%). The A. cerana mt genes overlap a total of 27 bp in 4 locations ranging from 1 to 19 bp (Table 2). Gene overlaps on the same strand in the mt genome of A. cerana can be founded in three regions. The longest is a 19 bp overlap between atp8 and atp6. The second case is a 5 bp overlap between cox1 and tRNA-Leu(UUR). The third is 2 bp overlap occurred between cox2 and tRNA-Asp. Overlaps of genes coded by the different strands also can be found in the mt genome of A. cerana. one bp overlap exists between nad2 and tRNA-Cys. Interestingly, the A. cerana mt genome has the same overlapping nucleotides and locations between genes as those of A. mellifera, where genes also overlap a total of 27 bp in four locations [25]. Apart form the A+T-rich region, the A. cerana mt genes are separated by a total of 705 bp of intergenic spacer sequences, which are spread over 22 regions and range in size form 1 to 231 bp (Table 2). The longest intergenic region (231 bp) is located between tRNA-Met and tRNA-Gln genes. In A. mellifera mt genome, a total of 813 bp of intergenic spacer sequences are spread over 24 regions, ranging in size from 1 to 193 bp. The longest one is located between tRNA-Leu(UUR) and cox2 [25]. This region has proven to be particularly informative for intraspecific studies, because the sequences do not appear to be subject to strong purifying selection and accumulate numerous base substitutions and insertion/deletions [21]. In comparison, A. cerana has only 89 bp intergenic spacer sequence in this region. Such short intergenic regions were also found in other metazoans [40]–[42]. Furthermore, previous studies have shown that the intergenic regions in the mt genomes of some metazoan contain signals for transcription initiation and replication [43], [44]. In the spacer region (tRNA-Ser(UCR)–nad1), we found a 6 bp motif (TACTTA), which is conserved between A. cerana and A. mellifera. This intergenic spacer region may correspond to the binding site of mtTERM, a transcription attenuation factor [45].

Protein-coding genes and codon usage patterns

The boundaries between protein-coding genes in the mt genome of A. cerana were determined by aligning their sequences and by identifying translation initiation and termination codons with comparison to those of A. mellifera. The amino acid sequences inferred for nad3, nad4L, cox1 and cox3 genes are similar in length to those of A. mellifera (Table 4). The similarity of the amino acid sequences is 51.5%–97.5% between A. cerana and A. mellifera (Table 4). Based on similarity, cox1 is the most conserved protein-coding gene, while the nad6 is the least conserved. Genes in mt genome may have different evolutionary rates, which might be caused by different selection pressures or the restriction of gene function. That is what we found in honeybee that rates of different mitochondrial genes diverged differently.
Table 4

Protein-coding gene assignments and similarity between A. cerana and A. mellifera.

Protein-coding geneNumber of amino acidsIdentity (%)
A. cerana A. mellifera A. cerana/A. mellifera
atp8535273.1
atp622522689.4
nad130430588.2
nad233133370.1
nad311711782.1
nad443944782.7
nad4L878785.1
nad555655482.7
nad616916751.5
cox152152197.5
cox222622595.6
cox325925978.8
cob38238391.9
The predicted translation initiation and termination codons for the protein-coding genes of A. cerana mt genome were compared with those of A. mellifera. All protein-coding sequences started with a typical ATN codon. The start codons inferred in the mt genome of A. cerana are ATC (one of 13 protein-coding genes), ATT (eight of 13 protein-coding genes), ATG (three of 13 protein-coding genes) and ATA (one of 13 protein-coding genes) as a translation initiation codon (Table 2). Furthermore, the anomalous initiation codon and incomplete stop codon frequently occur in protein-coding genes of most of the Hymenoptera mtDNA sequenced to date, such as the gene cox1 has been found to employ TTG as a start codon and nad4 has been found to employ TA as a stop codon in Vanhornia eucnemidarum [27]. The codon TGA, which is a termination codon in the universal code, codes for tryptophan in most mt genomes except those of higher plants [46]. All stop codons used TAA as a termination codon in the mt genome of A. cerana. This is presumably polyadenylation after transcription to complete the termination codon [47]. However, two of the 13 protein-coding genes (cox1, cox2) are predicted to employ T as an initiation codon in the mt genome of A. mellifera [25]. Furthermore, incomplete stop codon frequently occurs in protein-coding genes of most of the Hymenoptera mt DNA sequenced to date [27], [28], [32], [34]. The pattern of codon usage in the A. cerana mtDNA was also studied. Excluding the termination codons, a total of 3672 amino acids are encoded by the A. cerana mt genome, and a total of 3676 amino acids for A. mellifera. The most frequently used amino acids were Leu (15.52%), followed by Ile (12.85%), Phe (10.10%), Ser (9.15%) and Met (8.96%) (Table 5).
Table 5

Codon usage in 13 protein-coding genes of the Apis cerana mitochondrial genome.

AaCodonNRSCUAaCodonNRSCUAaCodonNRSCUAaCodonNRSCU
PheUUU3361.81SerUCU551.31TyrUAU2021.84CysUGU262.00
UUC350.19UCC90.21UAC170.16UGC00
LeuUUA4835.08UCA1673.98EndUAA00TrpUGA842.00
UUG100.11UCG00UAG00UGG00
LeuCUU400.42ProCCU341.30HisCAU551.80ArgCGU90.95
CUC10.01CCC40.15CAC60.20CGC10.11
CUA340.36CCA672.55GlnCAA392.00CGA282.95
CUG20.02CCG00CAG00CGG00
lleAUU4521.92ThrACU381.26AsnAAU2231.88SerAGU150.36
AUC200.08ACC10.03AAC140.12AGC00
MetAUA3121.90ACA822.71LysAAA1561.91AGA852.02
AUG170.10ACG00AAG70.09AGG50.12
ValGUU832.21AlaGCU291.63AspGAU601.94GlyGGU441.30
GUC70.19GCC40.23GAC20.06GGC30.09
GUA591.57GCA362.03GluGAA771.86GGA822.43
GUG10.03GCG20.11GAG60.14GGG60.18

A total of 3672 codons for A. cerana were analyzed, excluding the termination codons.

Aa = Amino acid; N = frequency of each codon; RSCU = relative synonymous codon usage.

A total of 3672 codons for A. cerana were analyzed, excluding the termination codons. Aa = Amino acid; N = frequency of each codon; RSCU = relative synonymous codon usage.

Transfer RNA genes

The 22 tRNA genes encoded in the mt genome of the A. cerana vary in length from 60 to 78 nucleotides with differences in stem and loop sizes of dihydrouridine (D) and TΨC loops. The order and orientation of the gene arrangement pattern are identical to that of A. mellifera, except for the position of the tRNA-Ser(AGN) gene. All of the 22 tRNA genes have the typical cloverleaf structure, except for tRNA-Ser(AGN) that lacks DHU arm (Figure S1). Their putative secondary structures (Figure S1) are similar to those of A. mellifera [25], indicating their similar functions. Among Hymenopteran, mismatched base pairs have been reported in mitochondrial tRNAs inferred for A. mellifera, Melipona bicolor, Diadegma semiclausum, Bombus ignitus and Evania appendigaster [25], [28], [31], [32], [34]. A total of 6 mismatched base pairs occur in tRNAs of A. cerana. Two tRNA genes, tRNA-Ala and tRNA-Gln, have a single G-T and T-T mismatches in the acceptor stem, respectively. tRNA-Val has a single T-T mismatch in the anticodon arm, and tRNA-Cys, tRNA-His and tRNA-Pro all have a single G-T mismatches in the DHU arm. Sequences of three tRNAs overlap with adjacent genes, tRNA-Cys overlaps with the adjacent gene nad2 on the opposite strand for 1 bp at its 3′-end. tRNA-Leu(UUR) overlaps with the adjacent gene cox1 on the same strand for 5 bp at its 3′-end. tRNA-Asp overlaps with the adjacent gene cox2 on the same strand for 2 bp at its 3′-end. Finally, and most importantly, stem mismatches and sequence overlap are not uncommon for mt tRNAs of Hymenopteran, and are probably repaired by a post-transcriptional editing process [48], [49]. The 22 tRNA genes are located on both strands, 14 tRNAs are encoded on the H-strand and 8 on the L-strand (Table 2). In these tRNAs, there is a strict conservation of the sizes of the amino acid acceptor stem (13–15 bp) and the anticodon loop (7–9 bp). Their D-loops consist of 4–10 bp. The extra variable loops of 4–5 bp and the TΨC loops of 4–10 bp complete the cloverleaf structures. In most metazoan mitochondria, tRNA-Ser(AGN) generally has an unpaired DHU arm, while tRNA-Ser(UCN) has a standard cloverleaf structure [50], [51]. The Apis tRNA-Ser(AGN) and tRNA-Ser(UCN) structures conform to this interpretation.

Ribosomal RNA genes

The rrnS and rrnL genes of A. cerana were identified by sequence comparison with those of A. mellifera. The rrnS is located between tRNA-Leu(CUN) and tRNA-Val, and the rrnL is located between tRNA-Val and the A+T-rich region. The length of the rrnS and rrnL gene of A. cerana is 786 bp and 1328 bp, respectively, but relatively shorter than the genes in A. mellifera (827 bp and 1371 bp respectively) (Table 2). The A+T contents of the rrnS and rrnL for A. cerana are 81.6% and 83.1%, respectively (Table 3). Sequence identity in the rrnS and rrnL genes is 53.9% and 86.0% between A. cerana and A. mellifera, respectively.

A+T-rich region

The A+T-rich region is believed to be involved in the regulation of transcription and control of DNA replication, characterized by five elements: (1) a polyT stretch at the 5′ end of the A+T-rich region, which may be involved in the control of transcription and/or replication initiation; (2) a [TA(A)]n-like stretch following the polyT stretch; (3) a stem and loop structure, which may be associated with the second strand-replication origin; (4) a TATA motif and a G (A)nT motif flanking the stem and loop structure, and (5) a G+A rich sequence downstream of the stem and loop structure [52]. The A+T-rich region is well known for the initiation of replication in both vertebrates and invertebrates, and the reduced G+C content is one of the most outstanding features of this region [9]. The largest non-coding region (562 bp) in the A. cerana mt genome is flanked by rrnS and tRNA-Ser(AGN), it is highly enriched in AT (95.5%). In this region, we found some of the elements commonly present in most insects' A+T-rich region, such as PolyT stretch, [TA(A)]n-like stretch, stable stem-loop secondary structures (not shown), and TATA motif, which may function in the initiation of genome replication. Based on these features, it possibly functions as a control region. The AT-region of Diadegma semiclausum has the highest percentage of A+T content (96.4%) and Orussus occidentalis has the lowest A+T content (79.7%) among Hymenopteran sequenced to date (Table 6).
Table 6

Comparison of A+T content (%) of the AT region, protein-coding and rRNA genes of mitochondrial genomes of Hymenopteran.

Gene/RegionAEACAMBHBICCCVDSESPEAMBOOPSPPHSAVE
AT-region89.995.996.0/96.088.3/96.488.185.6/79.778.9///
atp886.187.089.392.692.682.893.089.591.769.191.182.486.593.790.182.7
atp680.483.884.785.085.477.386.484.184.673.986.674.877.683.184.376.3
nad180.683.583.082.784.779.885.385.282.374.686.475.075.484.779.079.0
nad280.986.486.589.089.885.891.289.288.980.591.275.380.489.988.883.7
nad380.584.886.488.390.078.388.785.087.374.189.075.078.784.583.579.9
nad482.185.186.585.585.981.387.585.886.077.889.277.077.484.183.780.1
nad4L83.587.185.686.188.085.088.789.188.177.590.782.581.389.187.685.0
nad580.784.985.786.987.480.588.886.086.376.388.677.477.985.784.280.6
nad684.586.586.992.392.185.192.390.290.178.793.077.382.590.388.882.7
rrnS80.481.681.485.486.680.591.288.589.475.882.881.177.184.089.380.8
rrnL82.983.184.585.385.584.689.788.087.279.786.880.179.487.388.882.9
cox172.076.075.976.477.672.777.276.177.668.879.275.370.375.375.871.7
cox275.578.980.581.082.074.983.381.382.672.083.072.173.780.580.474.0
cox375.480.483.082.684.875.683.680.280.273.984.772.172.380.577.277.0
cob75.681.080.681.082.877.082.380.181.073.382.671.173.081.278.975.7
EmtG80.684.084.985.386.882.087.287.485.277.886.776.277.084.784.080.1

AE: Abispa ephippium, AC: Apis cerana, AM: Apis mellifera, BH: Bombus hypocrita, BI: Bombus ignitus, CC: Cephus cinctus, CV: Cotesia vestalis, DS: Diadegma semiclausum, ESP: Enicospilus sp, EA: Evania appendigaster, MB: Melipona bicolor, OO: Orussus occidentalis, PSP: Philaenus spumarius, PH: Polistes humilis, SA: Spathius agrili, VE: Vanhornia eucnemidarum, EmtG: entire mitochondrial genome.

AE: Abispa ephippium, AC: Apis cerana, AM: Apis mellifera, BH: Bombus hypocrita, BI: Bombus ignitus, CC: Cephus cinctus, CV: Cotesia vestalis, DS: Diadegma semiclausum, ESP: Enicospilus sp, EA: Evania appendigaster, MB: Melipona bicolor, OO: Orussus occidentalis, PSP: Philaenus spumarius, PH: Polistes humilis, SA: Spathius agrili, VE: Vanhornia eucnemidarum, EmtG: entire mitochondrial genome.

Phylogenetic analyses

Many systematic and population genetic studies have been based on genetic markers in the mt genomes at both the nucleotide and amino acid levels [53], [54]. So far, three mt genes (rrnL, cox2, nad2) and two nuclear gene (EF1-α intron and itpr) have been used for phylogenetic study of members within the genus Apis, yielding discrepant results [3], [4]. Usage of complete mt sequences for phylogenetic analyses would be more reliable, but so far only two complete mt genomes (A. cerana and A. mellifera) are available within the genus Apis. To better understand the evolution of genome-level features in A. cerana, phylogenetic relationships among representative members of the Hymenoptera were inferred from concatenated amino acid sequences of the 13 mt protein-coding genes. With addition of the A. cerana mt genome sequences, there are now 28 Hymenopteran sequences available for analysis, and only 16 of these have the complete sequences of the 13 mt protein-coding genes. The final alignment of the amino acid sequences of 13 protein genes for the 16 taxa were subjected to the ML, MP and NJ analyses. The results revealed that A. cerana and A. mellifera were closely related with high statistical support, indicating that A. cerana and A. mellifera are sister relationship (Figure 3), supporting the taxonomic classification by the analyses of morphological and molecular data. Three major clades were recovered within Apidae: Apini (A. cerana and A. mellifera), Meliponini (M. bicolor), and Bombini (B. ignitus+B. hypocrita sapporoensis) that were strongly supported in all (ML, MP and NJ) analyses. These results were congruent with previous studies [33], [35]. Our data provide a robust support for the relationship among Hymenopteran.
Figure 3

Inferred phylogenetic relationship among the Hymenoptera species.

The analyses were conducted using maximum parsimony (MP), maximum likelihood (ML) and neighbour joining (NJ) of amino acid sequences of 13 protein-coding genes, using Triatoma dimidiata as outgroup. The numbers along branches indicate bootstrap values resulting from different analyses in the order: MP/ML/NJ.

Inferred phylogenetic relationship among the Hymenoptera species.

The analyses were conducted using maximum parsimony (MP), maximum likelihood (ML) and neighbour joining (NJ) of amino acid sequences of 13 protein-coding genes, using Triatoma dimidiata as outgroup. The numbers along branches indicate bootstrap values resulting from different analyses in the order: MP/ML/NJ. In conclusion, the mt genome of A. cerana exhibits almost the same characteristics with that of A. mellifera. Phylogenetic analyses indicated that A. cerana and A. mellifera are two separate species, supporting the taxonomic classification by genetic relatedness and phylogeny. The analyses of A. cerana mt genome have added to our knowledge on mt genomes of Hymenoptera, and provided additional genetic markers for studying population genetics, systematics and phylogeographics of honeybees.

Materials and Methods

Sample origin and DNA extraction

One A. cerana female adult was collected from the Mengla county, Yunnan province, southeastern China, and was preserved in 75% ethanol and stored at 4°C until used for DNA extraction. Total genomic DNA was extracted from its thoracic muscle tissue by treatment with sodium dodecyl sulphate/proteinase K (Merck), followed by purification using Wizard™ DNA Clean-Up System (Promega) and then eluted into 60 µl H2O according to the manufacturer's recommendations. DNA samples were stored at −20°C until further use.

Amplification and sequencing of partial cox1, nad4 and rrnL

Partial fragments of the cox1, nad4 and rrnL mtDNA genes were amplified using three sets of primers (Table 7), designed by the authors according to homological mtDNA fragments of other Hymenopteran deposited in GenBank. PCR reactions were carried out in a 25 µl reaction volume consisting of 14.25 µl sterile deionized water, 2.5 µl 10×PCR Buffer (Mg2+ free), 4.0 µl MgCl2 (25 mM), 2.0 µl dNTPs (2.5 mM each), 0.25 µl each primer (50 pmol/µl), 0.25 µl ExTaq DNA polymerase (5 U/µl, Takara) and 1.5 µl DNA template (40 ng/µl) with the following conditions: after an initial denaturation at 94°C for 5 min, then 94°C for 30 s (denaturation), 45–50°C for 30 s (annealing), 72°C for 30 s (extension) for 35 cycles, followed by 72°C for 5 min (final extension). Each amplicon (5 µL) was examined by agarose gel electrophoresis to validate amplification efficiency. Then, the partial cox1, nad4 and rrnL amplicons were sent to TaKaRa Company (Dalian, China) for sequencing from both directions by using primers used in the PCR amplifications.
Table 7

Sequences of primers used to amplify PCR fragments from the Apis cerana mitochondrial genome.

PrimersSequence (5′-3′)SourceEstimated size of PCR products
AC-cox1F ATTAGATTCTGATTGCTTCCT This study680 bp
AC-cox1R TATGTTGCTAATCATCTAAAT This study
AC-nad4F ATTAATCGTCTATTAGTTTG This study370 bp
AC-nad4R TAAAAGCTCATGTTGAAGC This study
AC-rrnLF TGAACTCAAATCATGTAAGAT This study450 bp
AC-rrnLR ACTGTACAAAGGTAGCATAAT This study
AC-Lcox1F AGAAATTTATTTTATCCAAGACCAGGAACAG This study7.1 kb
AC-Lnad4R TGAAATTAGGAGGTTATGGGATATTACGTT This study
AC-Lnad4F AACATTAACTAAAAAAAATAAACCTGAAGAT This study4.8 kb
AC-LrrnLR CCATGAAATAAATTTAAATAGCTGCAGTA This study
AC-LrrnLF TGCGTTTAACTTTTCTCTTAATTCAACATC This study5.6 kb
AC-Lcox1R TTGCTGAAGTAAAATAAGCTCGTGTATCAA This study

Long-PCR amplification and sequencing

The nucleotide sequences obtained from these three genes were then used to design A. cerana-specific primer sets for long PCR reactions. Three overlapping long PCR fragments covering the entire mt genome of A. cerana were obtained using the long PCR primers sets (Table 7). The Long-PCR reaction volume amounted 50 µl containing 28.5 µl sterile deionized water, 5.0 µl 10×LA PCR Buffer(Mg2+ free), 5.0 µl MgCl2 (25 mM), 8.0 µl dNTPs (2.5 mM each), 0.5 µl each primer (50 pmol/µl), 0.5 µl LA Taq DNA polymerase (5 U/µl, Takara) and 2 µl DNA template (40 ng/µl). Long-PCR cycling conditions used were initial denaturation step at 92°C for 2 min, followed by 30 cycles of denaturation at 92°C for 10 s, primer annealing at 50°C for 30 s and elongation at 60°C for 10 min during the first 10 cycles, and then an additional 10 s per cycle during the last 20 cycles. The final elongation step was continued at 60°C for 10 min. All amplifications were done on a T-Gradient thermocycler (Biometra, Germany). The three long-PCR fragments were sequenced using a primer-walking strategy. All sequencing was performed with ABI PRISM Big Dye terminator chemistry and analyzed on ABI 3730 automated sequencers (Applied Biosystems, USA).

Gene annotation and sequence analysis

Sequences were assembled manually and aligned against the complete mt genome sequence of A. mellifera using the computer program Clustal×1.83 [55] to identify gene boundaries. The open-reading frames and codon usage profiles of protein-coding genes were analyzed using the program MacVector 4.1.4 (Kodak, version4.0). Translation initiation and translation termination codons were identified based on comparison with the mt genome of A. mellifera. The amino acid sequences inferred for the mt genes of the A. cerana were aligned with those of A. mellifera by using Clustal×1.83. Based on pairwise alignments, amino acid identity (%) was calculated for homologous genes. Codon usage was examined based on the relationships between the nucleotide composition of codon families and amino acid occurrence, where the genetic codons are partitioned into AT rich codons, GC-rich codons and unbiased codons. For analyzing ribosomal RNA genes, putative secondary structures of 22 tRNA genes were identified using tRNAscan-SE [56], of the 22 tRNA genes, 19 were identified using tRNAscan-SE, the other 3 tRNA genes were found by eye inspection, and rRNA genes were identified by comparison with the mt genome of A. mellifera. Phylogenetic relationship among the Hymenoptera were performed using the 14 Hymenoptera species (Table 1) as ingroup, plus the mt DNA sequence of A. cerana obtained in the present study, using one Reduviidae species (Triatoma dimidiata, GenBank accession number AF301594) as the outgroup, based on amino acid sequences of 13 protein-coding genes. Amino acid sequences for each gene were individually aligned using Clustal×1.83 [55] under default setting, and then concatenated into single alignments for phylogenetic analyses. Three methods, namely neighbor joining (NJ), maximum likelihood (ML) and maximum parsimony (MP), were used for phylogenetic re-constructions. Standard unweighted MP was performed using package Phylip 3.67 [57]. NJ analysis was carried out using PAUP 4.0 Beta 10 programme [58], and ML analysis was performed using PUZZLE 4.1 under the default setting [59]. The consensus tree was obtained after bootstrap analysis, with 1000 replications for NJ and MP trees, and 100 for ML tree, with values above 50% reported. Inferred secondary structures of 22 tRNAs found in mtDNA. (DOC) Click here for additional data file.
  51 in total

1.  Characterization of 67 mitochondrial tRNA gene rearrangements in the Hymenoptera suggests that mitochondrial tRNA gene position is selectively neutral.

Authors:  Mark Dowton; Stephen L Cameron; Jessica I Dowavic; Andy D Austin; Michael F Whiting
Journal:  Mol Biol Evol       Date:  2009-04-09       Impact factor: 16.240

2.  The mitochondrial genome of Protohermes concolorus Yang et Yang 1988 (Insecta: Megaloptera: Corydalidae).

Authors:  Jimeng Hua; Ming Li; Pengzhi Dong; Qiang Xie; Wenjun Bu
Journal:  Mol Biol Rep       Date:  2008-10-24       Impact factor: 2.316

3.  Phylogenetic relationships of honey bees (Hymenoptera:Apinae:Apini) inferred from nuclear and mitochondrial DNA sequence data.

Authors:  Maria C Arias; Walter S Sheppard
Journal:  Mol Phylogenet Evol       Date:  2005-04-19       Impact factor: 4.286

4.  Mapping of nascent light and heavy strand transcripts on the physical map of HeLa cell mitochondrial DNA.

Authors:  P Cantatore; G Attardi
Journal:  Nucleic Acids Res       Date:  1980-06-25       Impact factor: 16.971

5.  The position of the Hymenoptera within the Holometabola as inferred from the mitochondrial genome of Perga condei (Hymenoptera: Symphyta: Pergidae).

Authors:  Lyda Raquel Castro; Mark Dowton
Journal:  Mol Phylogenet Evol       Date:  2005-01-04       Impact factor: 4.286

6.  Honeybee navigation: distance estimation in the third dimension.

Authors:  M Dacke; M V Srinivasan
Journal:  J Exp Biol       Date:  2007-03       Impact factor: 3.312

7.  New views on strand asymmetry in insect mitochondrial genomes.

Authors:  Shu-Jun Wei; Min Shi; Xue-Xin Chen; Michael J Sharkey; Cornelis van Achterberg; Gong-Yin Ye; Jun-Hua He
Journal:  PLoS One       Date:  2010-09-15       Impact factor: 3.240

8.  Multiple origins of advanced eusociality in bees inferred from mitochondrial DNA sequences.

Authors:  S A Cameron
Journal:  Proc Natl Acad Sci U S A       Date:  1993-09-15       Impact factor: 11.205

9.  The complete mitochondrial genome of Evania appendigaster (Hymenoptera: Evaniidae) has low A+T content and a long intergenic spacer between atp8 and atp6.

Authors:  Shu-jun Wei; Pu Tang; Li-hua Zheng; Min Shi; Xue-xin Chen
Journal:  Mol Biol Rep       Date:  2009-08-05       Impact factor: 2.316

10.  The complete mitochondrial genome sequence of the hydrothermal vent galatheid crab Shinkaia crosnieri (Crustacea: Decapoda: Anomura): a novel arrangement and incomplete tRNA suite.

Authors:  Jin-Shu Yang; Hiromichi Nagasawa; Yoshihiro Fujiwara; Shinji Tsuchida; Wei-Jun Yang
Journal:  BMC Genomics       Date:  2008-05-30       Impact factor: 3.969

View more
  13 in total

1.  The complete mitochondrial genomes of two ghost moths, Thitarodes renzhiensis and Thitarodes yunnanensis: the ancestral gene arrangement in Lepidoptera.

Authors:  Yong-Qiang Cao; Chuan Ma; Ji-Yue Chen; Da-Rong Yang
Journal:  BMC Genomics       Date:  2012-06-22       Impact factor: 3.969

2.  Uncovering the novel characteristics of Asian honey bee, Apis cerana, by whole genome sequencing.

Authors:  Doori Park; Je Won Jung; Beom-Soon Choi; Murukarthick Jayakodi; Jeongsoo Lee; Jongsung Lim; Yeisoo Yu; Yong-Soo Choi; Myeong-Lyeol Lee; Yoonseong Park; Ik-Young Choi; Tae-Jin Yang; Owain R Edwards; Gyoungju Nah; Hyung Wook Kwon
Journal:  BMC Genomics       Date:  2015-01-02       Impact factor: 3.969

3.  Patterns of positive selection in seven ant genomes.

Authors:  Julien Roux; Eyal Privman; Sébastien Moretti; Josephine T Daub; Marc Robinson-Rechavi; Laurent Keller
Journal:  Mol Biol Evol       Date:  2014-04-29       Impact factor: 16.240

4.  The complete mitochondrial genome of the cavity-nesting honeybee, Apis koschevnikovi (Insecta: Hymenoptera: Apidae).

Authors:  Takeshi Wakamiya; Salim Tingek; Hisashi Okuyama; Takuya Kiyoshi; Jun-Ichi Takahashi
Journal:  Mitochondrial DNA B Resour       Date:  2017-01-17       Impact factor: 0.658

5.  The mitochondrial genome of Vulgichneumon sp. (Hymenoptera: Ichneumonidae).

Authors:  Yimeng Chen; Yiying Chen; Fengqin Cao; Zhifu Cui
Journal:  Mitochondrial DNA B Resour       Date:  2019-09-06       Impact factor: 0.658

6.  The complete mitochondrial genome of Galba pervia (Gastropoda: Mollusca), an intermediate host snail of Fasciola spp.

Authors:  Guo-Hua Liu; Shu-Yan Wang; Wei-Yi Huang; Guang-Hui Zhao; Shu-Jun Wei; Hui-Qun Song; Min-Jun Xu; Rui-Qing Lin; Dong-Hui Zhou; Xing-Quan Zhu
Journal:  PLoS One       Date:  2012-07-26       Impact factor: 3.240

7.  Complete mitochondrial genome sequence data provides genetic evidence that the brown dog tick Rhipicephalus sanguineus (Acari: Ixodidae) represents a species complex.

Authors:  Guo-Hua Liu; Fen Chen; Yi-Zhou Chen; Hui-Qun Song; Rui-Qing Lin; Dong-Hui Zhou; Xing-Quan Zhu
Journal:  Int J Biol Sci       Date:  2013-04-22       Impact factor: 6.580

8.  The complete mitochondrial genome of Meloidogyne graminicola (Tylenchina): a unique gene arrangement and its phylogenetic implications.

Authors:  Longhua Sun; Kan Zhuo; Borong Lin; Honghong Wang; Jinling Liao
Journal:  PLoS One       Date:  2014-06-03       Impact factor: 3.240

9.  The mitochondrial genome of the leaf-cutter ant Atta laevigata: a mitogenome with a large number of intergenic spacers.

Authors:  Cynara de Melo Rodovalho; Mariana Lúcio Lyra; Milene Ferro; Maurício Bacci
Journal:  PLoS One       Date:  2014-05-14       Impact factor: 3.240

10.  Population structure of honey bees in the Carpathian Basin (Hungary) confirms introgression from surrounding subspecies.

Authors:  Erika Péntek-Zakar; Andrzej Oleksa; Tomasz Borowik; Szilvia Kusza
Journal:  Ecol Evol       Date:  2015-11-04       Impact factor: 2.912

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.