Literature DB >> 33187054

Characterization of the Complete Mitochondrial Genomes from Two Nitidulid Pests with Phylogenetic Implications.

Xiaoxiao Chen1, Qing Song1, Min Huang1.   

Abstract

The complete mitochondrial genomes of Xenostrongylusvariegatus and Epuraea sp. were sequenced and analyzed. The total genome lengths are 17,657 and 16,641 bp, with an A+T content of 77.2% and 76.4%, respectively. Each mitochondrial genome consists of 37 coding genes and a non-coding (AT-rich) region. All protein-coding genes (PCGs) start with the standard start codon, ATN, and end with complete stop codons, TAA and TAG, or an incomplete stop codon, T. All tRNAs can be folded into the typical clover-leaf secondary structure, with the exception of trnS1 in both species with a reduced dihydrouridine (DHU) arm. The AT-rich region has tandem repeats differing in both number and length. Genetic distance and Ka/Ks analyses show that nad6 has a higher variability and more rapid evolutionary rate than other PCGs. Both maximum likelihood and Bayesian inference phylogenetic analyses based on 13 PCGs and 2 ribosome DNAs (rDNAs) agree with the previous phylogenies in supporting the Nitidulidae monophyly and the sister-group relationship of Kateretidae + (Monotomidae + Nitidulidae).

Entities:  

Keywords:  Cucujoidea; Nitidulidae; mitochondrial genome; phylogeny

Year:  2020        PMID: 33187054      PMCID: PMC7697951          DOI: 10.3390/insects11110779

Source DB:  PubMed          Journal:  Insects        ISSN: 2075-4450            Impact factor:   2.769


1. Introduction

Nitidulidae is the largest group within the Cucujoidea (Coleoptera, Polyphaga), containing 350 genera in ten subfamilies, with nearly 4500 species worldwide [1,2]. Members of Nitidulidae inhabit a wide range of habitats in the Holarctic, Oriental, and Afrotopical Regions [3,4]. Many nitidulid species are pests of grain and other cash crops, seriously impacting plant pollination and seed production, and also spreading fungal pathogens [5,6,7,8,9,10,11,12,13,14,15,16,17,18]. Xenostrongylus variegatus Fairmaire, 1891, and Epuraea sp., the two species treated here, are also important pests of oilseed rape [19,20] and beehives respectively, with both widely distributed across China. Nearly all morphological and molecular data analyzed to date support the monophyly of Nitidulidae [1,21,22], except Tang’s analysis nesting Nitidulidae within Erotylidae based on mitochondrial genomes [23], and Bocak’s analysis nesting Passandridae within Nitidulidae [24]. However, Tang’s and Bocak’s analyses did not specifically focus on Nitidulidae and included very few species of Nitidulidae, so the results were not conclusive. The phylogenetic relationship of Nitidulidae to other cucujoid families also remains unclear. Most morphological data support the sister relationship of Nitidulidae + Kateretidae [22,25,26,27], and this result is also supported by certain studies based on gene fragments, such as Cline et al. [21], based on seven loci (12S, 16S, 18S, 28S, COI, COII, and H3), and Robertson et al. [2], based on eight loci (18S, 28S, H3, CAD, 12S, 16S, COI, and COII). The sister-group relationship of (Nitidulidae + Kateretidae) with Monotomidae was also supported by Bocak et al.’s [24] study based on four loci (18S, 28S, rrnL, and COI). Nevertheless, Hunt [28] suggested that Nitidulidae is closer to Monotomidae than to Kateretidae. Leschen noted that even though Nitidulidae and Monotomidae share an apparent morphological apomorphy, i.e., abdominal tergite VII exposed in dorsal view and tergite VIII in males with sides curved ventrally forming a genital capsule, their sister relationship is still doubtful [25]. So, further phylogenetic studies are needed in order to clarify the relationships between Nitidulidae and related families of Cucujoidea. So far, only five complete nitidulid mitochondrial genomes (Epuraea guttata (Olivier, 1811), Carpophilus dimidiatus (Fabricius, 1792), Carpophilus pilosellus (Motschulsky, 1858), Aethina tumida (Murray, 1867), and Nitidulidae sp.) have been published in GenBank. In this study, we present the mitochondrial genomes of two additional nitidulid species, Xenostrongylus variegatus and Epuraea sp., annotating and analyzing their structures in detail. We reconstruct the phylogenetic relationships of Nitidulidae and related families of Cucujoidea based on 13 protein-coding genes (PCGs) and 2 rRNAs of 17 taxa, including three outgroups and fourteen ingroups of insects. The purpose of this study is to improve our understanding on the mitochondrial characteristics of Nitidulidae and its phylogenetic relationships with related families.

2. Materials and Methods

2.1. Materials and DNA Extraction

Xenostrongylus variegatus was collected from Xiaozhongdian, Shangri-La, Yunnan Province, China, in 2018. Epuraea sp. was collected from honeycomb in Xishuangbanna, Yunnan Province, China, in 2019. All materials were preserved in 100% ethanol and stored at −80 °C in the Entomological Museum of the Northwest A&F University. The total genomic DNA was extracted using the DNeasy DNA Extraction kit (Qiagen) after the morphological identification.

2.2. Sequence Analysis

The mitochondrial genomes of X. variegatus and Epuraea. sp. were sequenced by next-generation sequencing (NGS; Illumina HiSeq X10; 5.46 gb raw data; by Biomarker Technologies Corporation, Beijing, China). The raw data were preprocessed, then assembled and annotated with the default parameters used in the mitochondrial genomes of C. dimidiatus and C. pilosellus as the reference sequences, respectively. Default parameters were performed by Geneious 8.1.3 (Biomatters, Auckland, New Zealand) [29]. The 13 PCGs were identified by finding open reading frames (ORFs) and were translated into amino acids according to the invertebrate mitochondrial genetic code. The positions and secondary structures of 22 tRNAs were predicted by the MITOS Web Server (http://mitos.bioinf.uni-leipzig.de/index.py) [30]. Then, we manually edited the clover-leaf secondary structure with Adobe Illustrator CS5 according to the predicted structures. Two rRNAs and the AT-rich region were identified by the location of adjacent genes and through comparison with other reported homologous sequences of members of Nitidulidae. Mitogenomic circular maps were produced using CGView Server (http://stothard.afns.ualberta.ca/cgview_server/) [31]. The base composition, component skew, and codon usage of the PCGs and relative synonymous codon usage (RSCU) were analyzed using PhyloSuite v1.2.1 [32]. Tandem repeats of the control region were established by the Tandem Repeats Finder Online server (http://tandem.bu.edu/trf/trf.html) [33]. A sliding window of 200 bp was used to estimate the nucleotide diversity (Pi) of the PCGs at a step size of 20 bp by DnaSP V5 in order to evaluate the Pi value of the PCGs among seven nitidulid mitochondrial genomes [34]. The ratio of the number of nonsynonymous substitutions per nonsynonymous site (Ka) to the number of synonymous substitutions per synonymous site (Ks) of 13 PCGs for seven species of Nitidulidae was estimated using DnaSP V5 [34]. The genetic distances between seven species of Nitidulidae based on each PCG were estimated with Mega 6 [35] with the Kimura-2-parameter model.

2.3. Phylogenetic Analysis

The phylogenetic analyses were performed using 13 PCGs and 2 rRNAs from 17 species of Cucujoidea (Table 1). All of the reported complete and partial mitochondrial genomes in this study were downloaded from GenBank. Standardization of data and extraction of information was conducted by PhyloSuite v1.2.1. The nucleotide sequences of the PCGs were aligned in batches with MAFFT using codon alignment and the G-INS-i (accurate) strategy. rRNAs were aligned with MAFFT version 7 online services using the Q- INS-i strategy (https://mafft.cbrc.jp/alignment/server/). Gaps and ambiguously aligned sites in the alignment were removed using Gblocks, and then by concatenating each gene into PhyloSuite. The optimal nucleotide replacement model and segmentation strategy were recommended by PartitionFinder. The best fitting models (Table S1) were selected for each partition using the “greedy” search algorithm, and were “linked” to estimated branch lengths using the Bayesian information criterion (BIC) [32].
Table 1

Summary of the mitogenomic sequence information used in the present study.

FamilySpeciesAccession NumberReference
Sphindidae Aspidiphorus orbiculatus KT780625Unpublished
ErotylidaeLanguriidae sp.MG193464[36]
Erotylinae sp1MH836601[37]
Erotylinae sp2MH789736[37]
Monotomidae Monotoma quadricollis KX035132Unpublished
Rhizophagus aeneus KX087340Unpublished
Kateretidae Brachypterolus vestitus KX087245Unpublished
NitidulidaeNitidulidae sp MH789742[37]
Aethina tumida NC_036104[38]
Xenostrongylus variegatus MW044620This study
Epuraea guttata KX087289Unpublished
Carpophilus dimidiatus NC_046036[39]
Carpophilus pilosellus MN604383[39]
Epuraea sp.MW044619This study
SilvanidaeUleiota sp.KX035149Unpublished
Cucujidae Cucujus clavipes GU176341[40]
Cucujus haematodes KX087268Unpublished
Maximum likelihood (ML) and Bayesian inference (BI) were used for the phylogenetic analyses based on four 17-taxa datasets, namely: (1) the PCG123 matrix, including all three codon positions of protein-coding genes; (2) the PCG123R matrix, including all three codon positions of protein-coding genes and two rRNA-encoding genes; (3) the PCG12 matrix, the first and second codon positions of protein-coding genes; and (4) the PCG12R matrix, including the first and second codon positions of protein-coding genes and two rRNA-encoding genes. The ML phylogenetic analyses were performed using IQ-TREE V 1.6.8 [41], using an ultrafast bootstrap algorithm with 1000 replicates. The BI phylogenetic analyses were performed using MrBayes 3.2.7 [42], and 1 × 107 Markov chain Monte Carlo (MCMC) generations, sampled per 1000 generations. Convergence occurred when the average standard deviation of the split frequencies was <0.01; the first 25% of the samples were discarded as burn-in, and the remaining samples were used to generate a consensus tree and to estimate the posterior probabilities.

3. Results and Discussion

3.1. Genome Organization

The mitochondrial genomes are characterized by their asymmetric AT and GC content in the nucleotide composition. Both mitochondrial genomes show a heavy AT nucleotide bias. The AT content of the whole genome is 77.2% in X. variegatus (A = 39.4%, T = 37.8%, C = 13%, and G = 9.8%) and 76.4% in Epuraea sp. (A = 37.6%, T = 38.8%, C = 14.4%, and G = 9.3%; Table 2). Among all of the reported species of Nitidulidae, only X. variegatus shows a lower AT content in the AT-rich region than in the rDNAs. In addition, all of the reported Nitidulidae species show positive AT skews and negative GC skews in the whole genomes, expect for Epuraea sp., which has a negative AT skew (Table 3).
Table 2

Nucleotide composition of mitogenomes of X. variegatus and Epuraea sp.

RegionsSize (bp)T(U)CAGAT(%)GC(%)AT SkewGC Skew
X. variegatus
Full genome17,65737.81339.49.877.222.80.021−0.141
PCGs11,0464311.53411.57723−0.1160
1st codon position368237.210.735.117.172.327.8−0.0290.229
2nd codon position368247.317.721.613.468.931.1−0.374−0.136
3rd codon position368244.46.245.4489.810.20.012−0.207
tRNAs145438.6939.612.878.221.80.0130.174
rRNAs207942.86.638.512.181.318.7−0.0530.296
AT-rich region291040.213.534.411.974.625.4−0.078−0.064
Epuraea sp.
Full genome16,64138.814.437.69.376.423.7−0.015−0.216
PCGs11,09742.912.93212.274.925.1−0.146−0.026
1st codon position369936.711.934.41771.128.9−0.0320.179
2nd codon position369946.718.321.213.767.932−0.376−0.143
3rd codon position369945.28.540.3685.514.5−0.057−0.175
tRNAs144536.310.939.413.475.724.30.0410.103
rRNAs208141.96.936.914.478.821.3−0.0630.353
AT-rich region198453.812.128.85.282.617.3−0.302−0.397
Table 3

Nucleotide composition of the Nitidulidae mitochondrial genomes: E. guttata (E1), Epuraea sp. (E2), C. dimidiatus (C1), C. pilosellus (C2), Nitidulidae sp. (N), A. tumida (A), and X. variegatus (X).

SpeciesWhole GenomeAT SkewGC SkewPCGstRNAsrRNAsA + T-Rich Region
Size (bp)AT (%)Size (bp)AT (%)Size (bp)AT (%)Size (bp)AT (%)Size (bp)AT (%)
E1 16,02176.50.043−0.1911,07375.7145175.7208176.4--
E2 16,64176.4−0.015−0.21611,09774.9144575.8208178.8198482.6
C1 15,71775.20.038−0.20211,09474.5144174.9206175105783.6
C2 15,68677.20.027−0.17711,10376.5144276.5207977.594486.7
N 17,43278.40.036−0.18311,09176.3144378.2207380.3--
A 16,57676.90.034−0.22311,10975.4146077.2206479.5--
X 17,65777.20.021−0.14111,04677145478.2207981.3291074.6
The lengths of the complete mitochondrial genome are 17,657 bp in X. variegatus and 16,641 bp in Epuraea sp., the length of the former is longer than that reported for Nitidulidae (Table 3) because of the differences in the number of AT-repeats in the AT-rich region. The mitochondrial genomes of both species consist of closed, circular, double-stranded DNA molecules (Figure 1 and Figure 2), and contain 37 genes, including 13 PCGs, 22 tRNAs, 2 rDNAs, and a AT-rich region. While four PCGs (nad1, nad4, nad4L, and nad5), eight tRNAs (Q, C, Y, F, H, P, L1. and V), and two rRNAs (lrRNA and srRNA) are encoded in the heavy strand, the others are encoded in the light strand (Table 4). The sequence of genes is consistent with the reference mitochondrial genome arrangement and with other Nitidulidae.
Figure 1

Mitochondrial map of X. variegatus.

Figure 2

Mitochondrial map of Epuraea sp.

Table 4

Mitogenomic organization of X. variegatus and Epuraea sp.

Position Size (bp)Intergenic NucleotidesCodon Strand
FromTo StartStop
X. variegatus/E. sp.
trnI 1/164/6364/63 +/+
trnQ 62/61130/12969/69−3/−3 −/−
trnM 131/129199/19769/69/−1 +/+
nad2 200/1981174/1205975/1008 ATT/ATTTAA/TAA+/+
trnW 1202/12141268/128067/6727/8 +/+
trnC 1383/12841446/134564/62114/3 −/−
trnY 1448/13461510/141063/651/ −/−
cox1 1503/14033042/29421540/1540−8/−8ATT/ATCT/T+/+
trnL2 3043/29433107/300765/65 +/+
cox2 3108/30083780/3695673/688 ATT/ATTT/T+/+
trnK 3781/36963851/376571/70 +/+
trnD 3855/37663924/383170/663/ +/+
atp8 3925/38324069/3987145/156 ATC/ATCT/TAG+/+
atp6 4076/39814747/4655672/6756/−7ATA/ATGTAA/TAA+/+
cox3 4747/46555533/5438787/784−1/−1ATG/ATGT/TAG+/+
trnG 5534/54395597/550164/63 +/+
nad3 5604/55025951/5855348/3546/ATT/ATTTAG/T+/+
trnA 5950/58546015/591766/64−2/−2 +/+
trnR 6015/59186077/597963/62−1/ +/+
trnN 6077/59806142/604666/67−1/ +/+
trnS1 6143/60476209/611367/67 +/+
trnE 6210/61146273/617664/63 +/+
trnF 6272/61756336/623965/65−2/−2 −/−
nad5 6337/62498053/79531717/1705 /9ATA/ATTT/TAG−/−
trnH 8051/79548114/801864/65−3/ −/−
nad4 8112/80169444/93421333/1327−3/−3ATT/ATAT/T−/−
nad4L 9435/93399722/9623288/285−10/−4ATG/ATGTAA/TAA−/−
trnT 9725/96269789/968965/642/2 +/+
trnP 9790/96909854/975565/66 −/−
nad6 9859/976010,359/10,263501/5044/4ATA/ATATAA/TAA+/+
cytb 10,359/10,26311,498/11,4051140/1143−1/−1ATG/ATGTAG/TAG+/+
trnS2 11,497/11,40411,564/11,47168/68−2/−2 +/+
nad1 11,582/11,48912,514/12,421933/93317/17ATT/ATTTAG/TAG−/−
trnL1 12,534/12,44112,600/12,50567/6519/19 −/−
rrnL 12,601/12,50613,891/13,8051291/1300 −/−
trnV 13,892/13,80613,959/13,87568/70 −/−
rrnS 13,960/13,87714,747/14,657788/781/1 −/−
AT-rich region14,748/14,65817,657/16,6412910/1984 +/+
Apart from the AT-rich region, there are 197 bp spacers across nine gene intervals ranging from 1–114 bp in X. variegatus, and 62 bp spacers across eight gene intervals ranging from 1–19 bp in Epuraea sp. The longest intergenic spacer is located between trnW and trnC in X. variegatus, and nad1 and trnL1 in Epuraea sp, while in A. tumida the longest is 18 bp between trnL2 and cox2. In C. dimidiatus, C. pilosellus, and E. guttata, there are 24 bp, 107 bp, and 79 bp intergenic spacers between trnW and trnC, respectively. Gene overlaps are found at the junctions of 11 pairs of genes ranging from 1–10 bp in X. variegatus and 1–9 bp in Epuraea sp., with the longest overlap located between nad4 and trnT in X. variegatus, trnY, and cox1 in Epuraea sp, A. tumida, C. dimidiatus, C. pilosellus, and E. guttata.

3.2. Protein-coding Genes (PCGs)

The total length of all 13 PCGs of X. variegatus is 11,046 bp and of Epuraea sp. is 11,097 bp, accounting for 62.56% and 66.68% of the total length of their mitochondrial genomes, respectively (Table 2). The start and stop codons were determined based on the reference sequences. Most PCGs start with a typical start codon ATN (ATC, ATG, ATA, and ATT), except for nad1, which starts with the unusual start codon TTG in A. tumida, E. guttata, and an unidentified Nitidulidae sp. Correspondingly, the PCGs ended with the stop codons TAA and TAG, whereas an incomplete stop codon, T, was found in cox1, cox2, cox3, atp8, nad4, and nad5 in Nitidulidae (Table 5). Such incomplete stop codons are common in insects and may result from post-transcriptional polyadenylation [43]. Furthermore, the stop codon TAA is used more frequently than TAG, and all seven Nitidulidae have cox1, at least, ending in an incomplete stop codon T.
Table 5

Start and stop codons of the mitochondrial genomes: E. guttata (E1), Epuraea sp. (E2), C. dimidiatus (C1), C. pilosellus (C2), Nitidulidae sp. (N), A. tumida (A), and X. variegatus (X).

GeneStart Codon/Stop Codon
E1 E2 C1 C2 N A X
nad2 ATT/TAAATT/TAAATT/TAAATT/TAAATT/TATT/TAAATT/TAA
cox1 ATT/TATC/TATT/TATT/TATT/TATA/TATT/T
cox2 ATA/TATT/TATC/TATT/TATT/TAGATT/TATT/T
atp8 ATT/TAGATC/TAGATC/TAGATC/TAGATG/TAAATT/TAGATC/T
atp6 ATG/TAAATG/TAAATG/TAAATA/TAAATG/TAAATA/TAAATA/TAA
cox3 ATG/TATG/TATG/TATG/TATT/TAAATG/TATG/T
nad3 ATA/TAGATT/TAGATT/TAGATT/TAGATT/TAAATA/TAGATT/TAG
nad5 ATA/TATT/TATT/TATT/TTAG/TAAATA/TATA/T
nad4 ATG/TAAATA/TATG/TATG/TATG/TAAATG/TATT/T
nad4L ATG/TAAATG/TAAATG/TAAATG/TAAATT/TAAATG/TAAATG/TAA
nad6 ATC/TAAATA/TAAATA/TAAATA/TAAATG/TAGATA/TAAATA/TAA
Cytb ATA/TAGATG/TAGATG/TAGATG/TAGTTG/TAGATG/TAAATG/TAG
nad1 AAC/ATCATT/TAGATA/TAGATG/TAGATT/TAATTG/TAGATT/TAG
The total AT ratios of 13 PCGs are 77.0% in X. variegatus (A = 34.0%, T = 43.0%, C = 13.0%, and G = 9.8%) and 74.9% in Epuraea sp. (A = 32.0%, T = 42.9%, C = 12.9%, and G = 12.2%). Both species show negative AT skews (−0.116 in X. variegatus and −0.146 in Epuraea sp.). X. variegatus shows no CG skew (0) and Epuraea sp. shows a negative CG skew (−0.026) (Table 2). The first codon position AT content (72.3% in X. variegatus and 71.1% in Epuraea sp.) is higher than that of the second codon position (68.9% in X. variegatus and 67.9% Epuraea sp.) and is much lower than that of the third codon position (89.8% in X. variegatus and 85.5% in Epuraea sp.). The relative synonymous codon usage (RSCU) is shown in Figure 3. UUA (Leu), AUU (Ile), UUU (Phe), UCU (Ser 2), and AUA (Met) are the most frequently used codons in both species, which is highly consistent with the previously reported frequencies in Nitidulidae. As indicated by these results, nearly all of them consist of A and U, and contribute to the high AT content of PCGs.
Figure 3

Relative synonymous codon usage (RSCU) of the mitochondrial DNA protein-coding genes (PCGs) of seven nitidulid species.

3.3. Transfer and Ribosomal RNAs

The total length of all 22 tRNAs of X. variegatus is 1454 bp and of Epuraea sp. is 1445 bp, which is within the previously reported range for Nitidulidae, accounting for 8.23% and 8.68% of the total length of their mitochondrial genomes, respectively. The total AT percent is 78.2% (A = 39.6%, T = 38.6%, C = 9%, and G = 12.8%) for X. variegatus and 75.7% (A = 39.4%, T = 36.3%, C = 10.9%, and G = 13.4%) for Epuraea sp. Both species show positive AT skews (0.013 in X. variegatus and 0.041 in Epuraea sp.) and CG skews (0.174 in X. variegatus and 0.103 in Epuraea sp.) (Table 2). The length of each tRNA is between 63 bp (trnY and trnR) and 71 bp (trnK) in X. variegatus and between 62 bp (trnC and trnR) and 70 bp (trnK) in Epuraea sp. (Table 4). Nearly all tRNAs can be folded into the typical clover-leaf structure, except for trnS1, which in both shows a reduced dihydrouridine (DHU) arm. The size of the anticodon (AC) arm and the amino acid acceptor (AA) arm are consistently 5 bp and 7 bp, respectively. The TΨC arm and DHU arm are variable: trnW, trnF, trnH, and trnT in both species; trnG in X. variegatus; and trnR in Epuraea sp. all lack the TΨC-loop. The trnS1 in both species lack the dihydorouridine (DHU) arm, which has been reported in other metazoans [44,45,46,47,48,49]. The length of the AC-loop is normally seven nucleotides, except for trnA in X. variegatus, which is six nucleotides. The trnS1 and trnA in Epuraea sp. have five nucleotides and the DHU loop ranges from 2–4 bp. The TΨC loop ranges from 3–5 bp in both species. The DHU-loop ranges from 3–9 nucleotides in Epuraea sp. and 3–8 nucleotides in X. variegatus. There are a total of 27 mismatched base pairs in X. variegatus of six types (U-U, U-G, A-G, A-C, U-C, and A-A) and 33 mismatched base pairs of six types (U-U, U-G, C-C, A-G, A-C, and U-C) found in Epuraea sp (Figure 4 and Figure 5).
Figure 4

Inferred secondary structure for the tRNAs of X. variegatus.

Figure 5

Inferred secondary structure for the tRNAs of Epuraea sp.

The rRNAL and rRNAS are located between trnL1 and trnV, and trnV and the AT-rich region with lengths in X. variegatus of 1291 bp and 788 bp, but 1300 bp and 781 bp in Epuraea sp. The total rRNAs show a negative AT skew (−0.053 in X. variegatus and −0.063 in Epuraea sp.) and a positive CG skew (0.296 in X. variegatus and 0.353 in Epuraea sp.). The AT content in X. variegatus is 81.3% and 78.8% in Epuraea sp (Table 3). Therefore, rRNAs are highly conserved in the Nitidulidae for length, AT content, and location.

3.4. AT-rich Region

The assumed control region (the AT-rich region) is the major noncoding region in the mitochondrial genome. It is located between rrnS and trnI, and plays a regulatory role in the transcription and replication of the mtDNA [50,51,52,53,54]. The lengths of the AT-rich region of X. variegatus and Epuraea sp. are 2910 bp and 1984 bp, respectively (Figure 6). Both are longer than those previously reported for Nitidulidae. The AT contents of these regions are 74.6% and 82.6% in X. variegatus and Epuraea sp., respectively. The AT-rich regions in both species show negative AT skews (−0.078 in X. variegatus and −0.302 in Epuraea sp.) and negative CG skews (−0.064 in X. variegatus and −0.397 in Epuraea sp.). Both species have different lengths of tandem repeat, located at positions 1041 bp to 1660 bp in X. variegatus and 1368 bp to 1436 bp in Epuraea sp., respectively. Moreover, two poly-T stretches and two poly-C stretches are found near rrnS in Epuraea sp., which may be the origin of the DNA replication minor strand [51].
Figure 6

Structures of AT-rich region in mitogenomes of Epuraea sp. and X. variegatus. The dark red ellipses are the tandem repeat regions, the blue blocks indicate non-repeat regions, the green circles are the poly-T stretches, and the purple circles are poly-C stretches.

3.5. Nucleotide Analyses

The nucleotide diversity calculated for 13 PCGs of the seven Nitidulidae are shown in Figure 7. The results indicate that different genes have different nucleotide diversity values. In all PCGs, nad6 (Pi = 0.280) shows the highest nucleotide diversity values, next to nad2 (Pi = 0.255) and atp8 (Pi = 0.238). However, cox1 (Pi = 0.162) and nad1 (Pi = 0.154) show lower nucleotide diversity values and are the most conserved of the mitochondrial PCGs (Figure 7).
Figure 7

Sliding window analyses of 13 PCGs among seven nitidulid mitogenomes. The red line shows the value of nucleotide diversity (Pi) in a sliding window analysis (a sliding window of 200 bp with the step size of 20 bp); the Pi value of each gene is shown under the gene name.

Pairwise comparisons of the genetic distances show consistent results: nad6 (0.354) and nad2 (0.315) have greater distances and a faster evolution, while nad1 (0.172) and cox1 (0.184) represent shorter distances and a slower evolution. The average nonsynonymous (Ka) and synonymous (Ks) replacement rates of the 13 PCGs in seven mitochondrial genomes are estimated to be in the range of 0.096–0.481, indicating that all PCGs are under purifying selection. In addition, cox1 (0.096) exhibits the strongest purifying selection and shows the lowest evolutionary rate. In contrast, the substitution rates of nad4L (0.481) and nad6 (0.462) are much higher than in other PCGs, suggesting that they may be under a relaxed purifying selection (Figure 8). This suggests that the latter gene may be most suitable for resolving phylogenetic relationships among closely related species.
Figure 8

Genetic distance and non-synonymous (Ka) to synonymous (Ks) substitution rates of 13 PCGs among seven nitidulid species.

3.6. Phylogenetic Analysis

The phylogenetic analyses in this study were based on four datasets (PCG123, PCG123R, PCG12, and PCG12R) including 17 species of Cucujoidea. The partitioning schemes and models for the four datasets are listed in Tables S1 and S2. Eight tree topologies were constructed according to the ML and BI analysis (Figure 9 and Figures S1–S6). Although the tree topologies were not completely consistent among the analyses, all of the results support the monophyly of Nitidulidae and a sister-group relationship of Kateretidae + (Monotomidae + Nitidulidae).
Figure 9

Phylogenetic tree produced from Maximum likelihood (ML) and Bayesian inference (BI) analyses based on PCG12R. The numbers on branches are bootstrap value (BS) and Bayesian posterior probabilities (PP).

Both BI and ML methods based on four different datasets strongly support the monophyly of Nitidulidae (Nitidulinae + (Carpophilinae + Epuraeinae)), which is consistent with previous studies of Cline and Lee [1,21,25]. In the present study, Kateretidae consistently forms a sister-group with Monotomidae + Nitidulidae, forming a monophyletic clade with moderate support (bootstrap value (BS) = 70 and Bayesian posterior probabilities (PP) = 1). The sister relationship of Nitidulidae to Monotomidae is supported by high posterior probabilities in BI trees (PP = 0.993). This result is consistent with that of Hunt [28], but contradicts most previous phylogenetic analyses based on morphological characters [25,26,27] and gene fragments [1,2,21], which all support the Nitidulidae sister to Kateretidae. Considering that only a few taxa are included in this study, more species need to be sequenced and the mitochondrial data need to be combined with data from nuclear genes and morphology in order to provide a more robust phylogeny of Nitidulidae and the related families.

4. Conclusions

New complete mitochondrial genomes of two nitidulid species, X. variegatus and Epuraea sp., are provided. Comparative analyses of the available Nitidulidae mitochondrial genomes show that they are highly conserved in terms of their genome size, base content and composition, codon usage, and secondary structures of tRNAs. The results of the phylogenetic analyses confirm the monophyly of Nitidulidae and support the sister relationship of Kateretidae + (Monotomidae + Nitidulidae). This indicates that mitochondrial data can help resolve phylogenetic relationships at different levels in the taxonomic hierarchy. Although some differences between the present results and previously published phylogenies of this group of beetles may be due to differences in the taxon sampling and phylogenetic analysis methods, the present study indicates that mitochondrial genome sequencing can contribute to an improved understanding of the phylogenetic relationships among and within the Cucujoidea.
  25 in total

Review 1.  Animal mitochondrial genomes.

Authors:  J L Boore
Journal:  Nucleic Acids Res       Date:  1999-04-15       Impact factor: 16.971

2.  DnaSP v5: a software for comprehensive analysis of DNA polymorphism data.

Authors:  P Librado; J Rozas
Journal:  Bioinformatics       Date:  2009-04-03       Impact factor: 6.937

3.  MEGA6: Molecular Evolutionary Genetics Analysis version 6.0.

Authors:  Koichiro Tamura; Glen Stecher; Daniel Peterson; Alan Filipski; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2013-10-16       Impact factor: 16.240

4.  Tandem repeats finder: a program to analyze DNA sequences.

Authors:  G Benson
Journal:  Nucleic Acids Res       Date:  1999-01-15       Impact factor: 16.971

5.  The contribution of mitochondrial metagenomics to large-scale data mining and phylogenetic analysis of Coleoptera.

Authors:  Benjamin Linard; Alex Crampton-Platt; Jerome Moriniere; Martijn J T N Timmermans; Carmelo Andújar; Paula Arribas; Kirsten E Miller; Julia Lipecki; Emeline Favreau; Amie Hunter; Carola Gómez-Rodríguez; Christopher Barton; Ruie Nie; Conrad P D T Gillett; Thijmen Breeschoten; Ladislav Bocak; Alfried P Vogler
Journal:  Mol Phylogenet Evol       Date:  2018-07-25       Impact factor: 4.286

6.  PhyloSuite: An integrated and scalable desktop platform for streamlined molecular sequence data management and evolutionary phylogenetics studies.

Authors:  Dong Zhang; Fangluan Gao; Ivan Jakovlić; Hong Zou; Jin Zhang; Wen X Li; Gui T Wang
Journal:  Mol Ecol Resour       Date:  2019-11-06       Impact factor: 7.090

7.  Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data.

Authors:  Matthew Kearse; Richard Moir; Amy Wilson; Steven Stones-Havas; Matthew Cheung; Shane Sturrock; Simon Buxton; Alex Cooper; Sidney Markowitz; Chris Duran; Tobias Thierer; Bruce Ashton; Peter Meintjes; Alexei Drummond
Journal:  Bioinformatics       Date:  2012-04-27       Impact factor: 6.937

8.  The first mitochondrial genome for caddisfly (insecta: Trichoptera) with phylogenetic implications.

Authors:  Yuyu Wang; Xingyue Liu; Ding Yang
Journal:  Int J Biol Sci       Date:  2013-12-13       Impact factor: 6.580

9.  Complete Mitochondrial Genome Sequence of Aethina tumida (Coleoptera: Nitidulidae), a Beekeeping Pest.

Authors:  Véronique Duquesne; Aurélie Delcont; Anthéa Huleux; Véronique Beven; Fabrice Touzain; Magali Ribière-Chabert
Journal:  Genome Announc       Date:  2017-11-02

10.  The complete mitochondrial genomes of two sibling species of camellia weevils (Coleoptera: Curculionidae) and patterns of Curculionini speciation.

Authors:  Shou-Ke Zhang; Jin-Ping Shu; Yang-Dong Wang; Ya-Ning Liu; Han Peng; Wei Zhang; Hao-Jie Wang
Journal:  Sci Rep       Date:  2019-03-04       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.