Literature DB >> 26810560

The Complete Mitochondrial Genome of Brachmia macroscopa (Lepidoptera: Gelechiidae) and Its Related Phylogenetic Analysis.

Li Ma1, Wan-Wei Dong2, Guo-Fang Jiang2, Xing Wang3.   

Abstract

The sweet potato leaf folder, Brachmia macroscopa, is an important pest in China. The complete mitogenome, which consists of 13 protein-coding genes (PCGs), 22 transfer RNA genes, two ribosomal RNA genes, and an A + T-rich region, was sequenced and found to be 15,394 bp in length (GeneBank no. KT354968). The gene order and orientation of the B. macroscopa mitogenome were similar to those of other sequenced lepidopteran species. All of the PCGs started with ATN as the canonical start codon except for cox1, which started with CGA. In regard to stop codons, most PCGs stopped at TAA except for cox2, which stopped at TA, and nad4, which stopped at a single T. Thirteen PCGs of the available species (33 taxa) were used to demonstrate phylogenetic relationships. The ditrysian cluster was supported as a monophyletic clade at high levels by using maximum likelihood and Bayesian methods. The apoditrysian group, covering the Gelechioidea, formed a monophyletic clade with a bootstrap value of 88% and a posterior probability of 1.00. The superfamily Gelechioidea was supported as a monophyletic lineage by a posterior probability of 1.00.
© The Author 2016. Published by Oxford University Press on behalf of the Entomological Society of America.

Entities:  

Keywords:  Brachmia macroscopa; Gelechioidea; mitogenome; phylogeny

Mesh:

Substances:

Year:  2016        PMID: 26810560      PMCID: PMC4725263          DOI: 10.1093/jisesa/iev157

Source DB:  PubMed          Journal:  J Insect Sci        ISSN: 1536-2442            Impact factor:   1.857


As a well-known leaf-eating pest, Brachmia macroscopa Meyrick, belonging to the family Gelechiidae in the superfamily Gelechioidea, occurs in many districts of China, including Shanghai, Zhejiang, Hunan, and Hainan (Wen and Wang 2010), and damages crops by feeding on large amounts of mesophyll (Wang and Tan 2011). This pest is widely distributed outside of China, in India, the Philippines, Burma, Vietnam, Korea, and Japan (Wang and Tan 2011). This pest has already led to severe crop failures and has brought great economic losses for farmers (Huang and Li 2013). In severe cases, the rate of crop damage has reached 60–85% in some fields (Wen and Wang 2010). However, its genetic characteristics have rarely been reported. To improve the management of B. macroscopa, it is important to know more information about this pest, including its genetic characteristics and phylogenetic position. Information can be inferred from the mitochondrial genome, both for phylogenetic analysis and evolutionary biology, and it can help us to understand an insects’ phylogenetic position. Compared with individual genes, the complete mitochondrial genome can be more informative, and can also provide more information on genome level characteristics; for instance, gene arrangement, gene content, base composition genetic codon variation, and transfer RNA (tRNA) and ribosomal RNA (rRNA) gene secondary structures. Lepidopteran mitochondrial DNA (mtDNA) is typically a circular molecule encoding a set of 13 protein-coding genes (PCGs), 22 tRNAgenes, and two rRNA genes (Wolstenholme 1992, Boore 1999). A major non-coding element, called the control (A + T-rich) region, is always present (Lewis et al. 1995, Zhang et al. 1995, Inohira et al. 1997, Shao et al. 2001), and this is the site of gene replication and the initiation of genome transcription (Boore 1999, Taanman 1999). The mtDNA datasets have become the most helpful markers in phylogenetics, phylogeography, and genetic population studies (Avise 2000, Cameron et al. 2007). Gelechioidea, consisting of 1,425 genera and 16,250 described species worldwide, is one of the largest and the most important lepidopteran taxa (Hodges 1998). It includes some important insect pests and occupies a very important position in the evolutionary tree of Lepidoptera. The mitogenomes of Gelechioidea have rarely been studied, with only a few sequenced mitogenomes available from five gelechioid species (09-2015), including four that are incomplete: Oegoconia novimundi (KJ508036), Perimede sp. (KJ508041), Ethmia eupostica (KJ508047), and Endrosis sarcitrella (KJ508037) (Timmermans et al. 2014), and a recently reported completely sequenced species, Promalactis suzukiella (Park et al. 2014). It seems that research on the phylogeny of Gelechioidea had always had some divergence or some unresolved points (Passoa 1995; Kaila 2004, Kaila et al. 2011; Bucheli and Wenzel 2005, Regier et al. 2009, Mutanen et al. 2010). In this article, the complete mitochondrial genome of B. macroscopa was first sequenced and described in comparison with other lepidopteran insects, and the phylogenetic position of Gelechioidea was analyzed based on the mitogenomic data for the purpose of understanding whether such a complete dataset can determine deeper phylogeny among the contested lepidopterans, as well as the Gelechioidea.

Materials and Methods

Sampling and DNA Extraction

Larvae of B. macroscopa were collected from the field (an experimental field belonging to our laboratory and involving no endangered/protected species) around Hunan Agriculture University (Changsha, Hunan, China; 28°11′N, 113°4′E) and were bred in the laboratory with fresh water spinach (Ipomoea aquatica Forsk) leaves. Pupae of the second generation were collected and extracted to obtain total genomic DNA using the Wizard Genomic DNA Purification Kit (Promega, Beijing, China) according the manufacturer’s instructions.

PCR Amplification and Sequencing

Specific primers were designed using Primer Premier 5 to compare with the known sequences and fragments of lepidopteran insects (Table 1). Universal primers were used for PCR-amplified short fragments of the mitogenome of B. macroscopa (Simon et al. 1994, Lee et al. 2006, Simon et al. 2006).
Table 1.

Primers used for amplification of the mitogenome of B. macroscopa

FragmentRegionPrimer (J/N)Primer sequence (J/F) 5′→3′
F1rrns-trnQSR-J-14610cATAATAGGGTATCTAATCCTAGT
Gln-RebGCACAATAATTT TTGATATTAGATATAGTTTA
F2nad2J-60 dGGTATTTGATCAGGAATAGTAGGAA
N-735 dCCAATAAATGGGGGTAATCCTCCTA
F3nad2-cox1J-489 dGGGGGATTAAATCAAACCTC
N-C1RebGCTGTTACAATAGTATTATAA ATTTGATCATC
F4cox1-cox2C1-J-2167aTTGATTTTTCGGACATCCTGAAGT
C1-N-3649aCCGCAAATTTCTGAACATTGACCA
F5cox2-nad3C2-JdCCGCAAATTTCTGAACATTGACCA
N-5731cTTTGGATCAAACCCACATTC
F6cox3-nad5C3-J-5470cGCTGCAGCTTGATATTGACA
N5-N-7793dAATCCTAATCCATCTCAACCT
F7nad5-nad4N5-J-7572cAAAAGGAATTTGAGCTCTTTTAGT
N4-N-9153cTGAGGTTATCAACCAGAGCG
F8nad4-cytbN4-J-8941cGAAACTGGGGCTTCAACATGAGC
N-11328dGGCAAATAGGAAATATCATTC
F9cytb-nad1CB-J dCATATTCAACCCGAATGATA
N1-N-12588 dAATCGAACTCCTTTTGATTTTGC
F10nad1-rrnLJ-11876cCGAGGTAAAGTACCACGAACTCA
16S-NdATATGTACATATTGCCCGTC
F11rrnL-rrnSJ-13900dCTTGTGTATCAGAGTTTATTA
N-12SRdGTAAAAGTTCAAATAGCAAG

aPrimers modified from Simon et al. (1994) prior to this mitogenome.

bPrimers modified from Lee et al. (2006) prior to this mitogenome.

cPrimers from Simon et al. (2006) prior to this mitogenome.

dPrimers newly designed for this genome.

Primers used for amplification of the mitogenome of B. macroscopa aPrimers modified from Simon et al. (1994) prior to this mitogenome. bPrimers modified from Lee et al. (2006) prior to this mitogenome. cPrimers from Simon et al. (2006) prior to this mitogenome. dPrimers newly designed for this genome. PCR amplification was carried out in 25 μL reactions that contained 0.2 μL rTaq (TaKaRa Co., Dalian, China), 1 μL DNA, 2.5 μL 10× rTaq buffer (Mg2 + free), 2.5 μL 25 mM MgCl2, 2.0 μL dNTPs, and 0.5 μL of each primer. PCR conditions were 94°C for 5 min, 35 cycles at 94°C for 30 s, 50–59°C for 30 s, 72°C for 1–2.5 min, and a subsequent 10 min final extension at 72°C. The amplified fragments were sequenced directly by the appropriate primers using a commercial kit.

Bioinformatic Analysis

Sequences were proof-read and assembled using the program Geneious version 4.8.4 (Drummond et al. 2010). PCG boundaries were identified with the ORF finder (http://www.ncbi.nlm.nih.gov/orf/gorf.html). After completely sequencing the mt genome, it was annotated both by hand and by automated methods. For the written annotation, the method presented by Cameron (2014) was followed. The automated annotation was accomplished with MITOS (Bernt et al. 2013). Confirmation of the tRNA genes was verified using the tRNAscan-SE program (http://lowelab.ucsc.edu/tRNAscan-SE/) (Lowe and Eddy 1997). Unidentified tRNAs were compared with sequences from other species. The two genes encoding the large and small rRNA subunits (rrnL and rrnS) were confirmed based on the rRNA alignments with other species published on NCBI, and the secondary structures were determined using Mfold Web Server (http://mfold.rna.albany.edu/?q = mfold). The comparative analysis and spread correction were performed by the software Mega 6.0 (Tamura et al. 2013) to obtain the complete mitogenome of B. macroscopa (Tamura et al. 2013). The related lepidopteran sibling species were analyzed by blast searches on the NCBI database. The skews of the compositions were determined using the formulas: AT skew = [A − T]/[A + T]; GC skew = [G − C]/[G + C] (Junqueira et al. 2004).

Phylogenetic Analysis

Along with the B. macroscopa mitochondrial genome, 29 available lepidopteran species (Son and Kim 2011, Gong et al. 2012) and four hepialoid mitogenomes (Napialus hunanensis, Thitarodes pui, Ahamus yunnanensis, and Thitarodes renzhiensis) (Cao et al. 2012, Yi et al. 2016a, 2016b) were used in the phylogenetic analysis (Table 2). The nucleic acid regions and amino acid sequences from all 13 PCGs were aligned using Mega 6.0 (Tamura et al. 2013). Gblock 0.91b with default settings was used with conserved regions of the putative amino acids (Castresana, 2000).
Table 2.

Source and information for the phylogenetic analysis

SuperfamilyFamilySpeciesAccession numberLength (bp)References
UrodoideaUrodidaeUrodus decensKJ50806215,279Timmermans et al. (2014)
YponomeutoideaPlutellidaePlutella xylostellaNC_02532216,014Dai et al. (2016)
YponomeutidaePrays oleaeNC_02594816,499van Asch et al. (2016)
LyonetiidaeL. malifoliellaNC_01854715,646Wu et al. (2012)
TineoideaTineidaeTineola bisselliellaKJ50804515,661Timmermans et al. (2014)
ZygaenoideaZygaenidaeRhodopsona rubiginosaKM24466815,248Tang et al. (2014)
TortricoideaTortricidaeAdoxophyes honmaiNC_00814115,680Lee et al. (2006)
Grapholita dimorphaNC_02458215,813Niu et al. (2016)
Adoxophyes oranaNC_02139615,343Wu et al. (2013)
Cydia pomonellaNC_02000315,253Shi et al. (2013)
Rhyacionia leptotubulaNC_01961915,877Zhu et al. (2012)
Grapholita molestaNC_01480615,717Son and Kim (2011)
HQ11641615,776Gong et al. (2012)
Spilonota lechriaspisNC_01429415,368Zhao et al. (2011)
Choristoneura longicellanaNC_01999615,759Unpublished
Acleris fimbrianaNC_01875415,933Zhao et al. (2014)
Epiphyas postvittanaKJ50805115,451Timmermans et al. (2014)
Retinia pseudotsugaicolaNC_02286515,282Unpublished
GelechioideaOecophoridaeE. sarcitrellaKJ50803715,317Timmermans et al. (2014)
P. suzukiellaKM87554215,507Park et al. (2014)
ElachistidaeE. euposticaKJ50804715,347Timmermans et al. (2014)
CosmopterigidaePerimede sp.KJ50804115,131Timmermans et al. (2014)
AutostichidaeO. novimundiKJ50803615,408Timmermans et al. (2014)
GelechiidaeB. macroscopaKT35496815,400present study
CossoideaCossidaeEogystia hippophaecolusNC_02393615,431Gong et al. (2013)
HepialoideaHepialidaeN. hunanensisNC_02442415,301Yi et al. (2016a)
T. puiNC_02353015,064Yi et al. (2016b)
A. yunnanensisNC_01809515,816Cao et al. (2012)
T. renzhiensisNC_01809416,173Cao et al. (2012)
GracillarioideaGracillariidaePhyllonorycter plataniKJ50804415,791Timmermans et al. (2014)
Phyllonorycter froelichiellaKJ50804815,538Timmermans et al. (2014)
Cameraria ohridellaKJ50804215,513Timmermans et al. (2014)
CopromorphoideaCarposinidaeCarposina sasakiiNC_02321215,611Wu et al. (2016)
Source and information for the phylogenetic analysis The 13 individual best fitting models for the nucleic acid dataset were defined with the Akaike Information Criterion (AIC) and the jModelTest 2.1.5 (Ronquist and Huelsenbeck 2003). Then, Bayesian inference (BI) analysis was performed via MrBayes v3.2.3 (Darriba et al. 2012), with the MCMC analysis run for 1,000,000 generations and a burn-in series of 1,000. In addition, the maximum likelihood (ML) method was conducted with RAxML v 8.0.2 (Stamatakis 2014), in which the 13 best fit substitution models for the 13 separate PCGs (aa sequence) were determined on the Protest web server in terms of AIC (Abascal et al. 2005).

Results

Genome Organization and Base Composition

The complete mtDNA of B. macroscopa was featured as a 15,394 bp closed circle (Fig. 1) and encoded 37 genes as well as containing a putative A + T-rich region. The 37 genes were organized by 13 PCGs (cox1-3, nd1-6, atp6, atp8, nad4L, and cytb), 22 tRNA genes and 2 rRNA genes (Table 3). Twenty of the genes were transcribed on the major strand (J-strand), and the other 17 were transcribed on the minor strand (N-strand).
Fig. 1.

Circular map of the B. macroscopa mitogenome. Annotations of the gene names were the standard abbreviations adopted in this article; single letters were drawn on the basis of IUPAC-IUB abbreviation for their matching amino acid.

Table 3.

Summary of the B. macroscopa mitogenome

GeneStrandNucleotide no.Size(bp)INAnticodonStart codonStop codon
trnMJ1–67678CAT
trnIJ76–142674GAT
trnQN147–2147083TTG
nad2J298–1,284987−2ATATAA
trnWJ1,283–1,35068−8TCA
trnCN1,343–1,4096711GCA
trnYN1,421–1,489693GTA
cox1J1,493–3,0281536−5CGATAA
trnL(UUR)J3,024–3,091680TAA
cox2J3,092–3,778687−5ATGTA-
trnKJ3,774–3,844710CTT
trnDJ3,845–3,913699GTC
atp8J3,923–4,078156−7ATATAA
atp6J4,072–4,749678−1ATGTAA
cox3J4,749–5,53778938ATGTAA
trnGJ5,576–5,64368−3TCC
nad3J5,641–5,9973572ATATAA
trnAJ6,000–6,06767−1TGC
trnRJ6,067–6,130640TCG
trnNJ6,131–6,196663GTT
trnS(AGN)J6,200–6,265667GCT
trnEJ6,273–6,335631TTC
trnFN6,337–6,402661GAA
nad5N6,404–8,1461743−10ATTTAA
trnHN8,137–8,201650GTG
nad4N8,202–9,54013390ATGT-
nad4LN9,541–9,8342942ATGTAA
trnTJ9,837–9,902660TGT
trnPN9,903–9,972702TGG
nad6J9,975–10,50252814ATATAA
cytbJ10,517–11,67111552ATATAA
trnS(UCN)J11,674–11,743703TGA
nad1N11,747–12,703957−6ATATAG
trnL(CUN)N12,698–12,7687116TAG
rrnLN12,769–14,22514570
trnVN14,226–14,291660TAC
rrnSN14,292–15,0697780
A + T-rich15,070–15,3943250
Circular map of the B. macroscopa mitogenome. Annotations of the gene names were the standard abbreviations adopted in this article; single letters were drawn on the basis of IUPAC-IUB abbreviation for their matching amino acid. Summary of the B. macroscopa mitogenome The intergenic spacer sequence was 184 bp long in total and was made up of 16 regions, which varied from 1 to 83 bp in length, with two major intergenic spacer sequences with 65.8% of base pairs of the entire spacer regions located between trnQ and nad2 (83 bp), and cox3 and trnG (38 bp). In addition, 10 overlaps of 48 bp in length were present in the B. macroscopa mitogenome. The smallest was only 1 bp, and the largest was just 10 bp, located between nad and trnH. The overall nucleotide composition of B. macroscopa mtDNA was 41.0% A, 39.9% T, 11.3% C, and 7.7% G. The A + T content was significantly biased (80.9%). The positive and negative skew were obviously distinct with an AT skew value of 0.014 and GC skew value of −0.188.

Codon Usage in PCGs

Regarding the PCGs, the major strand included nad2, cox1, cox2, atp8, atp6, cox3, nad3, nad6, and cytb, while the minor strand harbored nad5, nad4, nad4L, and nad1. Almost all of the PCGs started with ATN except for cox1, which started with CGA. Eleven of 13 PCGs ended with TAA, except for cox2 with TA, and nad4 with a single T. The PCGs contained a total of 3,593 codons, excluding the start and termination codons. AUU (Ile), AAU (Asn), UUU (Phe), UUA (Leu), and AUA (Met) were the most abundant amino acid codons, and made up 47.1% of the total. The content of A + T was usually higher than G + C according to the summarized codon usage (Fig. 2A). The relative synonymous codon usage (RSCU) of the third position showed that the frequency of AU codons in two and fourfold degeneracy was greater than GC (Fig. 2B).
Fig. 2.

Codon usage in the B. macroscopa mitogenome. (A) CDspT (codons per thousand) indicate the codons used in coding amino acids per thousand codons. Codon families are given on the x-axis. (B) RSCU.

Codon usage in the B. macroscopa mitogenome. (A) CDspT (codons per thousand) indicate the codons used in coding amino acids per thousand codons. Codon families are given on the x-axis. (B) RSCU.

The Secondary Structure for RNA Genes

The two rRNA genes, 16 s (rrnL) and 12 s (rrnS), were 1,457 and 778 bp in length, respectively. The rrnL gene was situated in trnS and trnV, and the rrnS gene was located between trnV and the A + T-rich region. Both of these genes were located on the N-strand. The complete secondary structures of the rrnL and rrnS molecules were determined and are shown in Figs. 3 and 4, respectively. The rrnL secondary structure contained six domains, made up of two conserved and four variable regions, and three other domains appeared in the rrnS secondary structure. Twenty-two tRNA genes, ranging from 63 to 71 bp in length, were contained in the B. macroscopa mitogenome. Fourteen of these genes were mapped on the J-strand and eight were mapped on the N-strand. Typical cloverleaf secondary structures were also found in this species, except for trnS which lacked a dihydrouridine arm (Fig. 5).
Fig. 3.

Predicted secondary structure of the rrnL gene in the B. macroscopa mitogenome. Red-colored letters stand for variable positions and blue for conserved positions. Tertiary structures are noted by continuous lines. Base-pairing is delineated as follows: Watson-Crick pairs are joined by lines, GU pairs by dots, and other non-canonical base pairs by asterisks. Each domain is indicated with Roman numerals.

Fig. 4.

Predicted secondary structure of the rrnS gene in the B. macroscopa mitogenome. Red-colored letters stand for variable positions and blue for conserved positions. Tertiary structures are noted by continuous lines. Base-pairing is delineated as follows: Watson-Crick pairs are joined by lines, GU pairs by dots, and other non-canonical base pairs by asterisks. Each domain is indicated with Roman numerals.

Fig. 5.

Predicted secondary structure of tRNAs gene in the B. macroscopa mitogenome. Dashes (–) indicate Watson-Crick base-pairing, centered solid pentagram () indicate G-U base-pairing, and solid triangles (▴) indicate mismatches.

Predicted secondary structure of the rrnL gene in the B. macroscopa mitogenome. Red-colored letters stand for variable positions and blue for conserved positions. Tertiary structures are noted by continuous lines. Base-pairing is delineated as follows: Watson-Crick pairs are joined by lines, GU pairs by dots, and other non-canonical base pairs by asterisks. Each domain is indicated with Roman numerals. Predicted secondary structure of the rrnS gene in the B. macroscopa mitogenome. Red-colored letters stand for variable positions and blue for conserved positions. Tertiary structures are noted by continuous lines. Base-pairing is delineated as follows: Watson-Crick pairs are joined by lines, GU pairs by dots, and other non-canonical base pairs by asterisks. Each domain is indicated with Roman numerals. Predicted secondary structure of tRNAs gene in the B. macroscopa mitogenome. Dashes (–) indicate Watson-Crick base-pairing, centered solid pentagram () indicate G-U base-pairing, and solid triangles (▴) indicate mismatches.

Features in the A + T-Rich Region

The A + T-rich region, located between the trnM and rrnS genes, was 325 bp long with a higher content (96.7%) and the absence of a large repeating fragment. The conserved structure consisted of an ‘ATAGT’-like motif, and included a poly-T. The poly-T was considered to be the origin of the minor strand replication (Fig. 6A). Three microsatellites, ‘(TA)8’, ‘(TA)9’, and ‘(TA)7’, were observed in this region, and they were located 205, 241, and 261 bp upstream of rrnS, respectively. The conserved ‘ATTTA’ sequence was located between the microsatellite (TA)8 and (TA)9. A potential stem-loop structure existed in the A + T-rich region without the ‘TATA’ sequence at the 5′ end and without the ‘G(A)nT’ at the 3′ end (Fig. 6B).
Fig. 6.

The A + T-rich region of B. macroscopa mitogenome. (A) Structure of the A + T-rich region. (B) A potential stem-loop structure found in the A + T-rich region.

The A + T-rich region of B. macroscopa mitogenome. (A) Structure of the A + T-rich region. (B) A potential stem-loop structure found in the A + T-rich region.

Phylogenetic Relationships

The phylogenetic relationships among the superfamilies within the Ditrysia were reconstructed and are shown in Figs. 7 and 8. The topological structures of the two trees were almost identical. The ditrysian group was supported as a monophyletic group by a high bootstrap value of 100% and a posterior probability of 1.00. The results from both ML and BI analysis show that the Tortricoidea is a robust monophyletic group supported by a high bootstrap value of 100% and is a sister to the clade ((Urodoidea + Copromorphoidea) + (Gelechioidea + (Cossoidea + Zygaenidae))) supported by a bootstrap value of 100% and a posterior probability of 1.00. However, bootstrap values of all small clades within the latter clade are lower. These six superfamilies are, together, a sister of the clade (Yponomeutoide +Gracillarioidea), in which Gracillarioidea is a robust monophyletic group, whereas Yponomeutoide is a polyphyletic group. Tortricoidea were recovered as a sister to the rest of Apoditrysia in the analyses, and this was supported by a bootstrap value of 100% and a posterior probability of 1.00. The Gelechioidea was assumed be a sister group to the Apoditrysia (Cho et al. 2011), but it was not supported in our results because the Gelechioidea was nested within the clade Apoditrysia in the two trees. The Apoditrysia group (Gelechioidea, Zygaenoidea, Cossoidea, Copromorphoidea, Urodoidea, and Tortricoidea) was a monophyletic clade with a bootstrap value of 100% and a posterior probability of 1.00. A sister relationship between Yponomeutoidea + Gracillarioidea and the Apoditrysia was supported by a bootstrap value of 88% and a posterior probability of 1.00. The superfamily Tineoidea was a monophyletic clade with a bootstrap value of 100% and a posterior probability of 1.00. This is the earliest clade derived from Ditrysia, supporting the general agreement that tineoids are the oldest ditrysian superfamily (Minet 1991, Kristensen and Skalski 1998). The superfamily Gelechioidea, which includes O. novimundi, E. sarcitrella, E. eupostica, B. macroscopa, and Perimede sp. in this study, formed a monophyletic group supported by a posterior probability of 1.00 (Fig. 8).
Fig. 7.

Phylogeny of lepidopteran superfamilies including B. macroscopa. ML phylogram using the concatenated 13 PCGs of mitogenomes obtained in these species, which used partitioned models for analysis. The scale bar indicates the number of substitutions per spot, and values of each node specify bootstrap percentages of 1,000 replicates.

Fig. 8.

BI phylogram using the concatenated 13 PCGs of mitogenomes obtained in these species, which used partitioned models for analysis. The scale bar indicates the number of substitutions per spot, and values of each node specify bootstrap percentages of 1,000 replicates.

Phylogeny of lepidopteran superfamilies including B. macroscopa. ML phylogram using the concatenated 13 PCGs of mitogenomes obtained in these species, which used partitioned models for analysis. The scale bar indicates the number of substitutions per spot, and values of each node specify bootstrap percentages of 1,000 replicates. BI phylogram using the concatenated 13 PCGs of mitogenomes obtained in these species, which used partitioned models for analysis. The scale bar indicates the number of substitutions per spot, and values of each node specify bootstrap percentages of 1,000 replicates.

Discussion

Genome Organization and Base Characteristics

The gene order and orientation of the B. macroscopa mitogenome were identical to the fully sequenced ditrysian species with the gene order trnM-trnI-trnQ. Nevertheless, it was different from the gene order trnI-trnQ-trnM in the non-ditrysian lineage of Lepidoptera (Flook et al. 1995, Yi et al. 2014). The AT nucleotides in the B. macroscopa mitogenome skewed slightly, and the content was 81.0%, which is the same as other lepidopteran insects in the region; such as, Lobocla bifasciatus, Argynis nerippe (Kim et al. 2011, 2014), and Argynnis hyperbius (Wang et al. 2012). The AT skew was 0.013, which indicated the occurrence of A more than T. The AT content in the PCGs was 79.2%, which is similar to that of Potanthus flavus (Kim et al. 2014) and Kallima inachus (Qin et al. 2012). The AT content of cox1 was the lowest, at 72.1%, whereas atp8 was significantly higher, at 95.5%, among the PCGs. Numerous studies have come to the conclusion that there is usually more A than T, and more C than G on the J strand, but the situation might occasionally be reversed for some species (Wei et al. 2010). We, however, observed that PCGs on the J-strand had a positive AT-skew and GC-skew. That means that this species has more A than T, and less C than G on the major strand. This novel observation had never before been made in the mitogenome of any insect. Regrettably, we have not elucidated the mechanism of this phenomenon. Nevertheless, early studies found that the value of the GC skew was not associated with gene direction but with replication orientation, while the value of the AT skew could change over gene direction, replication, and codon positions (Wei et al. 2010).

Intergenic Spacer Regions and Overlapping

The spacer is located commonly between trnQ and nad2, but it is not conserved, as its length varies in lepidopteran species. It was the longest of all spacers of B. macroscopa. This result might support the conclusion that the spacer between trnQ and nad2 has no functional significance or that it acts as another origin of replication (Cameron and Whiting 2008). In addition, the spacer, which has only been shown in the lepidopteran insects, had high homology with adjacent nad2 genes (Xia et al. 2011). The common overlap (7 bp), between atp8 and atp6 that is widely distributed in other lepidopteran mitogenomes (Jiang et al. 2009, Zhu et al. 2013, Chen et al. 2014), was also present in the B. macroscopa mitogenome. Meanwhile, the position of the maximum overlap between nad5 and trnH was different from other lepidopteran mitogenomes, such as Diaphania pyloalis, which has the maximum overlap between trnF and nad5 (Zhu et al. 2013), Chilo suppressalis, which also has the maximum overlap between trnF and nad5 (Chai et al. 2012), and Attacus atlas, which has the maximum overlap between trnW and trnC (Chen et al. 2014).

Protein-Coding Genes

‘ATA’ and ‘ATG’ were present as start codons with the same frequency among the 12 PCGs, while ‘CGA’ was the start codon of cox1. The sequencing of the 5′ region of cox1 genes from 39 lepidopteran species indicated the conservation of ‘CGA’ in lepidopteran mitogenomes and implied that ‘CGA’ may be a synapomorphic feature in Lepidoptera (Kim et al). Eleven of the 13 PCGs ended with ‘TAA’ codons, while the remaining two stopped with ‘TTA’ or a single ‘T’. The incomplete termination codons could be completed by the mRNA process of polyadenylation (Anderson et al. 1981). It is important that the incomplete stop codons could make up the complete TAA in the assembly process of mRNA (Boore 1999, Gong et al. 2012). It was concluded that A and U were more frequently used in PCGs because the value for the RSCU of NNU and NNA codons was always >1.

rRNA and tRNA Genes

The rRNA genes of insects are usually conserved in mitogenomes. Therefore, the secondary structures for rrnL and rrnS RNA in B. macroscopa are similar to those of other Lepidoptera (Gillespie et al. 2006, Niehuis et al. 2006a, b; Chai et al. 2012). The base pairs in the rrnL gene did not fully comply with Watson-Crick base-pairing. The mispairing of H991 in the rrnL secondary structures was not observed in Leucoptera malifoliella (Lepidoptera: Lyonetiidae) (Fig. 4), but it is very common in lepidopteran insects; such as, Apocheima cinerarius (Lepidoptera: Geometridae), C. suppressalis (Lepidoptera: Pyralidae), Manduca sexta (Lepidoptera: Sphingidae), and Zygaena sarpedon lusitanica (Lepidoptera: Zygaenidae). The H47, H673, H1047, and H1241 in the secondary structure of the rrnS gene were different from those of M. sexta in structure and length. The H47 portion was one of the variable sites among the species (Gillespie et al. 2006, Gong et al. 2012); therefore, it could provide valuable information on the phylogenetic relationship of H39 and H367 (Wei et al. 2010). The majority of the tRNA is likely to fold into an iconic clover-leaf secondary structure, except for trnS. The trnS was the only special case that could not form a complete cloverleaf structure, but the incomplete structure evolved early in the metazoans (Garey and Wolstenholme 1989, Wolstenholme 1992). However, previous research has shown that the trnS is highly conserved in nearly all families, as well as in B. macroscopa. Knowledge of insects involves insect ecology, behavior, systematics, host plant choices, and so on, but information about the worldwide superfamily Gelechioidea has been very limited (Hodges 1998). The sister group of Gelechioidea is still unknown (Mutanen et al. 2010). Mitogenomic phylogeny research was carried out by Timmermans et al. (2014), and it demonstrated that the Gelechioidea was not grouped with other lower Apoditrysia. The phylogenetic systematics of 10 superfamilies in Lepidoptera were constructed based on mitochondrial data that showed a sister relationship between Gelechioidea and Zygaenidae + Cossoidea, because both of them cluster in a same node and have a positive Bayesian posterior probability of 1.00. Significantly, our study may support the Gelechioidea belonging to the Apoditrysia, and it differed from the findings of Timmermans et al. (2014). The two analyses were both conducted based on mitogenome data, and whether these differences were due to the introduction of the complete mitogenome in this study remains to be investigated. The phylogenetic relationship among the Yponomeutoide, Gracillarioidea, Gelechioidea, and Apoditrysia were not well resolved by Kristensen et al. (2007). In our results, the Yponomeutoide + Gracillarioidea and Apoditrysia were sister groups and clustered on the same branch, with a bootstrap value of 88% and a high posterior probability of 1.00. Although the value of ML was lower, it can still provide some information for the uniform relationships. The structure was supported by Regier et al. (2013), and it had many differences from the nested Yponomeutoide + Gracillarioidea and Gelechioidea (Kristensen and Skalski 1998). The ML and BI analyses have different advantages in reconstructing phylogenetic trees, and in some instances the results may be different according to the different methods. The results of this study, which revealed the monophyly of Gelechioidea, was mostly supported by previous analyses that were based on morphology, ecology, and mitogenomes (Passoa 1995, Kaila 2004, Bucheli and Wenzel 2005, Regier et al. 2009, Mutanen et al. 2010, Kaila et al. 2011). Although the ML value was lower, the two separate analyses correctly interpreted the monophyly of Gelechioidea. Six species were chosen, representing five subfamilies (Symmocinae, Oecophoninae, Ethmiinae, Dichomeridinae, and Chrysopeleiinae). According to the analysis of Mutanen et al. (2010), Autostichidae was not monophyletic unless Glyphydoceridae and Deoclonidae were included. The tree, in Figs. 7 and 8, showed that Autostichidae and Oecophoridae clustered in the same node, and therefore the monophyly of Autostichidae was not confirmed. With further observations, the Ethmiinae and the other two subfamilies had higher bootstrap values and strongly resembled the earlier results of phylogenetic relationships indicated by Passoa (1995) and Bucheli and Wenzel (2005). The consensus on familial relationships ((Autostichidae + Oecophoridae) + Elachistidae) + (Gelechiidae + Cosmopterigidae) by ML and ((Elachistidae + Gelechiidae) + Oecophoridae) + (Autostichidae + Cosmopterigidae) by BI obviously conflicted with the latest molecular analysis that revealed different affinities (Elachistidae + (Cosmopterigidae + Gelechiidae) + (Autostichidae + Oecophoridae)) (Heikkilä et al. 2014). The challenge of getting better mitogenome-based phylogenies probably requires that more Gelechioidea mitogenomes are sequenced.
  58 in total

1.  jModelTest 2: more models, new heuristics and parallel computing.

Authors:  Diego Darriba; Guillermo L Taboada; Ramón Doallo; David Posada
Journal:  Nat Methods       Date:  2012-07-30       Impact factor: 28.547

2.  Mitochondrial genome of the pine tip moth Rhyacionia leptotubula (Lepidoptera: Tortricidae).

Authors:  Jia-Ying Zhu; Shu-Jun Wei; Qin-Wen Li; Song Yang; Yong-He Li
Journal:  Mitochondrial DNA       Date:  2012-07-18

3.  Complete mitochondrial genome of Kallima inachus (Lepidoptera: Nymphalidae: Nymphalinae): comparison of K. inachus and Argynnis hyperbius.

Authors:  Xin-Min Qin; Qing-Xin Guan; De-Long Zeng; Feng Qin; Hui-Min Li
Journal:  Mitochondrial DNA       Date:  2012-06-18

4.  The complete mitochondrial genome of the codling moth Cydia pomonella (Lepidoptera: Tortricidae).

Authors:  Bao-Cai Shi; Wei Liu; Shu-Jun Wei
Journal:  Mitochondrial DNA       Date:  2012-09-14

5.  Phylogeny of the mega-diverse Gelechioidea (Lepidoptera): adaptations and determinants of success.

Authors:  Lauri Kaila; Marko Mutanen; Tommi Nyman
Journal:  Mol Phylogenet Evol       Date:  2011-08-27       Impact factor: 4.286

6.  The complete mitochondrial genome of Leucoptera malifoliella Costa (Lepidoptera: Lyonetiidae).

Authors:  Yu-Peng Wu; Jin-Liang Zhao; Tian-Juan Su; Jie Li; Fang Yu; Douglas Chesters; Ren-Jun Fan; Ming-Chang Chen; Chun-Sheng Wu; Chao-Dong Zhu
Journal:  DNA Cell Biol       Date:  2012-08-02       Impact factor: 3.311

7.  The complete mitochondrial genomes of two ghost moths, Thitarodes renzhiensis and Thitarodes yunnanensis: the ancestral gene arrangement in Lepidoptera.

Authors:  Yong-Qiang Cao; Chuan Ma; Ji-Yue Chen; Da-Rong Yang
Journal:  BMC Genomics       Date:  2012-06-22       Impact factor: 3.969

8.  The complete mitochondrial genome of the oriental fruit moth Grapholita molesta (Busck) (Lepidoptera: Tortricidae).

Authors:  Ya-jun Gong; Bao-cai Shi; Zong-jiang Kang; Fan Zhang; Shu-jun Wei
Journal:  Mol Biol Rep       Date:  2011-06-14       Impact factor: 2.316

9.  Characterization of the complete mitochondrial genomes of Cnaphalocrocis medinalis and Chilo suppressalis (Lepidoptera: Pyralidae).

Authors:  Huan-Na Chai; Yu-Zhou Du; Bao-Ping Zhai
Journal:  Int J Biol Sci       Date:  2012-04-17       Impact factor: 6.580

10.  A large-scale, higher-level, molecular phylogenetic study of the insect order Lepidoptera (moths and butterflies).

Authors:  Jerome C Regier; Charles Mitter; Andreas Zwick; Adam L Bazinet; Michael P Cummings; Akito Y Kawahara; Jae-Cheon Sohn; Derrick J Zwickl; Soowon Cho; Donald R Davis; Joaquin Baixeras; John Brown; Cynthia Parr; Susan Weller; David C Lees; Kim T Mitter
Journal:  PLoS One       Date:  2013-03-12       Impact factor: 3.240

View more
  3 in total

1.  The complete mitochondrial genome of Trabala vishnou guttata (Lepidoptera: Lasiocampidae) and the related phylogenetic analyses.

Authors:  Liuyu Wu; Xiao Xiong; Xuming Wang; Tianrong Xin; Jing Wang; Zhiwen Zou; Bin Xia
Journal:  Genetica       Date:  2016-10-21       Impact factor: 1.082

2.  Complete mitochondrial genome of Scythris sinensis (Lepidoptera: Scythrididae).

Authors:  Jeong Sun Park; Min Jee Kim; Sung-Soo Kim; Iksoo Kim
Journal:  Mitochondrial DNA B Resour       Date:  2020-06-17       Impact factor: 0.658

3.  Analysis on the Complete Mitochondrial Genome of Andraca theae (Lepidoptera: Bombycoidea).

Authors:  Xing-Shi Gu; Li Ma; Xing Wang; Guo-Hua Huang
Journal:  J Insect Sci       Date:  2016-10-01       Impact factor: 1.857

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.