Literature DB >> 20376208

The complete mitochondrial genome of the fall webworm, Hyphantria cunea (Lepidoptera: Arctiidae).

Fang Liao1, Lin Wang, Song Wu, Yu-Ping Li, Lei Zhao, Guo-Ming Huang, Chun-Jing Niu, Yan-Qun Liu, Ming-Gang Li.   

Abstract

The complete mitochondrial genome (mitogenome) of the fall webworm, Hyphantria cunea (Lepidoptera: Arctiidae) was determined. The genome is a circular molecule 15 481 bp long. It presents a typical gene organization and order for completely sequenced lepidopteran mitogenomes, but differs from the insect ancestral type for the placement of tRNA(Met). The nucleotide composition of the genome is also highly A + T biased, accounting for 80.38%, with a slightly positive AT skewness (0.010), indicating the occurrence of more As than Ts, as found in the Noctuoidea species. All protein-coding genes (PCGs) are initiated by ATN codons, except for COI, which is tentatively designated by the CGA codon as observed in other lepidopterans. Four of 13 PCGs harbor the incomplete termination codon, T or TA. All tRNAs have a typical clover-leaf structure of mitochondrial tRNAs, except for tRNA(Ser)(AGN), the DHU arm of which could not form a stable stem-loop structure. The intergenic spacer sequence between tRNA(Ser)(AGN) and ND1 also contains the ATACTAA motif, which is conserved across the Lepidoptera order. The H. cunea A+T-rich region of 357 bp is comprised of non-repetitive sequences, but harbors several features common to the Lepidoptera insects, including the motif ATAGA followed by an 18 bp poly-T stretch, a microsatellite-like (AT)(8) element preceded by the ATTTA motif, an 11 bp poly-A present immediately upstream tRNA(Met). The phylogenetic analyses support the view that the H. cunea is closerly related to the Lymantria dispar than Ochrogaster lunifer, and support the hypothesis that Noctuoidea (H. cunea, L. dispar, and O. lunifer) and Geometroidea (Phthonandria atrilineata) are monophyletic. However, in the phylogenetic trees based on mitogenome sequences among the lepidopteran superfamilies, Papillonoidea (Artogeia melete, Acraea issoria, and Coreana raphaelis) joined basally within the monophyly of Lepidoptera, which is different to the traditional classification.

Entities:  

Keywords:  Arctiidae; Fall webworm; Hyphantria cunea; Lepidoptera; Mitochondrial genome; Phylogeny

Mesh:

Substances:

Year:  2010        PMID: 20376208      PMCID: PMC2850540          DOI: 10.7150/ijbs.6.172

Source DB:  PubMed          Journal:  Int J Biol Sci        ISSN: 1449-2288            Impact factor:   6.580


1. Introduction

The fall webworm, Hyphantria cunea Drury (Lepidoptera: Arctiidae), is a severe invasive and quarantine pest which has a wide range of habitats. It is a polyphagus pest that feeds on about 160 species of broad leaf trees. The preferred host plants include mulberry, oak, hickory, pecan, walnut, elm, alder, willow, sweetgum, and poplar. This insect has caused serious damage to forests throughout its range and appears to be continuing to spread. It also damages the roadside and garden trees around urban areas. The species was introduced from North America to Central Europe and Eastern Asia in the early 1940s 1, 2. In China, this species was first found in Dandong (124°N/40°E) of Liaoning Province in 1979, and now has spread southwards to Shanghai (129°N/31°E) and westwards to Xianyang (108°N/34°E) of Shanxi Province. The southern populations in China may complete three generations in one year, while in the north the fall webworm completes only one life cycle. Many studies have been done on aspects of adaptability, sex pheromones, host preference and natural enemies of the fall webworm 3. As H. cunea is a devastating invasive species, the mitochondrial genome (mitogenome) information of the species may provide fundamental information for future phylogenetic analyses and evolutionary biology. Insect mitochondrial DNA (mtDNA) is a circular DNA molecule with 14-20 kb in size and has a remarkably conserved set of 37 genes, including 13 protein-coding genes (PCGs; subunits 6 and 8 of the F0 ATPase [ATP6 and ATP8]; cytochrome oxidase subunits 1-3 [COI-III]; cytochrome b [Cytb]; NADH dehydrogenase subunits 1-6 and 4L [ND1-6 and ND4L]), two ribosomal RNA genes (large and small ribosomal RNAs [lrRNA and srRNA]), and 22 tRNA genes 4, 5. It additionally contains a control region of variable length, known as the adenine (A) + thymine (T)-rich region in insect mtDNA, which is involved in the regulation and initiation of mtDNA replication and transcription 6. The mitochondrial genes and genomes have been widely used as an informative molecular marker for diverse evolutionary studies of animals, including phylogenetics and population genetics 7-9, with the development of long range PCR for amplification of partial sequence of mtDNA genes and whole mitochondrial genome 10. At present, the complete or nearly complete mitogenome sequences from more than 100 species of insects have been determined. However, only 19 complete or nearly complete mitogenomes are currently available in the GenBank for lepidopteran species (Table 1). The Ostrinia sequences each lack the sequence information of the A+T-rich region, partial tRNA and srRNA sequence. The L. chinensis and P. xuthus sequences each lack the sequence information of the partial srRNA, A+T-rich region, tRNA, and partial ND2. Within the insects, the Lepidoptera order accounts for more than 160 000 species. Despite this huge taxonomic diversity the existing information on lepidopteran mtDNA is very limited and limited to six superfamiles among the 45-48 known and to 13 families of the recognized 120. Newly added lepidopteran mitogenomes can provide further insights into our understanding of diversity of lepidopteran mitogenomes and evolution.
Table 1

List of the complete mitogenome of Lepidoptera

Superfamily / FamilySpeciesAcc. numberReference
Noctuoidea
ArctiidaeHyphantria cuneaGU592049This study
LymantriidaeLymantria disparFJ617240Zhu et al. unpublished
NotodontidaeOchrogaster luniferAM94660111
Geometroidea
GeometridaePhthonandria atrilineataEU56976412
Bombycoidea
SaturniidaeAntheraea pernyiAY24299613
SaturniidaeAntheraea yamamaiEU72663014
SaturniidaeCaligula boisduvaliiEF62222715
SaturniidaeEriogyna pyretorumFJ68565316
BombycidaeBombyx moriAB07026417
BombycidaeChinese Bombyx mandarinaAY30162018
BombycidaeJapanese Bombyx mandarinaNC_00339517
SphingidaeManduca sextaEU28678519
Pyraloidea
CrambidaeDiatraea saccharalisFJ240227Li and Yue, unpublished
CrambidaeOstrinia furnicalisNC_00336820
CrambidaeOstrinia nubilalisNC_00336720
Tortricoidea
TortricidaeAdoxophyes honmaiDQ07391621
Papilionoidea
NymphalidaeAcraea issoriaNC_01360422
PieridaeArtogeia meleteEU59712423
LycaenidaeCoreana raphaelisDQ10270324
PapilionidaeLuehdorfia chinensisEU622524Liu et al. unpublished
PapilionidaePapilio xuthusEF621724Feng et al. unpublished
The H. cunea mitochondrial COI, COIII and Cytb have been utilized for biological identification as DNA barcode and phylogenetic studies 25, 26, but the genetic information on the complete mtDNA of the species remains largely unknown. In this study, we describe the complete mitogenome sequence of the fall webworm, H. cunea, and compare its sequence with other available lepidopteran mitogenomes. Furthermore, the mitogenome sequence of H. cunea was used to provide further insight into the phylogenetic relationships among lepidopteran superfamilies.

2. Materials and Methods

Insect and total DNA extraction

The H. cunea larvae were collected on the mulberry trees on the campus of the Shenyang Agricultural University, Shenyang, China. The larvae were then fed on the leaves of mulberry trees in room until pupation. The fresh pupae were directly frozen and kept in the laboratory at - 80 °C. A single pupa was used to extract the total DNA using the TIANamp Genomic DNA Kit (TIANGEN, Beijing, China) according to the manufacturer's instruction.

PCR amplification and sequencing

The full mitogenome of H. cunea was amplified in four overlapping fragments by PCR amplification using universal primers and specific primers designed for this study (Fig. 1; Table 2). All PCR reactions were performed in a 50 μl volume with 1 U of LA Taq (TaKaRa Co., Dalian, China), 1 μl (about 20 ng) of DNA, 5 μl 10 × LA Taq buffer (Mg2+ plus), 200 μM dNTPs, and 10 pmol each primer. Initially, the H. cunea COI gene of ~600 bp was amplified using the primer set LYQ1/LYQ2 as previously reported 25, and the Cytb gene of ~400 bp was amplified using the primer set LYQ5/LYQ6 as previously reported 27. The PCR amplification was performed under the following procedure: 2 min at 94 °C, followed by 35 cycles of 1 min at 94 °C, 30 sec at 50 °C, and 1 min at 72 °C, with a subsequent 10 min final extension at 72 °C. After purification with TIANgel Midi Purification Kit (TIANGEN, Beijing, China), the PCR fragments were directly sequenced with the PCR primers.
Fig 1

Linear map of the mitogenome of Hyphantria cunea. The tRNAs are labeled according to the IUPAC-IUB single letter amino acid codes above the bar indicating coding sequence on major strand or below the bar showing on minor strand. One-letter symbol L, L*, S and S* denote codon tRNA, tRNA, tRNA, and tRNA, respectively. Underlined PCGs or rRNA genes are located on minor strand and PCGs that are not underlined are located on major strand. Overlapping lines (F1-F4) under the map denote four overlapping PCR fragments amplified for sequencing. The line at the lower left represents the map scale.

Table 2

Primers used to amplify the Hyphantria cunea mitogenome

PrimerSequence (5'- 3')FragmentReference
LYQ1GGTCAACAAATCATAAAGATATTGGF125
LYQ2TAAACTTCAGGGTGACCAAAAAATCA
LYQ5TATGTACTACCATGAGGACAAATATCF227
LYQ6ATTACACCTCCTAATTTATTAGGAAT
LYQ29CTTTTCTATTACTTCTTTCTCTTCCTGTTTTAF3This study
LYQ32TAAAATAATAAATGGTAATAAAAAATGAAATG
LYQ30TAAAACAGGAAGAGAAAGAAGTAATAGAAAAGF4This study
LYQ31CATTTCATTTTTTATTACCATTTATTATTTTA
On the basis of the information from the determined fragments, two new primer pairs LYQ29/LYQ32 and LYQ30/LYQ31 were designed to amplify the remaining longer fragments of the mitogenome of H. cunea (F3 and F4 in Fig. 1 and Table 1). The two fragments were amplified with denaturation at 94 °C for 2 min, followed by 35 cycles of 1 min at 94 °C, 10 min at 65 °C, with a subsequent 10 min final extension at 72 °C. These PCR products were then utilized to construct a shotgun sequencing library. In brief, DNAs were sheared into 1-3 kb fragments using DNase I, and the DNA fractions were collected with a Chromaspin TE 1000 column. The DNA fractions were then cloned into the pGEM-T easy vector (Promega, USA), and each of the resultant plasmid DNAs was isolated with a Wizard Plus SV Minipreps DNA Purification System (Promega, USA). DNA sequencing was conducted using the ABI PRISM BigDyeTerminator v3.1 Cycle Sequencing Kit and the ABI PRISMTM 3100 Genetic Analyzer (PE Applied Biosystems, USA). All fragments were sequenced from both strands. The number of clones sequenced was sufficient to fulfill the six times coverage of the mitogenome.

Sequence analysis

The sequence alignment was carried out using Clustal X 28. The PCGs and rRNA genes were determined by BLAST on NCBI Entrez Database and by comparing them with homologous regions in other lepidopteran mitogenome sequences. The PCG nucleotide sequences were translated on the basis of the Invertebrate Mitochondrial Genetic Code. The tRNA genes and its secondary structure were predicted using the tRNAscan-SE Search 29. The two tRNA secondary structure not found by tRNAscan-SE Search was developed using the constraints proposed by Steinberg and Cedergren 30. Composition skew analysis was carried out to describe the base composition of nucleotide sequences, which measures the relative number of As to Ts (AT skew=[A-T]/[A+T]) and Gs to Cs (GC skew=[G-C]/[G+C]) 31. Codon usage was calculated using the Countcodon program version 4 (http://www.kazusa.or.jp/codon/countcodon.html). The entire A+T-rich region was subjected to a search for the tandem repeats using Tandem Repeats Finder program 32. The sequence data has been deposited in GenBank under accession No. GU592049.

Phylogenetic analysis

To illustrate the phylogenetic relationship of Lepidoptera, the other complete mitogenomes were obtained from GenBank. The L. chinensis and P. xuthus sequences lacking more sequence information were excluded. The mitogenomes of Drosophila yakuba (NC_001322) 33 and Anopheles gambiae (NC_002084) 34 were used as outgroups. The alignment of the amino acid sequences of each 13 mitochondrial PCGs was aligned with Clustal X 28 using default settings and concatenated. As for the ND4 genes, the insertion of A nucleotide in O. furnacalis (position 8211 bp) and O. nubilalis (position 8206 bp) resulted in transcript frameshifts 13, the amino acid sequences of which were therefore revised for further phylogenetic analyses. The concatenated set of amino acids sequences from the 13 PCGs was used in phylogenetic analyses, which was performed using maximum parsimony (MP) and Nerghbor-joining (NJ) methods by using MEGA ver 4.0 35.

3. Results and Discussion

Genome organization

The H. cunea mitogenome presents the typical gene content observed in metazoan mitogenomes (Table 3, Fig. 1): 13 PCGs, 22 tRNA genes, two rRNA subunits, and a major non-coding region known as the A+T-rich region in insects 5. The complete mitogenome of H. cunea consists of 15 481 bp, which is well within the range observed in the completely sequenced lepidopteran insects, with size ranging from 15 140 in A. melete to 15 928 in Japanese B. mandarina (Table 4). The gene order and orientation of the H. cunea mitogenome are identical to the completely sequenced lepidopteran mitogenomes. By the translocation of tRNA to a position 5' upstream of tRNA, the lepidopteran arrangement differs from that of D. yakuba, the hypothesized ancestral gene order of insects 36. This suggests that the mitochondrial gene arrangement in lepidopteran insects evolved independently after splitting from its stem lineage 24.
Table 3

Annotation and gene organization of the Hyphantria cunea mitogenome

GeneStrandNucleotide no.Size(bp)AnticodonNonOLStart codonStop codon
tRNAMetJ1-6767CAT
tRNAIleJ68-13669GAT11
tRNAGlnN148-21669TTG50
ND2J267-1277101118ATTTAA
tRNATrpJ1 296-1 36469TCA8
tRNACysN1 357-1 41963GCA4
tRNATyrN1 424-1 48966GTA11
COIJ1 501-3 0341534CGAT-tRNA
tRNALeu(UUR)J3 035-3 10066TAA
COIIJ3 101-3 782682ATGT-tRNA
tRNALysJ3 783-3 85371CTT1
tRNAAspJ3 853-3 91866GTC
ATP8J3 919-4 0801627ATATAA
ATP6J4 074-4 7506775ATGTA-COIII
COIIIJ4 756-5 5477929ATGTAA
tRNAGlyJ5 557-5 62165TCC
ND3J5 622-5 975354ATTTAA
tRNAAlaJ5 976-6 04267TGC7
tRNAArgJ6 050-6 11667TCG5
tRNAAsnJ6 122-6 18867GTT9
tRNASer(AGN)J6 198-6 26366GCT21
tRNAGluJ6 285-6 35268TTC2
tRNAPheN6 351-6 41868GAA3
ND5N6 422-8 1671746ATATAA
tRNAHisN8 168-8 23568GTG
ND4N8 236-9 5741339ATGT-tRNA
ND4LN9 575-9 8622885ATGTAA
tRNAThrJ9 868-9 93265TGT
tRNAProN9 933-9 99765TGG7
ND6J10 005-10 53553113ATTTAA
CytbJ10 549-11 697114917ATATAA
tRNASer(UCN)J11 715-11 78268TGA34
ND1N11 817-12 7559391ATGTAA
tRNALeu(CUN)N12 757-12 82468TAG
lrRNAN12 825-14 2501426
tRNAValN14 251-14 31666TAC
srRNAN14 317-15 124808
A+T-rich region15 125-15 481357

J-strand, majority-coding strand; N-strand, minority-coding strand; Non, non-coding region; OL, overlapping region.

Table 4

Composition and skewness in the major strand of lepidopteran mitogenomes

Speciessize (bp)A%G%T%C%A+T %AT skewGC skew
Whole genome
Hyphantria cunea15 48140.587.5539.8012.0780.380.010-0.230
Lymantria dispar15 56940.587.5739.3012.5579.880.016-0.247
Ochrogaster lunifer15 59340.097.5637.7514.6077.840.030-0.318
Phthonandria atrilineata15 49940.787.6740.2411.3181.020.007-0.192
Antheraea pernyi15 56639.227.7740.9412.0680.16-0.021-0.216
Antheraea yamamai15 33839.267.6941.0412.0280.29-0.022-0.219
Caligula boisduvalii15 36039.347.5841.2811.7980.62-0.024-0.217
Eriogyna pyretorum15 32739.177.6341.6511.5580.82-0.030-0.204
Bombyx mori15 65643.067.3138.3011.3381.360.059-0.216
Chinese Bombyx mandarina15 68243.117.4038.4811.0181.590.057-0.196
Japanese Bombyx mandarina15 92843.087.2138.6011.1181.680.055-0.213
Manduca sexta15 51640.677.4641.1110.7681.79-0.005-0.181
Diatraea saccharalis15 49040.877.4239.1512.5680.020.021-0.257
*Ostrinia nubilalis 14 53541.368.0238.8111.8280.170.031-0.192
*Ostrinia furnicalis 14 53641.467.9138.9211.7180.370.032-0.194
Adoxophyes honmai15 68040.157.8840.2411.7380.39-0.001-0.178
Acraea issoria15 24538.947.7440.8112.5079.76-0.023-0.235
Artogeia melete15 14040.387.8739.4112.3579.780.012-0.222
Coreana raphaelis15 31439.377.3043.2910.0482.66-0.047-0.158
*Luehdorfia chinensis13 86040.077.7440.4411.7580.51-0.005-0.206
*Papilio xuthus13 96439.537.8540.4512.1779.98-0.012-0.216
A+T-rich region
Hyphantria cunea35745.661.1249.303.9294.96-0.038-0.556
Lymantria dispar43545.291.6150.502.3096.09-0.054-0.176
Ochrogaster lunifer31944.511.5748.905.0293.42-0.047-0.524
Phthonandria atrilineata45740.700.6657.551.0998.25-0.171-0.246
Antheraea pernyi55241.124.1749.285.4390.40-0.090-0.127
Antheraea yamamai33441.623.5947.906.8989.52-0.070-0.315
Caligula boisduvalii33042.122.1249.396.3691.52-0.079-0.500
Eriogyna pyretorum35842.182.5150.005.3192.18-0.085-0.358
Bombyx mori49444.941.6250.612.8395.55-0.059-0.272
Chinese Bombyx mandarina48446.492.6947.932.8994.42-0.015-0.036
Japanese Bombyx mandarina74745.522.4149.672.4195.18-0.0430
Manduca sexta32445.001.5450.313.2995.37-0.055-0.334
Diatraea saccharalis33543.280.6051.644.4894.93-0.088-0.764
Adoxophyes honmai48948.472.8645.812.8694.270.0280
Acraea issoria43045.811.4050.232.5696.05-0.046-0.293
Artogeia melete35143.873.1345.307.6989.17-0.016-0.421
Coreana raphaelis37544.271.3349.874.5394.13-0.059-0.545

* partial mitogenome lacking of the A+T-rich region.

Genome composition and skewness

The genome composition of the major strand of the H. cunea mitogenome is heavily biased toward As and Ts, accounting for 80.38%: A 40.58%, G 7.55%, T 39.80% and C 12.07%, as is the case with other insect sequences (Table 4). The bias value is similar to the completely sequenced lepidopteran insects, with the range from 77.84% in O. lunifer to 82.66% in C. raphaelis. The A+T content in the sequence of the A+T-rich region is 94.96%, also within the range observed in the completely sequenced lepidopteran insects, with the value from 89.17% in A. melete to 98.25% in P. atrilineata. The lepidopteran AT skewness values vary from -0.047 in C. raphaelis to 0.059 in B. mori (Table 4). The AT skewness for the major strand of the H. cunea mitogenome is slightly positive (0.010), indicating the occurrence of more As than Ts. This case is also found in L. dispar (0.016), O. lunifer (0.030), P. atrilineata (0.007), B. mori (0.059), Japanese B. mandarina (0.055), Chinese B. mandarina (0.057), A. melete (0.012), D. saccharalis (0.021), O. nubilalis (0.031), and O. furnicalis (0.032). In contrast, the AT skews are negative in the other lepidopteran mitogenomes. When considering the A+T-rich region, however, the bias toward the use of Ts over As is more obvious in the analyzed lepidopteran mitogenomes with the H. cunea mitogenome exhibiting a slightly value (-0.038). The only one exception is represented in A. honmai where the A+T-rich region exhibits a slightly positive AT skewness (0.028). In all sequenced lepidopteran mitogenomes, the GC skewness values vary from -0.158 in C. raphaelis to -0.318 in O. lunifer with the H. cunea mitogenome exhibiting a moderate skewness value (-0.230), referring to the occurrence of more Cs than Gs in the lepidopteran mitogenomes.

Protein-coding genes

All of the PCGs in the H. cunea mitogenome are initiated by typical ATN codons (six with ATG, three with ATT, and three with ATA), except for COI (Table 3). The open reading frame of the H. cunea COI gene also starts at a CGA codon for arginine as found in all lepidopteran insects (Fig. 2). The typical ATN initiator for mitochondrial PCGs is also not found at the start site for H. cunea COI or near the tRNA. The plausible translation initiator for H. cunea COI is ATA, located within the tRNA gene, overlapping 19 bp with the tRNA; however, a codon following this triplet has a TAG-stop codon before the CGA codon. This ATA sequence is unlikely to be the start site for H. cunea COI, and there are no other probable start codons for H. cunea COI. Thus, the COI gene must use an atypical start site. In the previous studies, some tetranucleotide (ATAA, TTAA, GTAA and ATTA) and a hexanucleotide (ATTTAA) have been proposed as an initiator of COI for Diptera insects including mosquitoes and Drosophila 34, 37-41. Among the completely sequenced lepidopteran insects, including A. pernyi, A. yamamai, B. mori, B. mandarina, and C. raphaelis, the TTAG has been designated as an initiator for COI 13, 14, 17, 18, however, Ostrina species were designated as ATTTAG 20, and A. issoria and C. boisduvalii were designated as TTG 15, 22, and six species (A. honmai, A. melete, E. pyretorum, M. sexta, P. atrilineata, and O. lunifer) were designated as CGA 11, 12, 16, 19, 21, 23. A recent study by analysis of the transcript information from the cDNA sequence of the mtDNA-encoded protein gene revealed that the translation initiation codon for the COI gene is TCG (Serine), rather than those atypical and longer codons in Diptera 41 Therefore, we tentatively designated the CGA as the COI start codon although no mRNA expression data for H. cunea are available until now.
Fig 2

Alignment of initiation region for the cytochrome oxidase subunit I (COI) genes of lepidopteran insects. The Diptera insect Anopheles funestus was included due to fact that the translation initiation codon for the COI gene was determined by analysis of the transcript information analysis 41. The first four or five codons and their amino acids are shown on the right-hand side of the figure. Boxed nucleotides are the presumed translation initiators, which have been postulated as the initiation codon for COI in each species. Underlined nucleotides indicate the adjacent partial sequence of tRNA. Arrows indicate the direction of transcription.

Nine of 13 PCGs in H. cunea harbor the usual termination codon TAA, but the remaining four possess the incomplete termination codons T for COI, COII, ND4, and TA for ATP6 (Table 3). The COI, COII, and ND4 terminate with T exactly adjacent to tRNAs, and ATP6 terminate with TA immediately followed by the ATG translation initiation codon of COIII. These incomplete stop codons are commonly found in metazoan mitochondrial genes 33. The common interpretation of this phenomenon is that TAA termini are created via posttranscriptional polyadenylation 42. The Relative Synonymous Codon Usage (RSCU) in PCGs was investigated and the results are summarized in Table 5. In PCGs of the H. cunea mitogenome, the codons CCG, GCG, TGC, CGC, AGC, and AGG are not represented. The genome-wise A+T bias is also reflected in the codon usage of H. cunea mitogenome. The codons TTA (Leu), ATT (Ile), TTT (Phe), and ATA (Met) are the four most frequently used codons in the H. cunea mitogenome, accounting for 39.2%. These codons are all composed of A or T nuleotides, thus indicating the biased usage of A and T nucleotides in the H. cunea PCGs. These four codons were also most frequently used in the sequenced lepidopteran insects. Leucine (14.82%), isoleucine (11.99%), phenylalanine (9%), and serine (8.52%) are the most frequent amino acids in H. cunea mitochondrial proteins, accounting for 44.33%. These amino acids are also the most frequently represented in other insects, averaging 45.08% 43.
Table 5

Codon usage of the protein-coding genes in Hyphantria cunea mitogenome*

Codon (aa)n%RSCUCodon (aa)n%RSCUCodon (aa)n%RSCU
UUU(F)3328.951.99UCU(S)1133.052.86UAU(Y)1804.851.82
UUC(F)20.590.01UCC(S)100.270.25UAC(Y)180.490.18
UUA(L)4490.104.85UCA(S)812.182.05UAA(*)---
UUG(L)350.940.38UCG(S)20.050.05UAG(*)---
CUU(L)431.160.47AGU(S)280.750.71CAU(H)561.511.65
CUC(L)10.030.01AGC(S)000CAC(H)120.320.35
CUA(L)200.540.22AGA(S)822.212.08CAA(Q)601.621.97
CUG(L)20.050.02AGG(S)000CAG(Q)10.030.03
AUU(I)41011.051.84ACU(U)731.971.93AAU(N)2155.791.69
AUC(I)350.940.16ACC(U)140.380.37AAC(N)391.050.31
AUA(M)2657.141.91ACA(U)621.681.64AAA(K)1002.691.82
AUG(M)120.320.09ACG(U)20.050.05AAG(K)100.270.18
GUU(V)902.432.37GCU(A)842.262.63GGU(G)681.831.35
GUC(V)50.130.13GCC(A)60.160.19GGC(G)30.080.06
GUA(V)501.351.32GCA(A)381.021.19GGA(G)1082.912.14
GUG(V)70.190.18GCG(A)000GGG(G)230.620.26
UGU(C)280.752.00CGU(R)140.381.08CCU(P)752.022.36
UGC(C)000CGC(R)000CCC(P)110.300.35
UGA(W)922.481.88CGA(R)360.972.77CCA(P)411.101.29
UGG(W)60.160.12CGG(R)20.050.15CCG(P)000
GAU(D)591.591.84GAA(E)661.781.74
GAC(D)50.130.16GAG(E)100.270.26

*A total of 3711 codons were analyzed excluding the initiation and termination codons. RSCU, relative sunonymous codon usage.

rRNA genes and tRNA genes

As in all other insect mitogenome sequences, two rRNA genes are present in H. cunea. They are located between tRNA and tRNA, and between tRNA and the A+T-rich region, respectively. The length of the H. cunea lrRNA is 1 426 bp, which is the longest among the available completely sequenced lepidopteran insects, with the size range from1 412 bp in D. saccharalis to 1 319 bp in A. melete. The length of the H. cunea srRNA is 808 bp, which is well within the range observed in the available completely sequenced lepidopteran insects, with the size range from 806 bp in O. lunifer to 774 bp in C. boisduvalii. The H. cunea mitogenome harbors 22 tRNA genes, which are scattered around the molecule (Table 2; Fig. 1). The predicted secondary structure of the H. cunea tRNAs are shown in Fig. 3. Twenty tRNA genes were identified by tRNAscan Search 29. These 20 tRNA genes vary from 63 bp (tRNA) to 71 bp (tRNA) in size, and present a typical clover-leaf secondary structure of previously published mitochondrial tRNA genes. The H. cunea tRNA and tRNA genes not identified by tRNAscan-SE Search were determined to be 66 bp and 68 bp in size, respectively. Their sizes were determined by comparing the conserved relative genome position and sequence similarity with other lepidopteran mitogenome sequences. The tRNA also shows a typical clover-leaf secondary structure. However, the tRNA presents an unusual secondary structure lacking a stable stem-loop structure in the DHU arm, which has been observed in several other metazoan species including insects 4. The anticodons are identical to those observed in other lepidopteran insects.
Fig 3

Inferred secondary structure of the 22 tRNAs of the Hyphantria cunea mitogenome. +, GT pairs; -, AT/GC pairs. The tRNAs are labeled with the abbreviations of their corresponding amino acids. Arms of tRNAs (clockwise from top) are the amino acid acceptor arm, the TψC arm, the anticodon arm, and the dihydrouridine arm.

A total of 23 unmatched base pairs occurred in the H. cunea tRNA genes. Twelve of 22 tRNA genes, including tRNA, tRNA, tRNA, tRNA, tRNA, tRNA, tRNA, tRNA, tRNA, tRNA, tRNA, and tRNA, were found to have 18 G-U mismatches in their secondary structures, which forms a weak bond. Three U-U mismatches were found in the amino acid acceptor stem of tRNA, tRNA, and tRNA. The tRNA gene was proposed to contain an A-A mismatch in the TψC stem, and the tRNA gene contained an A-A mismatch in the anticodon stem. Moreover, the tRNA and tRNA genes were found to contain an extra nucleotide A in the TψC stem, respectively. Mismatches observed in tRNAs can be corrected through RNA-editing mechanisms that are well known for arthropod mtDNA 44. The number of mismatches in the H. cunea is similar to those observed in other available lepidopteran insects: 24 in A. pernyi 13, and 24 in E. pyretorum 16; but lower than in O. lunifer where 35 mismatches were found 11. No mechanism, however, has been deduced for such high numbers of mismatches in insect mitochondrial tRNAs.

Noncoding and overlapping regions

The H. cunea mitogenome harbors a total of 230 bp intergenic spacer sequences, which are spread over 18 regions ranging in size from 1 to 50 bp (Table 3). The largest intergenic spacer sequence is present between tRNAand ND2 gene, with an extreme richness in A and T nucleotides (92%). This spacer is not found in non-lepidopteran insect species 19, but is found to be a feature common to the 21 lepidopteran mitogenomes which have been sequenced to date. By alignment analysis, whilst invariant between each of the congeneric species-pairs which have been examined (O. furnicalis and O. nubilalis; B. mori and B. mandarina; A. pernyi and A. yamamai), this region showed limited sequence conservation between even closely related lepidopteran groups such as within Bombycoidea or between bombycoids and other macrolepidopterans 11, 19, indicating it would imply no functional significance or might not serve as another origin of replication 15. Additionally, two other intergenic spacer sequences of more than 20 bp are present between tRNAand tRNA (21 bp), and between tRNAand ND1 (34 bp), respectively. The spacer region of 34 bp also contains the ATACTAA motif 19, which is conserved across the Lepidoptera order (Fig. 4). This 7 bp motif is possibly fundamental to site recognition by the transcription termination peptide (mtTERM protein) 45. This region is present in most insect mtDNAs even if the nucleotide sequence can be quite divergent 19.
Fig 4

Alignment of the intergenic spacer region between tRNAand ND1 of lepidopteran insects. The shaded ATACTAA motif 19 was conserved across the Lepidoptera order.

Eighteen base pairs were identified as overlapping sequences varying from 1 to 8 bp in four regions (Table 3). The longest overlap is 8 bp between tRNA and tRNA. Similarly sized overlaps are also observed in other sequenced lepidopteran species 14. The 7 bp overlap with the reading frame involving the ATP8/ATP6 genes was found. This feature is common to other sequenced lepidopteran mitogenomes, and was found in many animal mitogenomes 5. As for the two remaining overlaps, one is 1 bp between tRNA and tRNA, the other is 2 bp between tRNA and tRNA.

A+T- rich region

The H. cunea A+T-rich region is located between the srRNA gene and tRNA (Table 3; Fig. 1), which includes the origin sites for transcription and replication 45. This region was identified to be 357 bp in length, which is well within the range observed in the completely sequenced lepidopteran insects, with size ranging from 319 bp in O. lunifer to 747 bp in Japanese B. mandarina (Table 4). The A+T-rich region shows the highest A+T content (94.96%) of any region of the H. cunea mitogenome. The presence of extra tRNA-like structures in the A+T-rich region has been reported in the lepidopteran insects. In the case of Chinese B. mandarina, one tRNA(TGA)-like sequence is located within the A+T-rich region forming a structure with four stem-loops and one big loop 18. In A. yamamai A+T-rich region, two tRNA-like structures are present: tRNA(UCN)-like sequence and tRNA-like sequence, which possess the proper anticodon and form a clover-leaf structure, indicating they may be functional although there are many mismatches in both aminoacyl and anticodon stem regions 14. However, no tRNA-like structure was detected in the H. cunea A+T-rich region. The presence of varying copy numbers of tandemly-repeated elements has been reported to be one of the characteristics of the insect A+T-rich region 7. Some lepidopteran insects have been observed to possess the repeat element in the A+T-rich region. In the case of Antheraea, the A. pernyi A+T-rich region harbors a repeat element of 38 bp tandemly repeated six times 13, whereas the A. roylei A+T-rich region has five repeat elements 9, 13. In Japanese B. mandarina, the A+T-rich region harbors a tandem triplication of a 126 bp repeat unit, whereas in B. mori and Chinese B. mandarina the A+T-rich region has only one repeat element 17, 18. It has also been reported that Arethusana arethusa (Nymphalidae: Satyrinae), Leptidea sinapis (Pieridae), and Parnassius apollo (Papilionidae) have a longer A+T-rich region (~500-700 bp), due mainly to an increase in the size and copy number of repeat units 46. However, the H. cunea A+T-rich region is comprised of non-repetitive sequences, but harbors several features (Fig. 5) common to the Lepidoptera A+T-rich region 11, 19. In B. mori, the ON (origin of minority or light strand replication) is located 21 bp downstream from srRNA gene, which contains the motif ATAGA followed by an 18 bp poly-T stretch 47. A very similar pattern occurs in H. cunea where the ATAGA motif is located 20 bp downstream from srRNA gene and is followed by an 18 bp poly-T stretch. A microsatellite-like (AT)8 element preceded by the ATTTA motif is present in the 3' end of the H. cunea A+T-rich region. The presence of a microsatellite preceded by the ATTTA motif is common in the control regions of insect mitogenomes, and has been found in all other lepidopteran species which have been sequenced 19. Finally, an 11 bp poly-A is present immediately upstream tRNA. This poly-A element is still a common feature of the A+T-rich region in Lepidoptera 19, 46. This sequence has been suggested to be involved in the control of transcription and/or replication initiation in other insects or have some other unknown functional role 7.
Fig 5

The features present in the A+T-rich region of Hyphantria cunea. The sequence is shown in the N strand.

Phylogenetic relationships

To place the H. cunea mtDNA sequence in perspective relative to other lepidopteran insect mitogenomes and to probe into the phylogenetic relationships among the lepidopteran superfamilies, a data set containing the concatenated amino acid sequences of 13 PCGs was generated. The sequences of the 13 PCGs were concatenated, rather than analyzed separately, to reconstruct the phylogenetic relationships, which may result in a more complete analysis 48. The final alignment resulted in 3838 amino acid sites for the 19 ingroup and two outgroup taxa, including gaps. Of these sites, 1606 were conserved, 2170 were variable, and 1555 were informative for parsimony. The MP and NJ analyses generate overall similar topology except for the branching superfamily among A. honmai, and Papillonoidea species (Fig. 6). The phylogenetic analyses support a close relationship between H. cunea and L. dispar with 100% bootstrapping value, which is consistent with the morphological classification. The phylogenetic analyses also support a sister relationship between Noctuoidea (H. cunea, L. dispar, and O. lunifer) and Geometroidea (P. atrilineata).
Fig 6

Phylogeny of lepidopteran insects. (A) Current hypothesis of lepidopteran superfamily relationships after Kristensen and Skalski (1999) 50. Phylogenetic trees inferred from amino acid sequences of 13 PCGs of the mitogenome by using MP analysis (B) and NJ analysis (C). Drosophila yakuba 33 and Anopheles gambiae 34 were used as outgroups. The numbers above branches specify bootstrap percentages (1000 replicates).

The M. sexta is sometimes placed in their own superfamily Sphingidea. However, phylogenetic analyses based on the complete mitogenome in this study strongly support the placement within the superfamily Bombycoidea, which is consistent with the previous findings by morphological analysis 49 and by molecular analysis based on some nuclear genes 50. These 19 sequences represent six superfamilies within the lepidopteran suborder: Bombycoidea, Geometroidea, Noctuoidea, Papillonoidea, Pyraloidea, and Tortricidea. Based on morphological analysis, Bombycoidea, Noctuoidea, Papillonoidea and Geometroidea are designated as the Macrolepidoptera; Pyraloidea together with Macrolepidoptera are designated as Obtectornera; Tortricoidea is the sister to the remaining lepidopteran superfamilies covered in the present study (Fig. 6A) 49. In the phylogenetic trees constructed (Fig. 6B and C), the butterflies of Papillonoidea (A. melete, A. issoria, and C. raphaelis) are sisters to the remaining lepidopteran superfamilies, also showing a basal position within the monophyly of Lepidoptera 16. This result is different to the traditional morphological analyses (Fig. 6A). Also, recent phylogenetic analyses of 123 species representing 27 superfamilies of Ditrysia based on five protein-coding nuclear genes (6.7 kb total) provide sufficient information to conclusively demonstrate that several prominent features of the current morphology-based hypothesis, including the position of the butterflies, need revision 51. These results present in this study suggest that the complete insect mitogenome sequence has a power to resolve the majority of family relationships within superfamilies, however, the deeper nodes among superfamilies need more efforts.
  45 in total

1.  Biological identifications through DNA barcodes.

Authors:  Paul D N Hebert; Alina Cywinska; Shelley L Ball; Jeremy R deWaard
Journal:  Proc Biol Sci       Date:  2003-02-07       Impact factor: 5.349

2.  Hexapod origins: monophyletic or paraphyletic?

Authors:  Francesco Nardi; Giacomo Spinsanti; Jeffrey L Boore; Antonio Carapelli; Romano Dallai; Francesco Frati
Journal:  Science       Date:  2003-03-21       Impact factor: 47.728

Review 3.  Animal mitochondrial DNA: structure and evolution.

Authors:  D R Wolstenholme
Journal:  Int Rev Cytol       Date:  1992

Review 4.  Mitochondrial transcription initiation. Variation and conservation.

Authors:  G S Shadel; D A Clayton
Journal:  J Biol Chem       Date:  1993-08-05       Impact factor: 5.157

5.  Drosophila melanogaster mitochondrial DNA, a novel organization and genetic code.

Authors:  M H de Bruijn
Journal:  Nature       Date:  1983 Jul 21-27       Impact factor: 49.962

6.  tRNA punctuation model of RNA processing in human mitochondria.

Authors:  D Ojala; J Montoya; G Attardi
Journal:  Nature       Date:  1981-04-09       Impact factor: 49.962

7.  The mitochondrial DNA molecular of Drosophila yakuba: nucleotide sequence, gene organization, and genetic code.

Authors:  D O Clary; D R Wolstenholme
Journal:  J Mol Evol       Date:  1985       Impact factor: 2.395

8.  The complete mitochondrial genome of the yellow coaster, Acraea issoria (Lepidoptera: Nymphalidae: Heliconiinae: Acraeini): sequence, gene organization and a unique tRNA translocation event.

Authors:  Jing Hu; Daxiu Zhang; Jiasheng Hao; Dunyuan Huang; Stephen Cameron; Chaodong Zhu
Journal:  Mol Biol Rep       Date:  2009-11-29       Impact factor: 2.316

9.  Genes for cytochrome c oxidase subunit I, URF2, and three tRNAs in Drosophila mitochondrial DNA.

Authors:  D O Clary; D R Wolstenholme
Journal:  Nucleic Acids Res       Date:  1983-10-11       Impact factor: 16.971

10.  The utility of the neglected mitochondrial control region for evolutionary studies in lepidoptera (insecta).

Authors:  Marta Vila; Mats Björklund
Journal:  J Mol Evol       Date:  2004-03       Impact factor: 2.395

View more
  38 in total

1.  Complete mitochondrial genome of yellow meal worm (Tenebrio molitor).

Authors:  Li-Na Liu; Cheng-Ye Wang
Journal:  Dongwuxue Yanjiu       Date:  2014-11-18

2.  Characterization and Phylogenetic Analysis of the Complete Mitochondrial Genome of Saturnia japonica.

Authors:  Jiang Liu; Junjun Dai; Jinjin Jia; Yemei Zong; Yahao Sun; Ying Peng; Lei Wang; Cen Qian; Baojian Zhu; Guoqing Wei
Journal:  Biochem Genet       Date:  2021-09-22       Impact factor: 1.890

3.  Structural characteristics and phylogenetic analysis of the mitochondrial genome of the rice leafroller, Cnaphalocrocis medinalis (Lepidoptera: Crambidae).

Authors:  Yonghua Yin; Fujuan Qu; Zhongwu Yang; Xiuyue Zhang; Bisong Yue
Journal:  Mol Biol Rep       Date:  2013-12-31       Impact factor: 2.316

4.  The complete mitochondrial genome of Leucoptera malifoliella Costa (Lepidoptera: Lyonetiidae).

Authors:  Yu-Peng Wu; Jin-Liang Zhao; Tian-Juan Su; Jie Li; Fang Yu; Douglas Chesters; Ren-Jun Fan; Ming-Chang Chen; Chun-Sheng Wu; Chao-Dong Zhu
Journal:  DNA Cell Biol       Date:  2012-08-02       Impact factor: 3.311

5.  The complete mitochondrial genomes of two ghost moths, Thitarodes renzhiensis and Thitarodes yunnanensis: the ancestral gene arrangement in Lepidoptera.

Authors:  Yong-Qiang Cao; Chuan Ma; Ji-Yue Chen; Da-Rong Yang
Journal:  BMC Genomics       Date:  2012-06-22       Impact factor: 3.969

6.  The complete mitochondrial genome of the rice moth, Corcyra cephalonica.

Authors:  Yu-Peng Wu; Jie Li; Jin-Liang Zhao; Tian-Juan Su; A-Rong Luo; Ren-Jun Fan; Ming-Chang Chen; Chun-Sheng Wu; Chao-Dong Zhu
Journal:  J Insect Sci       Date:  2012       Impact factor: 1.857

7.  Comparative mitochondrial genome analysis of Eudocima salaminia (Cramer, 1777) (Lepidoptera: Noctuoidea), novel gene rearrangement and phylogenetic relationship within the superfamily Noctuoidea.

Authors:  Muzafar Riyaz; Rauf Ahmad Shah; Ignacimuthu Savarimuthu; Sivasankaran Kuppusamy
Journal:  Mol Biol Rep       Date:  2021-06-09       Impact factor: 2.316

8.  Host and phenology shifts in the evolution of the social moth genus Thaumetopoea.

Authors:  Mauro Simonato; Andrea Battisti; Carole Kerdelhué; Christian Burban; Carlos Lopez-Vaamonde; Isabelle Pivotto; Paola Salvato; Enrico Negrisolo
Journal:  PLoS One       Date:  2013-02-27       Impact factor: 3.240

9.  The complete mitochondrial genome of the pink stem borer, Sesamia inferens, in comparison with four other Noctuid moths.

Authors:  Huan-Na Chai; Yu-Zhou Du
Journal:  Int J Mol Sci       Date:  2012-08-16       Impact factor: 6.208

10.  Characterization of the complete mitochondrion genome of diurnal Moth Amata emma (Butler) (Lepidoptera: Erebidae) and its phylogenetic implications.

Authors:  Hui-Fen Lu; Tian-Juan Su; A-Rong Luo; Chao-Dong Zhu; Chun-Sheng Wu
Journal:  PLoS One       Date:  2013-09-12       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.