| Literature DB >> 20376208 |
Fang Liao1, Lin Wang, Song Wu, Yu-Ping Li, Lei Zhao, Guo-Ming Huang, Chun-Jing Niu, Yan-Qun Liu, Ming-Gang Li.
Abstract
The complete mitochondrial genome (mitogenome) of the fall webworm, Hyphantria cunea (Lepidoptera: Arctiidae) was determined. The genome is a circular molecule 15 481 bp long. It presents a typical gene organization and order for completely sequenced lepidopteran mitogenomes, but differs from the insect ancestral type for the placement of tRNA(Met). The nucleotide composition of the genome is also highly A + T biased, accounting for 80.38%, with a slightly positive AT skewness (0.010), indicating the occurrence of more As than Ts, as found in the Noctuoidea species. All protein-coding genes (PCGs) are initiated by ATN codons, except for COI, which is tentatively designated by the CGA codon as observed in other lepidopterans. Four of 13 PCGs harbor the incomplete termination codon, T or TA. All tRNAs have a typical clover-leaf structure of mitochondrial tRNAs, except for tRNA(Ser)(AGN), the DHU arm of which could not form a stable stem-loop structure. The intergenic spacer sequence between tRNA(Ser)(AGN) and ND1 also contains the ATACTAA motif, which is conserved across the Lepidoptera order. The H. cunea A+T-rich region of 357 bp is comprised of non-repetitive sequences, but harbors several features common to the Lepidoptera insects, including the motif ATAGA followed by an 18 bp poly-T stretch, a microsatellite-like (AT)(8) element preceded by the ATTTA motif, an 11 bp poly-A present immediately upstream tRNA(Met). The phylogenetic analyses support the view that the H. cunea is closerly related to the Lymantria dispar than Ochrogaster lunifer, and support the hypothesis that Noctuoidea (H. cunea, L. dispar, and O. lunifer) and Geometroidea (Phthonandria atrilineata) are monophyletic. However, in the phylogenetic trees based on mitogenome sequences among the lepidopteran superfamilies, Papillonoidea (Artogeia melete, Acraea issoria, and Coreana raphaelis) joined basally within the monophyly of Lepidoptera, which is different to the traditional classification.Entities:
Keywords: Arctiidae; Fall webworm; Hyphantria cunea; Lepidoptera; Mitochondrial genome; Phylogeny
Mesh:
Substances:
Year: 2010 PMID: 20376208 PMCID: PMC2850540 DOI: 10.7150/ijbs.6.172
Source DB: PubMed Journal: Int J Biol Sci ISSN: 1449-2288 Impact factor: 6.580
List of the complete mitogenome of Lepidoptera
| Superfamily / Family | Species | Acc. number | Reference |
|---|---|---|---|
| Noctuoidea | |||
| Arctiidae | GU592049 | This study | |
| Lymantriidae | FJ617240 | Zhu et al. unpublished | |
| Notodontidae | AM946601 | ||
| Geometroidea | |||
| Geometridae | EU569764 | ||
| Bombycoidea | |||
| Saturniidae | AY242996 | ||
| Saturniidae | EU726630 | ||
| Saturniidae | EF622227 | ||
| Saturniidae | FJ685653 | ||
| Bombycidae | AB070264 | ||
| Bombycidae | Chinese | AY301620 | |
| Bombycidae | Japanese | NC_003395 | |
| Sphingidae | EU286785 | ||
| Pyraloidea | |||
| Crambidae | FJ240227 | Li and Yue, unpublished | |
| Crambidae | NC_003368 | ||
| Crambidae | NC_003367 | ||
| Tortricoidea | |||
| Tortricidae | DQ073916 | ||
| Papilionoidea | |||
| Nymphalidae | NC_013604 | ||
| Pieridae | EU597124 | ||
| Lycaenidae | DQ102703 | ||
| Papilionidae | EU622524 | Liu et al. unpublished | |
| Papilionidae | EF621724 | Feng et al. unpublished |
Fig 1Linear map of the mitogenome of Hyphantria cunea. The tRNAs are labeled according to the IUPAC-IUB single letter amino acid codes above the bar indicating coding sequence on major strand or below the bar showing on minor strand. One-letter symbol L, L*, S and S* denote codon tRNA, tRNA, tRNA, and tRNA, respectively. Underlined PCGs or rRNA genes are located on minor strand and PCGs that are not underlined are located on major strand. Overlapping lines (F1-F4) under the map denote four overlapping PCR fragments amplified for sequencing. The line at the lower left represents the map scale.
Primers used to amplify the Hyphantria cunea mitogenome
| Primer | Sequence (5'- 3') | Fragment | Reference |
|---|---|---|---|
| LYQ1 | GGTCAACAAATCATAAAGATATTGG | F1 | |
| LYQ2 | TAAACTTCAGGGTGACCAAAAAATCA | ||
| LYQ5 | TATGTACTACCATGAGGACAAATATC | F2 | |
| LYQ6 | ATTACACCTCCTAATTTATTAGGAAT | ||
| LYQ29 | CTTTTCTATTACTTCTTTCTCTTCCTGTTTTA | F3 | This study |
| LYQ32 | TAAAATAATAAATGGTAATAAAAAATGAAATG | ||
| LYQ30 | TAAAACAGGAAGAGAAAGAAGTAATAGAAAAG | F4 | This study |
| LYQ31 | CATTTCATTTTTTATTACCATTTATTATTTTA |
Annotation and gene organization of the Hyphantria cunea mitogenome
| Gene | Strand | Nucleotide no. | Size(bp) | Anticodon | Non | OL | Start codon | Stop codon |
|---|---|---|---|---|---|---|---|---|
| J | 1-67 | 67 | CAT | |||||
| J | 68-136 | 69 | GAT | 11 | ||||
| N | 148-216 | 69 | TTG | 50 | ||||
| J | 267-1277 | 1011 | 18 | ATT | TAA | |||
| J | 1 296-1 364 | 69 | TCA | 8 | ||||
| N | 1 357-1 419 | 63 | GCA | 4 | ||||
| N | 1 424-1 489 | 66 | GTA | 11 | ||||
| J | 1 501-3 034 | 1534 | CGA | T-tRNA | ||||
| J | 3 035-3 100 | 66 | TAA | |||||
| J | 3 101-3 782 | 682 | ATG | T-tRNA | ||||
| J | 3 783-3 853 | 71 | CTT | 1 | ||||
| J | 3 853-3 918 | 66 | GTC | |||||
| J | 3 919-4 080 | 162 | 7 | ATA | TAA | |||
| J | 4 074-4 750 | 677 | 5 | ATG | TA-COIII | |||
| J | 4 756-5 547 | 792 | 9 | ATG | TAA | |||
| J | 5 557-5 621 | 65 | TCC | |||||
| J | 5 622-5 975 | 354 | ATT | TAA | ||||
| J | 5 976-6 042 | 67 | TGC | 7 | ||||
| J | 6 050-6 116 | 67 | TCG | 5 | ||||
| J | 6 122-6 188 | 67 | GTT | 9 | ||||
| J | 6 198-6 263 | 66 | GCT | 21 | ||||
| J | 6 285-6 352 | 68 | TTC | 2 | ||||
| N | 6 351-6 418 | 68 | GAA | 3 | ||||
| N | 6 422-8 167 | 1746 | ATA | TAA | ||||
| N | 8 168-8 235 | 68 | GTG | |||||
| N | 8 236-9 574 | 1339 | ATG | T-tRNA | ||||
| N | 9 575-9 862 | 288 | 5 | ATG | TAA | |||
| J | 9 868-9 932 | 65 | TGT | |||||
| N | 9 933-9 997 | 65 | TGG | 7 | ||||
| J | 10 005-10 535 | 531 | 13 | ATT | TAA | |||
| J | 10 549-11 697 | 1149 | 17 | ATA | TAA | |||
| J | 11 715-11 782 | 68 | TGA | 34 | ||||
| N | 11 817-12 755 | 939 | 1 | ATG | TAA | |||
| N | 12 757-12 824 | 68 | TAG | |||||
| N | 12 825-14 250 | 1426 | ||||||
| N | 14 251-14 316 | 66 | TAC | |||||
| N | 14 317-15 124 | 808 | ||||||
| A+T-rich region | 15 125-15 481 | 357 |
J-strand, majority-coding strand; N-strand, minority-coding strand; Non, non-coding region; OL, overlapping region.
Composition and skewness in the major strand of lepidopteran mitogenomes
| Species | size (bp) | A% | G% | T% | C% | A+T % | AT skew | GC skew |
|---|---|---|---|---|---|---|---|---|
| Whole genome | ||||||||
| 15 481 | 40.58 | 7.55 | 39.80 | 12.07 | 80.38 | 0.010 | -0.230 | |
| 15 569 | 40.58 | 7.57 | 39.30 | 12.55 | 79.88 | 0.016 | -0.247 | |
| 15 593 | 40.09 | 7.56 | 0.030 | - | ||||
| 15 499 | 40.78 | 7.67 | 40.24 | 11.31 | 81.02 | 0.007 | -0.192 | |
| 15 566 | 39.22 | 7.77 | 40.94 | 12.06 | 80.16 | -0.021 | -0.216 | |
| 15 338 | 39.26 | 7.69 | 41.04 | 12.02 | 80.29 | -0.022 | -0.219 | |
| 15 360 | 39.34 | 7.58 | 41.28 | 11.79 | 80.62 | -0.024 | -0.217 | |
| 15 327 | 39.17 | 7.63 | 41.65 | 11.55 | 80.82 | -0.030 | -0.204 | |
| 15 656 | 43.06 | 7.31 | 38.30 | 11.33 | 81.36 | -0.216 | ||
| Chinese | 15 682 | 7.40 | 38.48 | 11.01 | 81.59 | 0.057 | -0.196 | |
| Japanese | 43.08 | 7.21 | 38.60 | 11.11 | 81.68 | 0.055 | -0.213 | |
| 15 516 | 40.67 | 7.46 | 41.11 | 10.76 | 81.79 | -0.005 | -0.181 | |
| 15 490 | 40.87 | 7.42 | 39.15 | 12.56 | 80.02 | 0.021 | -0.257 | |
| 14 535 | 41.36 | 38.81 | 11.82 | 80.17 | 0.031 | -0.192 | ||
| 14 536 | 41.46 | 7.91 | 38.92 | 11.71 | 80.37 | 0.032 | -0.194 | |
| 15 680 | 40.15 | 7.88 | 40.24 | 11.73 | 80.39 | -0.001 | -0.178 | |
| 15 245 | 7.74 | 40.81 | 12.50 | 79.76 | -0.023 | -0.235 | ||
| 40.38 | 7.87 | 39.41 | 12.35 | 79.78 | 0.012 | -0.222 | ||
| 15 314 | 39.37 | - | - | |||||
| 13 860 | 40.07 | 7.74 | 40.44 | 11.75 | 80.51 | -0.005 | -0.206 | |
| 13 964 | 39.53 | 7.85 | 40.45 | 12.17 | 79.98 | -0.012 | -0.216 | |
| A+T-rich region | ||||||||
| 357 | 45.66 | 1.12 | 49.30 | 3.92 | 94.96 | -0.038 | -0.556 | |
| 435 | 45.29 | 1.61 | 50.50 | 2.30 | 96.09 | -0.054 | -0.176 | |
| 44.51 | 1.57 | 48.90 | 5.02 | 93.42 | -0.047 | -0.524 | ||
| 457 | 40.70 | 0.66 | 57.55 | - | -0.246 | |||
| 552 | 49.28 | 5.43 | 90.40 | -0.090 | -0.127 | |||
| 334 | 41.62 | 3.59 | 47.90 | 89.52 | -0.070 | -0.315 | ||
| 330 | 42.12 | 2.12 | 49.39 | 6.36 | 91.52 | -0.079 | -0.500 | |
| 358 | 42.18 | 2.51 | 50.00 | 5.31 | 92.18 | -0.085 | -0.358 | |
| 494 | 44.94 | 1.62 | 50.61 | 2.83 | 95.55 | -0.059 | -0.272 | |
| Chinese | 484 | 46.49 | 2.69 | 47.93 | 2.89 | 94.42 | -0.015 | -0.036 |
| Japanese | 45.52 | 2.41 | 49.67 | 2.41 | 95.18 | -0.043 | ||
| 324 | 45.00 | 1.54 | 50.31 | 3.29 | 95.37 | -0.055 | -0.334 | |
| 335 | 43.28 | 4.48 | 94.93 | -0.088 | - | |||
| 489 | 2.86 | 45.81 | 2.86 | 94.27 | ||||
| 430 | 45.81 | 1.40 | 50.23 | 2.56 | 96.05 | -0.046 | -0.293 | |
| 351 | 43.87 | 3.13 | 7.69 | -0.016 | -0.421 | |||
| 375 | 44.27 | 1.33 | 49.87 | 4.53 | 94.13 | -0.059 | -0.545 |
* partial mitogenome lacking of the A+T-rich region.
Fig 2Alignment of initiation region for the cytochrome oxidase subunit I (COI) genes of lepidopteran insects. The Diptera insect Anopheles funestus was included due to fact that the translation initiation codon for the COI gene was determined by analysis of the transcript information analysis 41. The first four or five codons and their amino acids are shown on the right-hand side of the figure. Boxed nucleotides are the presumed translation initiators, which have been postulated as the initiation codon for COI in each species. Underlined nucleotides indicate the adjacent partial sequence of tRNA. Arrows indicate the direction of transcription.
Codon usage of the protein-coding genes in Hyphantria cunea mitogenome*
| Codon (aa) | n | % | RSCU | Codon (aa) | n | % | RSCU | Codon (aa) | n | % | RSCU |
|---|---|---|---|---|---|---|---|---|---|---|---|
| UUU(F) | 8.95 | 1.99 | UCU(S) | 113 | 3.05 | 2.86 | UAU(Y) | 180 | 4.85 | 1.82 | |
| UUC(F) | 2 | 0.59 | 0.01 | UCC(S) | 10 | 0.27 | 0.25 | UAC(Y) | 18 | 0.49 | 0.18 |
| UUA(L) | 0.10 | 4.85 | UCA(S) | 81 | 2.18 | 2.05 | UAA(*) | - | - | - | |
| UUG(L) | 35 | 0.94 | 0.38 | UCG(S) | 2 | 0.05 | 0.05 | UAG(*) | - | - | - |
| CUU(L) | 43 | 1.16 | 0.47 | AGU(S) | 28 | 0.75 | 0.71 | CAU(H) | 56 | 1.51 | 1.65 |
| CUC(L) | 1 | 0.03 | 0.01 | AGC(S) | 0 | 0 | 0 | CAC(H) | 12 | 0.32 | 0.35 |
| CUA(L) | 20 | 0.54 | 0.22 | AGA(S) | 82 | 2.21 | 2.08 | CAA(Q) | 60 | 1.62 | 1.97 |
| CUG(L) | 2 | 0.05 | 0.02 | AGG(S) | 0 | 0 | 0 | CAG(Q) | 1 | 0.03 | 0.03 |
| AUU(I) | 11.05 | 1.84 | ACU(U) | 73 | 1.97 | 1.93 | AAU(N) | 215 | 5.79 | 1.69 | |
| AUC(I) | 35 | 0.94 | 0.16 | ACC(U) | 14 | 0.38 | 0.37 | AAC(N) | 39 | 1.05 | 0.31 |
| AUA(M) | 7.14 | 1.91 | ACA(U) | 62 | 1.68 | 1.64 | AAA(K) | 100 | 2.69 | 1.82 | |
| AUG(M) | 12 | 0.32 | 0.09 | ACG(U) | 2 | 0.05 | 0.05 | AAG(K) | 10 | 0.27 | 0.18 |
| GUU(V) | 90 | 2.43 | 2.37 | GCU(A) | 84 | 2.26 | 2.63 | GGU(G) | 68 | 1.83 | 1.35 |
| GUC(V) | 5 | 0.13 | 0.13 | GCC(A) | 6 | 0.16 | 0.19 | GGC(G) | 3 | 0.08 | 0.06 |
| GUA(V) | 50 | 1.35 | 1.32 | GCA(A) | 38 | 1.02 | 1.19 | GGA(G) | 108 | 2.91 | 2.14 |
| GUG(V) | 7 | 0.19 | 0.18 | GCG(A) | 0 | 0 | 0 | GGG(G) | 23 | 0.62 | 0.26 |
| UGU(C) | 28 | 0.75 | 2.00 | CGU(R) | 14 | 0.38 | 1.08 | CCU(P) | 75 | 2.02 | 2.36 |
| UGC(C) | 0 | 0 | 0 | CGC(R) | 0 | 0 | 0 | CCC(P) | 11 | 0.30 | 0.35 |
| UGA(W) | 92 | 2.48 | 1.88 | CGA(R) | 36 | 0.97 | 2.77 | CCA(P) | 41 | 1.10 | 1.29 |
| UGG(W) | 6 | 0.16 | 0.12 | CGG(R) | 2 | 0.05 | 0.15 | CCG(P) | 0 | 0 | 0 |
| GAU(D) | 59 | 1.59 | 1.84 | GAA(E) | 66 | 1.78 | 1.74 | ||||
| GAC(D) | 5 | 0.13 | 0.16 | GAG(E) | 10 | 0.27 | 0.26 |
*A total of 3711 codons were analyzed excluding the initiation and termination codons. RSCU, relative sunonymous codon usage.
Fig 3Inferred secondary structure of the 22 tRNAs of the Hyphantria cunea mitogenome. +, GT pairs; -, AT/GC pairs. The tRNAs are labeled with the abbreviations of their corresponding amino acids. Arms of tRNAs (clockwise from top) are the amino acid acceptor arm, the TψC arm, the anticodon arm, and the dihydrouridine arm.
Fig 4Alignment of the intergenic spacer region between tRNAand ND1 of lepidopteran insects. The shaded ATACTAA motif 19 was conserved across the Lepidoptera order.
Fig 5The features present in the A+T-rich region of Hyphantria cunea. The sequence is shown in the N strand.
Fig 6Phylogeny of lepidopteran insects. (A) Current hypothesis of lepidopteran superfamily relationships after Kristensen and Skalski (1999) 50. Phylogenetic trees inferred from amino acid sequences of 13 PCGs of the mitogenome by using MP analysis (B) and NJ analysis (C). Drosophila yakuba 33 and Anopheles gambiae 34 were used as outgroups. The numbers above branches specify bootstrap percentages (1000 replicates).