| Literature DB >> 22577351 |
Jiasheng Hao1, Qianqian Sun, Huabin Zhao, Xiaoyan Sun, Yonghua Gai, Qun Yang.
Abstract
We here report the first complete mitochondrial (mt) genome of a skipper, Ctenoptilum vasava Moore, 1865 (Lepidoptera: Hesperiidae: Pyrginae). The mt genome of the skipper is a circular molecule of 15,468 bp, containing 2 ribosomal RNA genes, 24 putative transfer RNA (tRNA), genes including an extra copy of trnS (AGN) and a tRNA-like insertion trnL (UUR), 13 protein-coding genes and an AT-rich region. All protein-coding genes (PCGs) are initiated by ATN codons and terminated by the typical stop codon TAA or TAG, except for COII which ends with a single T. The intergenic spacer sequence between trnS (AGN) and ND1 genes also contains the ATACTAA motif. The AT-rich region of 429 bp is comprised of nonrepetitive sequences, including the motif ATAGA followed by an 19 bp poly-T stretch, a microsatellite-like (AT)(3) (TA)(9) element next to the ATTTA motif, an 11 bp poly-A adjacent to tRNAs. Phylogenetic analyses (ML and BI methods) showed that Papilionoidea is not a natural group, and Hesperioidea is placed within the Papilionoidea as a sister to ((Pieridae + Lycaenidae) + Nymphalidae) while Papilionoidae is paraphyletic to Hesperioidea. This result is remarkably different from the traditional view where Papilionoidea and Hesperioidea are considered as two distinct superfamilies.Entities:
Year: 2012 PMID: 22577351 PMCID: PMC3335176 DOI: 10.1155/2012/328049
Source DB: PubMed Journal: Comp Funct Genomics ISSN: 1531-6912
Primer sequences for the long PCR amplification used in this study.
| Primers | Upper primer sequence (5′-3′) | Lower primer sequence (5′-3′) |
|---|---|---|
| COI-COIII | GGAAATTGACTTGTGCCT | TTGTATGTTTACCTTGGA |
| COIII-ND4 | AAAGGATTACGATGAGGT | GGTCTTGTTATTGGTGGA |
| ND4-Cytb | CGTCTATGTAAACGCTCA | ATAAGGGTTTTCTACTGGT |
| Cytb-12S | TTTTACATCAAACAGGA | ACTAGGATTAGATACCC |
| 12S-COI | GAAACACTTTCCAGTACCT | CTAAACCAATTCAACATCC |
Lepidopteran mt genome sequences used in this study.
| Species | Superfamily and family | GenBank acc. no. | References | |
|---|---|---|---|---|
|
| Noctuoidea | Arctiidae | NC_014058 | [ |
|
| Lymantriidae | NC_012893 | [ | |
|
| Noctuidae | NC_014668 | [ | |
|
| Notodontidae | NC_011128 | [ | |
|
| Notodontidae | JF440342 | (Sun et al., unpublished) | |
|
| Geometroidea | Geometridae | NC_010522 | [ |
|
| Tortricoidea | Tortricidae | NC_014294 | [ |
|
| Tortricidae | NC_008141 | [ | |
|
| Tortricidae | NC_014806 | [ | |
|
| Bombycoidea | Saturniidae | NC_004622 | [ |
|
| Saturniidae | NC_012739 | [ | |
|
| Saturniidae | NC_012727 | [ | |
|
| Saturniidae | NC_010613 | [ | |
|
| Bombycidae | AY301620 | [ | |
|
| Bombycidae | NC_003395 | [ | |
|
| Bombycidae | NC_002355 | [ | |
|
| Sphingidae | NC_010266 | [ | |
|
| Pyraloidea | Crambidae | NC_013274 | [ |
|
| Crambidae | NC_003368 | [ | |
|
| Crambidae | NC_003367 | [ | |
|
| Papilionoidea | Nymphalidae | NC_014587 | [ |
|
| Nymphalidae | GQ376195 | [ | |
|
| Nymphalidae | NC_014223 | (Hakozaki et al., unpublished) | |
|
| Nymphalidae | JF439070 | [ | |
|
| Nymphalidae | HQ658143 | [ | |
|
| Papilionidae | NC_014055 | [ | |
|
| Papilionidae | NC_014053 | [ | |
|
| Papilionidae | NC_014398 | (Qin et al., unpublished) | |
|
| Papilionidae | EU625344 | (Jiang et al., unpublished) | |
|
| Lycaenidae | NC_007976 | [ | |
|
| Pieridae | HM156697 | [ | |
|
| Hesperoidea | Hesperiidae | JF713818 | This study |
Figure 1Map of the circular mitochondrial genome of Ctenoptilum vasava. Genes encoded in the H-strand (clockwise orientation) are colored in red or pink. Genes encoded in the L-strand (anticlockwise orientation) are colored in blue or green. The abbreviations for the genes are as follows: COI-III stands for cytochrome oxidase subunits, Cytb for cytochrome b, and ND1-6 for NADH dehydrogenase components. tRNAs are denoted as one-letter symbol according to the IUPAC-IUB single-letter amino acid codes.
Comparison of the mt genome characteristics of all butterfly species available. All genomes share the features of 13 protein-coding genes (ND1-6, ND4L, COI-III, Cytb, ATP6, ATP8), 2 ribosomal RNAs (lsu rRNA (12S) and lsu rRNA (16S)), and an AT-rich region.
| Taxon | Whole genomic DNA | lrRNA | srRNA | A+T-rich region | GenBank accession no. | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Size (bp) | A+T % | No. codonsa | PCGb A+T % | Size (bp) | A+T % | Size (bp) | A+T % | Size (bp) | A+T % | ||
|
| 15,680 | 80.4 | 3,748 | 78.5 | 1,387 | 83.6 | 779 | 85.4 | 489 | 94.3 | DQ073916 |
|
| 15,575 | 80.2 | 3,732 | 78.5 | 1,369 | 83.9 | 775 | 84.1 | 552 | 90.4 | AY242996 |
|
| 15,338 | 80.2 | 3,714 | 79.3 | 1,380 | 83.5 | 776 | 85.9 | 334 | 90.4 | EU726630 |
|
| 15,360 | 80.6 | 3,734 | 79.1 | 1,391 | 84.8 | 774 | 84.1 | 330 | 91.5 | EF622227 |
|
| 15,327 | 80.8 | 3,711 | 79.3 | 1,338 | 84.6 | 762 | 84.5 | 374 | 91.7 | FJ685653 |
|
| 15,656 | 81.4 | 3,720 | 79.5 | 1,378 | 84.4 | 783 | 85.6 | 494 | 95.5 | AB070264 |
|
| 15,516 | 81.8 | 3,718 | 80.2 | 1,391 | 84.7 | 777 | 86.8 | 324 | 95.1 | EU286785 |
|
| 15,499 | 81.1 | 3,724 | 79.0 | 1,400 | 85.1 | 803 | 87.5 | 457 | 98.5 | NC010522 |
|
| 15,389 | 81.3 | 3,734 | 80.2 | 1,344 | 83.8 | 773 | 85.1 | 504 | 93.6 | FJ871125 |
|
| 15,140 | 79.8 | 3,715 | 78.4 | 1,319 | 83.4 | 777 | 86.9 | 351 | 88.0 | NC010568 |
|
| 15,314 | 82.7 | 3,708 | 81.5 | 1,330 | 85.3 | 777 | 85.8 | 375 | 94.1 | DQ102703 |
|
| 15,489 | 79.1 | 3,728 | 76.8 | 1,335 | 83.7 | 775 | 85.3 | 678 | 94.5 | GQ868707 |
|
| 15,242 | 80.5 | 3,711 | 78.9 | 1,333 | 86.0 | 776 | 84.9 | 403 | 92.5 | JF437925 |
|
| 15,267 | 80.4 | 3,737 | 78.8 | 1,337 | 83.8 | 773 | 85.9 | 403 | 92.5 | HQ658143 |
|
| 15,156 | 80.8 | 4,164 | 79.7 | 1,330 | 84.5 | 778 | 85.2 | 349 | 95.4 | JF439070 |
|
| 15,245 | 79.7 | 3,745 | 78.0 | 1,331 | 83.8 | 788 | 83.8 | 430 | 96.0 | GQ376195 |
|
| 15,593 | 77.9 | 3,746 | 75.7 | 1,351 | 81.4 | 806 | 84.3 | 319 | 92.7 | AM946601 |
|
| 15,569 | 79.9 | 3,742 | 77.8 | 1,351 | 84.2 | 799 | 85.2 | 435 | 96.1 | FJ617240 |
|
|
|
|
|
|
|
|
|
|
|
|
|
Summary of the skipper mt genome. IGNc stands for intergenic nucleotides, and the positive number indicates interval nucleotides (base pairs) between genes, while the negative number indicates the overlapped nucleotides (base pairs) between genes.
| Gene | Direction | Nucleotide no. | Size | IGNc | Start codon | Stop codon |
|---|---|---|---|---|---|---|
| tRNAMet | F | 1–68 | 68 | 0 | ||
| tRNAlle | F | 69–133 | 65 | 0 | ||
| tRNAGln | R | 136–204 | 69 | 2 | ||
| ND2 | F | 266–1270 | 1005 | 61 | ATT | TAG |
| tRNATrp | F | 1269–1335 | 67 | −2 | ||
| tRNACys | R | 1331–1395 | 65 | −5 | ||
| tRNATyr | R | 1407–1471 | 65 | 11 | ||
| COI | F | 1469–2990 | 1522 | −3 | ATT | T-tRNA |
| tRNALeu(UUR) | F | 2991–3057 | 67 | 0 | ||
| COII | F | 3058–3731 | 674 | 0 | ATG | T-tRNA |
| tRNALys | F | 3732–3801 | 70 | 0 | ||
| tRNAAsp | F | 3816–3884 | 69 | 14 | ||
| ATP8 | F | 3885–4052 | 168 | 0 | ATA | TAA |
| ATP6 | F | 4046–4723 | 678 | −7 | ATG | TAA |
| COIII | F | 4723–5505 | 783 | −1 | ATG | TAA |
| tRNAGly | F | 5508–5574 | 67 | 2 | ||
| ND3 | F | 5575–5928 | 354 | 0 | ATT | TAG |
| tRNAAla | F | 5927–5990 | 64 | −2 | ||
| tRNAArg | F | 5996–6061 | 66 | 5 | ||
| tRNAAsn | F | 6061–6125 | 65 | −1 | ||
| tRNASer(AGN) | F | 6133–6194 | 62 | 7 | ||
| tRNASer(AGN) | F | 6208–6267 | 60 | 13 | ||
| tRNAGlu | F | 6279–6344 | 66 | 11 | ||
| tRNAPhe | R | 6343–6409 | 67 | −2 | ||
| ND5 | R | 6409–8145 | 1737 | −1 | ATA | TAA |
| tRNAHis | R | 8161–8228 | 68 | 15 | ||
| ND4 | R | 8228–9568 | 1341 | −1 | ATG | TAA |
| ND4L | R | 9574–9855 | 282 | 5 | ATG | TAA |
| tRNAThr | F | 9869–9933 | 65 | 13 | ||
| tRNAPro | R | 9934–9998 | 65 | 0 | ||
| ND6 | F | 10001–10531 | 531 | 2 | ATA | TAA |
| Cytb | F | 10531–11679 | 1149 | −1 | ATG | TAA |
| tRNASer | F | 11679–11745 | 67 | −1 | ||
| ND1 | R | 11763–12704 | 942 | 17 | ATG | TAA |
| tRNALeu(CUN) | R | 12706–12773 | 68 | 1 | ||
| tRNALeu(UUR) | F | 12774–12852 | 79 | 0 | ||
| 16S | R | 12853–14195 | 1343 | 0 | ||
| tRNAVal | R | 14196–14265 | 70 | 0 | ||
| 12S | R | 14266–15039 | 774 | 0 | ||
| A+T-rich region | 15040–15468 | 429 | 0 |
Amino acid composition of the mt genome of the skipper. Start codons and stop codons were excluded in total codon counts.
| Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Total | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ND2 | 2.10 | 1.20 | 0.30 | 1.50 | 17.72 | 3.30 | 0.60 | 13.81 | 5.11 | 11.11 | 8.41 | 8.41 | 1.50 | 1.20 | 0.90 | 12.01 | 3.00 | 1.20 | 2.40 | 4.20 | 333 |
| COI | 5.72 | 0.39 | 2.76 | 1.58 | 8.88 | 7.89 | 2.96 | 10.65 | 1.78 | 11.64 | 6.34 | 4.93 | 5.13 | 1.97 | 1.38 | 9.47 | 5.52 | 4.14 | 2.56 | 4.34 | 507 |
| COII | 3.59 | 0.9 | 3.59 | 4.48 | 7.17 | 2.69 | 2.24 | 13.9 | 2.24 | 12.11 | 4.48 | 9.87 | 4.48 | 4.04 | 3.14 | 6.73 | 4.48 | 3.59 | 2.69 | 3.59 | 223 |
| ATP8 | 0.00 | 0.00 | 0.00 | 0.00 | 12.96 | 0.00 | 0.00 | 20.37 | 9.26 | 5.56 | 9.26 | 12.96 | 5.56 | 1.85 | 0.00 | 3.70 | 1.85 | 0.00 | 5.56 | 11.11 | 54 |
| ATP6 | 2.23 | 0.00 | 0.45 | 1.79 | 9.82 | 4.46 | 2.23 | 12.95 | 0.89 | 17.41 | 4.91 | 7.14 | 4.91 | 2.23 | 1.34 | 11.16 | 7.14 | 4.02 | 2.23 | 2.68 | 224 |
| COIII | 5.02 | 0.77 | 1.54 | 2.70 | 11.20 | 6.56 | 5.02 | 12.74 | 1.54 | 10.42 | 2.70 | 4.25 | 3.86 | 2.32 | 1.54 | 6.18 | 7.72 | 3.47 | 5.02 | 5.41 | 259 |
| ND1 | 2.88 | 1.28 | 2.24 | 3.21 | 10.26 | 5.13 | 0.00 | 9.62 | 3.53 | 18.59 | 5.13 | 4.49 | 1.92 | 1.28 | 1.60 | 11.86 | 3.21 | 4.49 | 1.92 | 7.37 | 312 |
| Cytb | 4.46 | 0.79 | 1.84 | 1.05 | 9.45 | 5.77 | 2.10 | 14.17 | 2.89 | 14.44 | 3.67 | 7.09 | 5.77 | 2.36 | 1.84 | 5.51 | 5.25 | 3.67 | 3.15 | 4.72 | 381 |
| ND6 | 1.14 | 0.57 | 0.57 | 0.57 | 12.57 | 2.86 | 0.57 | 15.43 | 4.57 | 18.29 | 7.43 | 10.86 | 1.14 | 1.14 | 0.57 | 6.29 | 3.43 | 2.86 | 1.14 | 8.00 | 175 |
| ND4L | 1.09 | 1.09 | 1.09 | 4.35 | 10.87 | 4.35 | 3.26 | 15.22 | 3.26 | 18.48 | 11.96 | 3.26 | 0.00 | 1.09 | 1.09 | 9.78 | 0.00 | 5.43 | 0.00 | 4.43 | 92 |
| ND4 | 3.15 | 2.02 | 0.90 | 2.02 | 9.66 | 6.07 | 1.35 | 9.66 | 0.36 | 17.30 | 10.56 | 4.94 | 2.47 | 1.35 | 1.12 | 8.99 | 1.80 | 3.15 | 2.70 | 7.19 | 445 |
| ND5 | 3.29 | 0.87 | 2.25 | 1.91 | 9.36 | 4.16 | 0.69 | 10.75 | 3.81 | 16.64 | 9.88 | 5.72 | 1.91 | 1.04 | 0.87 | 11.44 | 2.60 | 4.51 | 1.56 | 6.76 | 577 |
| ND3 | 1.72 | 1.72 | 2.59 | 3.45 | 11.21 | 1.72 | 1.72 | 19.83 | 6.03 | 12.96 | 5.17 | 6.03 | 3.45 | 0.86 | 1.72 | 10.34 | 3.45 | 1.72 | 2.59 | 1.72 | 116 |
|
| |||||||||||||||||||||
| Total | 36.39 | 11.60 | 20.12 | 28.61 | 141.13 | 54.96 | 22.74 | 179.10 | 45.27 | 185.0 | 89.90 | 89.95 | 42.10 | 22.73 | 17.11 | 103.12 | 46.00 | 42.25 | 33.52 | 71.52 | 3698 |
The codon usage of the skipper mt genome. n: frequency of codon used; RSCU: relative synonymous codon usage; *stop codon. Start codons and stop codons were excluded in total codon counts.
| Codon (Aa) |
| Codon (Aa) |
| Codon (a) |
| Codon (Aa) |
|
|---|---|---|---|---|---|---|---|
| UUU (F) | 351.0 (1.81) | UCU (S) | 116.0 (2.71) | UAU (Y) | 192.0 (1.90) | UGU (C) | 34.0 (1.94) |
| UUC (F) | 37.0 (0.19) | UCC (S) | 16.0 (0.37) | UAC (Y) | 10.0 (0.10) | UGC (C) | 1.0 (0.06) |
| UUA (L) | 458.0 (5.07) | UCA (S) | 84.0 (1.96) | UAA (*) | 0.0 (0.00) | UGA (W) | 86.0 (1.87) |
| UUG (L) | 13.0 (0.14) | UCG (S) | 7.0 (0.16) | UAG (*) | 0.0 (0.00) | UGG (W) | 6.0 (0.13) |
| CUU (L) | 45.0 (0.50) | CCU (P) | 74.0 (2.45) | CAU (H) | 55.0 (1.72) | CGU (R) | 15.0 (1.20) |
| CUC (L) | 6.0 (0.07) | CCC (P) | 17.0 (0.56) | CAC (H) | 9.0 (0.28) | CGC (R) | 0.0 (0.00) |
| CUA (L) | 19.0 (0.21) | CCA (P) | 29.0 (0.96) | CAA (Q) | 63.0 (1.97) | CGA (R) | 31.0 (2.48) |
| CUG (L) | 1.0 (0.01) | CCG (P) | 1.0 (0.03) | CAG (Q) | 1.0 (0.03) | CGG (R) | 4.0 (0.32) |
| AUU (I) | 431.0 (1.89) | ACU (T) | 94.0 (2.54) | AAU (N) | 209.0 (1.79) | AGU (S) | 36.0 (0.84) |
| AUC (I) | 26.0 (0.11) | ACC (T) | 7.0 (0.19) | AAC (N) | 25.0 (0.21) | AGC (S) | 3.0 (0.07) |
| AUA (M) | 245.0 (1.91) | ACA (T) | 45.0 (1.22) | AAA (K) | 106.0 (1.77) | AGA (S) | 74.0 (1.73) |
| AUG (M) | 12.0 (0.09) | ACG (T) | 2.0 (0.05) | AAG (K) | 14.0 (0.23) | AGG (S) | 6.0 (0.14) |
| GUU (V) | 61.0 (1.86) | GCU (A) | 77.0 (2.44) | GAU (D) | 58.0 (1.81) | GGU (G) | 46.0 (1.00) |
| GUC (V) | 1.0 (0.03) | GCC (A) | 13.0 (0.41) | GAC (D) | 6.0 (0.19) | GGC (G) | 5.0 (0.11) |
| GUA (V) | 61.0 (1.86) | GCA (A) | 33.0 (1.05) | GAA (E) | 71.0 (1.84) | GGA (G) | 109.0 (2.37) |
| GUG (V) | 8.0 (0.24) | GCG (A) | 3.0 (0.10) | GAG (E) | 6.0 (0.16) | GGG (G) | 24.0 (0.52) |
Figure 2Predicted secondary clover-leaf structure for the tRNA genes of Ctenoptilum vasava. Dashes: Watson-Crick base pairing; centered dots (+): unmatched base pairing.
Figure 3Repeated segments in A+T-rich region in the skipper mt genome. The consensus pattern (20 bp) repeat unit (TATAATTTAATAAAAAAAATA) is highlighted in red and purple. Numbers are shown as the positions of starting nucleotides.
Figure 4Phylogenetic trees of the lepidopterans based on 13 PCG nucleotide sequences. (a) ML tree. (b) BI tree. Numbers at each node indicate bootstrap percentages of 100 pseudoreplicates for ML analysis and posterior probability for BI analysis.