Literature DB >> 28767123

Complete mitochondrial genome of the lappet moth, Kunugia undans (Lepidoptera: Lasiocampidae): genomic comparisons among macroheteroceran superfamilies.

Min Jee Kim1, Jun Seong Jeong1, Jong Seok Kim1, Su Yeon Jeong1, Iksoo Kim1.   

Abstract

The mitochondrial genome (mitogenome) characteristics of the monotypic Lasiocampoidea are largely unknown, because only limited number of mitogenomes is available from this superfamily. In this study, we sequenced the complete mitogenome of the lappet moth, Kunugia undans (Lepidoptera: Lasiocampidae) and compared it to those of Lasiocampoidea and macroheteroceran superfamilies (59 species in six superfamilies). The 15,570-bp K. undans genome had one additional trnR that was located between trnA and trnN loci and this feature was unique in Macroheterocera, including Lasiocampoidea. Considering that the two trnR copies are located in tandem with proper secondary structures and identical anticodons, a gene duplication event might be responsible for the presence of the two tRNAs. Nearly all macroheteroceran species, excluding Lasiocampoidea, have a spacer sequence (1-34 bp) at the trnS2 and ND1 junction, but most lasiocampid species, including K. undans, have an overlap at the trnS2 and ND1 junction, which represents a different genomic feature in Lasiocampoidea. Nevertheless, a TTAGTAT motif, which is typically detected in Macroheterocera at the trnS2 and ND1 junction, was also detected in all Lasiocampoidea. In summary, the general mitogenome characteristics of Lasiocampoidea did not differ greatly from the remaining macroheteroceran superfamilies, but it did exhibit some unique features.

Entities:  

Year:  2017        PMID: 28767123      PMCID: PMC5596373          DOI: 10.1590/1678-4685-GMB-2016-0298

Source DB:  PubMed          Journal:  Genet Mol Biol        ISSN: 1415-4757            Impact factor:   1.771


Introduction

The typical metazoan mitochondrial genome (mitogenome) consists of 13 protein-coding genes (PCGs), 22 tRNAs, two rRNAs, and a major non-coding sequence referred to as the A+T-rich region. The characteristic features of the mitogenome (e.g., fast evolution, low recombination rates, and multiple copies per cell) are considered beneficial in several biological fields (Cameron, 2014). In particular, whole mitogenome sequences have been utilized for phylogenic analyses of several insect lineages (Dowton ; Kim ; Lu ; Mao , 2015; Timmermans ), and genomic characteristics have also been scrutinized to understand phylogenetic and evolutionary features of given taxonomic groups (Cameron and Whiting; Wan ; Kim ). Mitogenome sequences in insects have been compiled in nearly 1,000 species that represent all insect orders and the Lepidoptera. As one of the four most species-rich insect orders, Lepidoptera is represented by 338 mitogenomes in GenBank (last visited on August 14, 2016), including 37 nearly complete sequences from 23 superfamilies. Among these, the monotypic Lasiocampoidea is represented by four species in two genera. Considering that the monotypic superfamily consists of 1,952 species with five subfamilies (van Nieukerken ), mitogenome sequences from additional diverse taxonomic groups could be required for mitogenome-based phylogenetic studies. In fact, recent large-scale mitogenome-based lepidopteran phylogenies only included a single genus or a single species (Timmermans ; Ramírez-Ríos ). The lappet moth, Kunugia undans (Walker) (Lepidoptera: Lasiocampidae), is distributed in South Korea (excluding the far eastern Ulleungdo Island), far eastern Russia, Japan, and Australia (Park ; Shin, 2001). In Korea, adults are found from September to October, eggs then overwinter, and larvae hatch in the spring (Park ). Its host plants are Castanea crenata S. et Z., Quercus acutissima Carr., Quercus variabilis Bl. in Fagaceae, and Malus pumila var. dulcissima Koidz. in Rosaceae (Park ). Variations in size, coloration, and lines on the wings are present. The wingspan of the species is 56–65 mm in males and 79–92 mm in females, and forewings have a small white spot at the medial cell (Shin, 2001). In this study, we determined the complete mitogenome sequence of the lappet moth K. undans, adding a new mitogenome sequence of a previously unreported genus of Lasiocampoidea. The genomic characteristics of the sequence were compared to those of other lasiocampid species in terms of genome structure, genomic arrangement, nucleotide composition, codon usage, etc. Furthermore, to better understand the evolutionary characteristics of the Lasiocampoidea, including K. undans, the mitogenome sequences were compared to the representatives of the Macroheterocera clade, to which Lasiocampoidea belongs.

Materials and Methods

DNA extraction, PCR and sequencing

An adult K. undans was collected from Shinan-gun in Jeollanamdo Province in Korea (34°3'60″ N, 125°6'50″ E) in 2009. After collection in the field, the sample was prepared as a dried specimen and deposited at Chonnam National University, Gwangju, Korea under the accession code KTOL-Bom-27. DNA was extracted from the hind legs using a Wizard Genomic DNA Purification Kit, in accordance with the manufacturer's instructions (Promega, Madison, WI, USA). For whole mitogenome sequencing, primers that amplify three long overlapping fragments (LF1 from COI and ND4, LF2 from ND5 to lrRNA, and LF3 from lrRNA to COI) were adapted from Kim . Three long fragments (LFs) were amplified using LA TaqTM (Takara Biomedical, Tokyo, Japan) under the following conditions: 96 °C for 2 min; 30 cycles of 98 °C for 10 sec and 48 °C for 15 min; and a final extension step of 72 °C for 10 min. Using the LFs as templates, 26 overlapping short fragments (SF) were amplified using the primers adapted from Kim and AccuPower® PCR Pre-Mix (Bioneer, Daejeon, Korea). The PCR conditions for SFs were as follows: denaturation for 5 min at 94 °C; 35 cycles of 1 min denaturation at 94 °C; 1 min annealing at 48–51 °C; 1 min extension at 72 °C; and a final extension of 7 min at 72 °C. Primers used to amplify and sequence the LFs and SFs are presented in Table S1. DNA sequencing was conducted using the ABI PRISM® BigDye® Terminator v3.1 Cycle Sequencing Kit and an ABI PRISMTM 3100 Genetic Analyzer (PE Applied Biosystems, Foster City, CA, USA). All products were sequenced from both directions.

Gene annotation

Individual SF sequences were assembled into the complete mitogenome using Seqman software (DNASTAR, Madison, Wisconsin, USA). Identification, boundary delimitation, and secondary structure folding of tRNAs were performed using tRNAscan-SE 1.21 with the search mode set as default, the Mito/Chloroplast as the searching source, the genetic code of invertebrate mitogenomes for tRNA isotype prediction, and a cove score cut-off of 1 (Lowe and Eddy, 1997). Twenty-one tRNAs were detected based on these parameters. However, trnS , which has a truncated DHU arm, was detected using a hand-drawn secondary structure in conjunction with an alignment of the predicted trnS regions of other lasiocampid species, and the anticodon was given particular consideration (Timmermans ; Qin ; Kim ). Individual PCGs were identified, and a boundary was delimited using the blastx and tblastn programs in BLAST (http://blast.ncbi.nlm.nih.gov/BLAST.cgi). With the aid of sequences from other lasiocampid species, the start and stop codons of PCGs were confirmed using MAFFT ver. 6 (Katoh ). Two rRNAs and the A+T-rich region were identified and delimited using the nucleotide blast algorithm in Blast, and it was further confirmed with the alignment of mitochondrial rRNA genes and sequences of the A+T-rich region of other lasiocampid species using MAFFT ver. 6.

Comparative analysis

For the comparative analysis of the K. undans mitogenome, available lasiocampid species and one species from each genus of the macroheteroceran superfamily were downloaded from either GenBank or AMiGA (Feijao ), resulting in 11 mitogenome sequences from four Lasiocampidae species (including K. undans) and 48 species from five macroheteroceran superfamilies (Bombycoidea, Geometroidea, Noctuoidea, Drepanoidea, and Mimallonoidea). The nucleotide sequences of the PCGs were translated based on the invertebrate genetic code for mitochondrial DNA (mtDNA). Codon usage and nucleotide composition were determined by MEGA 6 (Tamura ), and gene overlap and intergenic-space sequences were hand-counted. The A/T content of each gene, whole genome, and each codon position of the PCGs were calculated with DNASTAR (Madison, USA) (Burland, 2000). The K. undans sequence data were deposited to GenBank under accession no. KX822016.

Results and Discussion

Mitogenome organization and composition

The mitogenome size of K. undans is 15,570 bp, and is slightly larger than that of any other lasiocampid species, which range in size from 15,407 bp in Dendrolimus punctatus (KJ913814) to 15,552 bp in Apatelopteryx phenax (KJ508055) (Table 1). K. undans contains 3,735 codons, excluding termination codons, and this number is the third largest in the sequenced Lasiocampoidea (next to D. spectabilis and A. phenax; Table 1). The size and codon counts of the lasiocampid species are well within the range found in macroheteroceran species, and no peculiarities associated with total size and codon count were detected in Lasiocampoidea (Table 1, Table S2).
Table 1

Characteristics of Lasiocampoidea mitogenomes.

TaxonSize (bp)A/T content (%)PCGsrRNAlrRNAtRNAA+T-rich regionGenBank accession no.References
No. codons a AT%Size (bp)AT%Size (bp)AT%Size (bp)AT%Size (bp)AT%
Lasiocampoidea
Lasiocampidae
Kunugia undans 15,570 78.64 3,735 76.64 782 86.06 1,514 83.29 1,479 81.54 317 88.64 KX822016 This study
Apatelopteryx phenax 15,55280.333,736 b 78.4274784.871,34684.771,49381.6545894.54KJ508055 Timmermans et al., 2014
Dendrolimus spectabilis 15,40979.453,72477.5677785.201,45483.841,46981.2132092.50KU558688 Kim et al., 2016
Dendrolimus spectabilis 15,41279.363,72677.4178185.151,45483.911,46881.2032093.44KJ913815 Qin et al., 2015
Dendrolimus spectabilis 15,41079.383,72677.4477985.111,45483.911,46881.2032093.44KJ913816 Qin et al., 2015
Dendrolimus spectabilis 15,41179.503,74077.6562884.081,35783.491,46681.0446592.04KM244678 Tang et al., 2014
Dendrolimus punctatus 15,41979.403,72777.4677984.601,46284.821,46980.8732092.50KJ913811 Qin et al., 2015
Dendrolimus punctatus 15,41879.463,72777.5177984.981,46284.821,46981.0132092.50KJ913812 Qin et al., 2015
Dendrolimus punctatus 15,41179.463,72777.5577984.981,46184.601,46980.8732092.81KJ913813 Qin et al., 2015
Dendrolimus punctatus 15,40779.383,72777.5078084.741,45284.441,46980.8032091.88KJ913814 Qin et al., 2015
Dendrolimus tabulaeformis 15,41179.533,72677.6377884.701,45984.721,46981.0132092.81KJ913817 Qin et al., 2015
Dendrolimus tabulaeformis 15,40979.403,72677.4477984.981,45684.621,46881.0632092.81KJ913818 Qin et al., 2015

Termination codons were excluded in the total codon count.

Sequences include a few undetermined nucleotides.

Termination codons were excluded in the total codon count. Sequences include a few undetermined nucleotides. Compared to the typical sets of genes and regions found in animal mitogenomes (13 PCGs, 22 tRNAs, 2 rRNA genes, and one non-coding A+T-rich region), the K. undans mitogenome contains one extra trnR, which is located in tandem to another trnR [referred to as trnR (A) for the copy located next to trnA and trnR (B) for the copy located next to trnN] between trnA and trnN (Figure 1). Pairwise sequence divergence between the two tRNAs was 10.94% (7 bp). Among lasiocampid species (data not shown), pairwise sequence divergence was 3.18-7.81% and 10.94% compared to trnR (A) and trnR (B), respectively, indicating that trnR (A) is more likely to be a functional copy, in that the sequence divergence range reflects the current taxonomic hierarchy. Nevertheless, both trnR copies have an identical anticodon (TCG) that is found in all other Lasiocampoidea (Table 2, Table S3), and they exhibit the proper secondary cloverleaf structure (Figure S1). Thus, the functionality of trnR (B) remains unknown. The tandem location of two trnR copies that exhibit proper secondary structures and an identical anticodon may indicate a gene duplication event rather than horizontal transfer (Higgs ). In Lepidoptera, Coreana raphaelis (Papilionoidea) was the first species reported to have 23 tRNA genes instead of the usual 22 because of a tandemly duplicated trnS between trnN and trnE (Kim ). Ctenoptilum vasava (Papilionoidea) was subsequently reported to have an extra trnS (Kim ; Hao ). However, the extra trnR found in the K. undans mitogenome is likely unique in Macroheterocera, in that our careful reexamination of all available lasiocampid species and all Macroheterocera did not reveal extra tRNAs (data not shown). Currently, the K. undans mitogenome is the only available Kunugia sequence, so whether this duplication event was species- or genus-specific is an intriguing question.
Figure 1

Schematic illustration of the gene arrangement with the duplicated trnR detected in Kunugia undans. Gene sizes are not drawn to scale. Gene names that are not underlined indicate a forward transcriptional direction, whereas underlined sequences indicate a reversed transcriptional direction. tRNAs are denoted by one-letter symbols in accordance with the IUPAC-IUB single-letter amino acid codes. The remaining genes and gene order configurations that are identical to ancestral insects are omitted.

Table 2

Genomic summary of Kunugia undans.

GeneAnticodonStart codonStop codonNucleotide position (size)
trnM CAT--1-68 (68)
trnI GAT--72-135 (64)
trnQ TTG--136-205 (70)
ND2 ATTTAA263-1276 (1014)
trnW TCA--1275-1344 (70)
trnC GCA--1337-1402 (66)
trnY GTA--1412-1479 (68)
COI CGAT-tRNA1500-3057 (1558)
trnL 2 TAA--3058-3125 (67)
COII ATAT-tRNA3125-3806 (682)
trnK CTT--3807-3877 (71)
trnD GTC--3879-3947 (69)
ATP8 ATCTAA3948-4109 (162)
ATP6 ATGTAA4103-4780 (678)
COIII ATGTAA4787-5575 (790)
trnG TCC--5578-5644 (67)
ND3 ATCTAA5645-5998 (354)
trnA TGC--6003-6070 (68)
trnR (A)TCG--6084-6147 (64)
trnR (B)TCG--6175-6241 (67)
trnN GTT--6242-6308 (67)
trnS 1 GCT--6308-6375 (68)
trnE TTC--6376-6440 (65)
trnF GAA--6471-6537 (67)
ND5 ATTT-tRNA6538-8275 (1738)
trnH GTG--8276-8343 (68)
ND4 ATGTAG8348-9682 (1335)
ND4L ATGTAG9688-9981 (294)
trnT TGT--9986-10050 (65)
trnP TGG--10051-10115 (65)
ND6 ATATAA10124-10654 (531)
CytB ATGTAA10662-11807 (1146)
trnS 2 TGA--11809-11875 (67)
ND1 ATGTAA11869-12825 (957)
trnL 1 TAG--12827-12892 (66)
lrRNA --12893-14406 (1513)
trnV TAC--14407-14471 (65)
srRNA --14472-15253 (782)
A+T–rich region--15254-15570 (317)

tRNA abbreviations follow the IUPAC-IUB three letter code.

-, not applicable.

tRNA abbreviations follow the IUPAC-IUB three letter code. -, not applicable. The A/T nucleotide composition of the whole genome was 78.64% in K. undans, indicating biased A/T nucleotides, but it represents the lowest percentage detected in lasiocampid species (Table 1). Among macroheteroceran superfamilies, the A/T composition of the whole mitogenome in Lasiocampoidea is slightly lower than that of any other macroheteroceran superfamily (79.47% vs 80.23-80.79%), but the difference is slight (Table S2). The A/T content among K. undans genes varied between RNA (86.06% in srRNA, 83.29% in lrRNA, and 81.54% in tRNAs) and PCG (76.64%) genes, and the same trend was also found in other sequenced Macroheterocera, including Lasiocampoidea (Table 1, Table S2). The K. undans gene arrangement is identical to that of other ditrysian Lepidoptera that exhibit the trnM-trnI- order (where the underline indicates a gene inversion) at the A+T-rich region and ND2 junction, with the exception of the duplicated trnR (Table 2; Kim ; Timmermans ; Park ; Zhao ). This arrangement is found in all sequenced Macroheterocera (Park ), including Lasiocampoidea (Table 2; Table S3). However, it differs from the ancestral trnI- -trnM order found in the majority of insects and the lepidopteran superfamilies Hepialoidea and Nepticuloidea, which are ancient, non-ditrysian lepidopteran groups (Cao ; Timmermans ). Thus, this tRNA rearrangement has been regarded as synapomorphy for Ditrysia. However, a new arrangement, trnI-trnM- , was reported from a butterfly species belonging to Nymphalidae in Papilionoidea (Xuan ). Therefore, the latter arrangement might represent an autapomorphy, in that no other congeneric species has the arrangement (Park ).

Genes

Twelve of the 13 K. undans PCGs started with ATN, but COI started with an alternative CGA start codon, as observed in other moths (Figure S2). There is no typical start codon at the 5'-end of trnY and the intergenic spacer sequence located between trnY and COI, so CGA is the only possible start codon for COI in K. undans. The CGA start codon is found in all other sequenced macroheteroceran superfamilies, but some authors designate the typical ATN codon as the start codon for COI (Figure S2). This start codon has been reported to be highly conserved at the start region of COI in other Lepidoptera, and it was confirmed in a species of Lepidoptera based on expressed sequence tag data (Margam ; Kim ; Park ). Thus, the presence of a CGA start codon is now considered a synapomorphic trait in Lepidoptera, although some exceptions exist. The mitochondrial PCGs available for Lasiocampoidea, including K. undans, ended with TAA in the majority of PCGs, but they also infrequently ended with a single T (Table 2; Table S3). The TAG stop codon was uniquely used in K. undans for ND4 and ND4L, while other lasiocampid species used a single T for ND4 and TAA for ND4L (Table 2; Table S3). The incomplete termination codon is known to result in a complete TAA stop codon via posttranslational modifications that occur during the mRNA maturation process (Ojala ). The biased A/T content was reflected in the form of codon usage. For instance, among the 64 available codons, the most frequently used codons [TTA (leucine), ATT (isoleucine), TTT (phenylalanine), and ATA (methionine)] accounted for 37.2% in K. undans, and this value was the lowest frequency detected in Lasiocampoidea (Table 3). These four codons are all comprised of A or T nucleotides, thus indicating the biased usage of A/T nucleotides in Lasiocampoidea PCGs, including K. undans. Other macroheteroceran superfamilies have also shown a similar pattern, revealing 39.1–40.7% in Bombycoidea, 37.5–40.4% in Geometroidea, 38.0–44.6% in Noctuoidea, 40.8–40.9% in Drepanoidea, and 39.3% in Mimallonoidea (Table S4).
Table 3

Frequency of the four most frequently used codons in Lasiocampoidea.

SpeciesCodonTotal
TTA (L)ATT (I)TTT (F)ATA (M)
Lasiocampoidea
Lasiocampidae
Kunugia undans 409/10.7 381/10.4 323/8.6 270/7.5 1383/37.2
Apatelopteryx phenax 453/12.1414/11.1337/9.0285/7.61489/39.8
Dendrolimus spectabilis (KU558688)450/12.0402/10.7338/9.0280/7.51470/39.2
Dendrolimus spectabilis (KJ913815)449/12.1399/10.7331/8.9281/7.51460/39.2
Dendrolimus spectabilis (KJ913816)449/12.1399/10.7331/8.9281/7.51460/39.2
Dendrolimus spectabilis (KM244678)450/12.0402/10.7338/9.0280/7.51470/39.2
Dendrolimus punctatus (KJ913811)464/12.4398/10.7330/8.9268/7.21460/39.2
Dendrolimus punctatus (KJ913812)463/12.4399/10.7330/8.9268/7.21460/39.2
Dendrolimus punctatus (KJ913813)463/12.4401/10.8331/8.9267/7.21462/39.3
Dendrolimus punctatus (KJ913814)457/12.3399/10.7331/8.9275/7.41462/39.3
Dendrolimus tabulaeformis (KJ913817)461/12.4401/10.8331/8.9269/7.21462/39.3
Dendrolimus tabulaeformis (KJ913818)455/12.2399/10.7335/9.0268/7.21457/39.1
Average452/12.1400/10.7332/8.9274/7.31458/39.1

The corresponding amino acids are located in parentheses. Values after the backslash indicate the percentage of corresponding codons.

The corresponding amino acids are located in parentheses. Values after the backslash indicate the percentage of corresponding codons. The nucleotide composition of the 13 concatenated PCGs in the K. undans mitogenome was 33.5, 43.2, 11.8, and 11.5% for adenine, thymine, cytosine, and guanine, respectively, indicating A/T bias (Table 4). The base composition at each codon position of the K. undans PCGs indicated that the third codon position (86.5%) had a substantially higher A/T content than the first (72.6%) and second (70.4%) codon positions. A similar pattern was detected in other sequenced Lasiocampoidea, with averages of 77.6, 73.0, and 89.0 in the first, second, and third positions, respectively (Table 4).
Table 4

Codon position-based nucleotide composition of 13 concatenated Lasiocampoidea PCGs.

SpeciesOverall1st codon position2nd codon position3rd codon position
ATCGATCGATCGATCG
Lasiocampoidea
Lasiocampidae
Kunugia undans 33.5 43.2 11.8 11.5 37.6 35.0 11.0 16.0 22.4 48.0 16.0 13.4 40.5 46.0 8.5 5.2
Apatelopteryx phenax 34.044.510.411.037.936.09.716.121.949.016.313.142.349.05.33.8
Dendrolimus spectabilis (KU558688)33.544.111.011.436.936.010.116.621.748.016.613.341.246.08.34.8
Dendrolimus spectabilis (KJ913815)33.444.011.111.536.936.010.016.721.748.016.613.441.747.06.64.5
Dendrolimus spectabilis (KJ913816)33.444.011.111.536.936.010.016.721.748.016.613.441.747.06.54.5
Dendrolimus spectabilis (KM244678)33.544.111.011.436.936.010.116.621.748.016.613.342.048.06.44.1
Dendrolimus punctatus (KJ913811)33.543.911.111.536.936.010.016.721.748.016.613.541.947.06.64.2
Dendrolimus punctatus (KJ913812)33.544.011.011.536.936.010.016.821.748.016.613.442.047.06.54.2
Dendrolimus punctatus (KJ913813)33.644.011.011.437.036.010.016.721.748.016.613.442.047.06.44.2
Dendrolimus punctatus (KJ913814)33.643.911.211.336.936.010.216.721.748.016.613.442.347.06.63.9
Dendrolimus tabulaeformis (KJ913817)33.644.011.011.436.936.010.016.721.748.016.613.442.148.06.24.1
Dendrolimus tabulaeformis (KJ913818)33.543.911.111.436.936.010.116.721.748.016.613.441.947.06.64.2
Average33.644.011.111.437.135.910.116.621.848.116.513.441.847.26.74.3

Stop codons were excluded in the count.

Stop codons were excluded in the count. Two rRNA genes in K. undans, lrRNA and srRNA, were of 1,514 and 782 bp, respectively, (Table 2), and the sizes of the two genes in K. undans were larger than those of any found in other lasiocampid species, which ranged from 1,346 bp (A. phenax) to 1,452 bp (D. punctatus) in lrRNA and 747 bp (A. phenax) to 780 bp (D. punctatus) in srRNA (Table S2). tRNA sizes ranged from 64 bp (trnI) to 71 bp (trnK) in K. undans, and similar size ranges were found in other sequenced lasiocampid species (Table 2; Table S3). All K. undans tRNAs possessed invariable lengths of 7 bp for the aminoacyl stem, 7 bp for the anticodon loop, and 5 bp for the anticodon stem (Figure S1), and most tRNA size variation resulted from length variations in the DHU and TΨC arms. For instance, trnS has an atypical cloverleaf secondary structure that lacked the DHU stem, but the remaining K. undans tRNAs formed the typical secondary cloverleaf structure (Figure S1). The aberrant trnS has been reported in many metazoan species, including insects (Garey and Wolstenholme, 1989; Wolstenholme, 1992). The DHU stem and loop are involved in tertiary interactions required for the proper folding and functioning of tRNA (Rich and RajBhandary, 1976). Thus, an atypical secondary structure may hamper the functionality of tRNA, but a nuclear magnetic resonance analysis from nematodes demonstrated that the aberrant trnS also was functionally similar to typical tRNAs based on structural adjustments required to ensure ribosome fitting (Ohtsuki ).

The A+T-rich region

The length of the A+T-rich region in K. undans was 317 bp, and A/T nucleotides made up 88.64% of the sequence (Table 2). This region contained the highest A/T content of any region of the K. undans mitogenome (Table 1). Moreover, this region was the shortest in length, and it contained the least A/T nucleotides among lasiocampid species (Table 2, Table S3). The insect A+T-rich region harbors signals for replication and transcription initiation, so it is known to have conserved sequences in the region, which are in the form of conserved sequence blocks (Fauron and Wolstenholme, 1980; Clary and Wolstenholme, 1987; Saito ). In fact, previous studies revealed several conserved blocks in a substantial number of lepidopteran groups (Liao ; Kim ), and a search for the A+T-rich region of lasiocampid species (including K. undans) resulted in the detection of several conserved sequences (Figure 2). The first conserved sequence, which is located close to the 5'-end of the srRNA, is the ATAGA motif followed by a poly-T stretch of varying length. The K. undans A+T-rich region contained a 14-bp T stretch that was upstream of the 5'-end of the srRNA (Figure 2), and this poly-T stretch is well-conserved in all sequenced lasiocampid (ranging in size from 12 bp to 14 bp; Figure 2) and macroheteroceran species (Figure S3). Saito previously reported for the Bombyx mori mitogenome the precise position of the replication origin for minor-strand mtDNA, which is immediately downstream of a poly-T stretch that is located upstream of the srRNA 5'-end. Thus, this poly-T stretch is thought to function as a possible recognition site for the initiation of replication of the minor mtDNA strand. Additionally, another conserved motif ATAGA is located immediately downstream of the poly-T stretch, and it is very well-conserved in all sequenced lasiocampid species, including K. undans (Figure 2) and macroheteroceran species (Figure S3). A previously suggested function of this motif is a regulatory role in conjunction with the poly-T stretch, but experimental data are required to support this hypothesis (Kim ). Excluding the previously described sequences, there are only a few additional conserved sequences in the A+T-rich region of lasiocampid [e.g., K. undans (Figure 2)] and macroheteroceran species (Figure S3), including two or more ATTTA sequences scattered in the A+T-rich region, a microsatellite-like TA repeat, and a poly-T stretch. Our careful reexamination of the A+T-rich regions of macroheteroceran species resulted in the detection of repeat sequences in several species, including two of each 55-bp and 24-bp repeats in Bombyx huttoni (Bombycoidea); six 26-bp and two 18-bp repeats in Phthonandria atrilineata, two 278-bp repeats in Dysstroma truncata, four 24-bp repeats in Operophtera brumata (Geometroidea), two 16-bp repeats in Agrotis ipsilon, and two 11-bp repeats in Risoba prominens (Noctuoidea) ( Yang ; Timmermans ; Derks ; Wu ; Yang ; Peng ). Nevertheless, repeat sequences that were longer than 10 bp were not detected in sequenced lasiocampid species, including K. undans.
Figure 2

Schematic illustration of the A+T-rich region of Lasiocampoidea, including Kunugia undans. The colored nucleotides indicate conserved sequences such as the ATAGA motif, poly-T stretch, ATTTA sequence, and microsatellite-like TA repeat sequences. Dots between sequences indicate omitted sequences, and arrows indicate the transcriptional direction. Subscripts indicate the repeat number. GenBank accession numbers of the species represented by more than one mitogenome sequence are enclosed in parentheses.

Non-coding sequences

Excluding the A+T-rich region, the K. undans mitogenome has non-coding sequences that total 172 bp (with a range of 1–57 bp) and spread over 17 regions (Table 2). Comparison of available lasiocampid species indicated that intergenic spacing patterns and sizes are largely consistent in Lasiocampoidea, including those of K. undans. In particular, the 57-bp spacer found at the trnQ and ND2 junction (with a range of 39–58 bp) is consistently found in all lasiocampid species, including K. undans (Figure 3). The origin of this spacer region has previously been ascribed to the partial duplication and random loss of ND2, leaving the current length of the spacer sequence at the trnQ and ND2 junction because the spacer exhibited sequence identity to the neighboring ND2, despite the fact that its non-coding nature may have allowed the spacer to diverge from the original ND2 (Kim ). Regarding K. undans, the sequence identity of the spacer to the neighboring ND2 was 58.33% (Figure 3) and over 50.60% in 59 species of macroheteroceran superfamilies (Figure S4).
Figure 3

Alignment of the spacer sequence (located between trnQ and ND2) and the neighboring partial ND2 of Lasiocampoidea, including Kunugia undans. Asterisks indicate consensus sequences in the alignment. Sequence homology between the spacer and ND2 is shown in the parentheses next to the species name and GenBank accession numbers of species represented by more than one mitogenome sequences. The beginning and end nucleotide positions of the sequences are indicated.

Other relatively long spacer sequences were found in several regions of lasiocampid species, including K. undans, including those at the trnY and COI junction (20–34 bp), at the trnA and trnR junction (13–20 bp), at the trnN and trnS junction (11–21 bp, excluding K. undans that has a 1-bp overlap), and at the ND4 and ND4L junction (5–24 bp, excluding A. phenax that has a 5-bp overlap) (Table 2, Table S3). These spacer sequences are mainly composed of A/T nucleotides that are often found within multiple runs of either T or A nucleotides (data not shown). Sequence alignment beyond the species level was nearly impossible due to considerable variability in length, sequence composition, and insertions/deletions (data not shown). The majority of the remaining spacer regions were short, with a few exceptions (e.g., less than 10 bp). In previous lepidopteran mitogenomic studies, other spacer sequences at the trnS and ND1 junction were consistently reported in lepidopteran lineages (Cameron and Whiting, 2008; Kim ; Yang ; Kim ; Park ). The important feature of this spacer is the presence of a short-length TTAGTAT motif within the spacer sequence, which is thought to be a possible binding site for the transcription termination peptide of mtDNA (mtTERM). This characterization is based on the fact that the spacer is located after the final major-strand coded CytB (Taanman, 1999; Cameron and Whiting, 2008). Regarding K. undans, there is a 7-bp overlap at the ND1 and trnS junction, but K. undans clearly possesses the same sequence motif (Figure 4). All other lasiocampid species, with the exception of A. phenax, have a 1-bp gene overlap in this region, but they also contain the 7-bp motif at the ND1 and trnS junction. On the other hand, A. phenax has an intergenic spacer sequence at 12 bp, which includes the 7-bp motif. In other macroheteroceran species, the 7-bp motif is found in nearly all species without modification, with the exception of one Noctuoidea species, which has ATAGTAT instead of TTAGTAT. In Macroheterocera, the 7-bp motif is nearly always located at the spacer instead of the coding region at the ND1 and trnS junction (Figure S5). Thus, the spacing pattern of Lasiocampoidea differs from that of other macroheteroceran superfamilies in this region, so mRNA expression data would be required to clarify the extension of ND1 at the ND1 and trnS junction.
Figure 4

Alignment of the internal spacer sequence located between ND1 and trnS of Lasiocampoidea, including Kunugia undans. The shaded nucleotides indicate the conserved heptanucleotide (TTAGTAT) region. Underlined nucleotides indicate the adjacent partial sequences of ND1 and trnS . Arrows indicate the transcriptional direction.

In summary, in addition to the typical set of genes, the 15,570-bp complete mitogenome sequence of K. undans has an extra trnR. The presence of the additional tRNA is unique in Macroheterocera, including Lasiocampoidea. The A+T-rich region of K. undans possesses a few conserved sequences, which were previously reported in other Macroheterocera (including Lasiocampoidea). Moreover, the intergenic spacing pattern and size for K. undans are largely consistent with those of other Macroheterocera (including Lasiocampoidea), but instead of an intergenic spacer, Lasiocampoidea (including K. undans) exhibit an overlap at the trnS and ND1 junction.
  41 in total

1.  The minimal tRNA: unique structure of Ascaris suum mitochondrial tRNA(Ser)(UCU) having a short T arm and lacking the entire D arm.

Authors:  Takashi Ohtsuki; Gota Kawai; Kimitsuna Watanabe
Journal:  FEBS Lett       Date:  2002-03-06       Impact factor: 4.124

2.  AMiGA: the arthropodan mitochondrial genomes accessible database.

Authors:  Pedro Cipriano Feijão; Lissiene Silva Neiva; Ana Maria Lima de Azeredo-Espin; Ana Cláudia Lessinger
Journal:  Bioinformatics       Date:  2006-01-29       Impact factor: 6.937

3.  Description of new mitochondrial genomes (Spodoptera litura, Noctuoidea and Cnaphalocrocis medinalis, Pyraloidea) and phylogenetic reconstruction of Lepidoptera with the comment on optimization schemes.

Authors:  Xinlong Wan; Min Jee Kim; Iksoo Kim
Journal:  Mol Biol Rep       Date:  2013-09-22       Impact factor: 2.316

4.  Reconstructing intraordinal relationships in Lepidoptera using mitochondrial genome data with the description of two newly sequenced lycaenids, Spindasis takanonis and Protantigius superans (Lepidoptera: Lycaenidae).

Authors:  Min Jee Kim; Ah Rang Kang; Heon Cheon Jeong; Ki-Gyoung Kim; Iksoo Kim
Journal:  Mol Phylogenet Evol       Date:  2011-07-26       Impact factor: 4.286

5.  Higher-level phylogeny of the Hymenoptera inferred from mitochondrial genomes.

Authors:  Meng Mao; Tracey Gibson; Mark Dowton
Journal:  Mol Phylogenet Evol       Date:  2014-12-24       Impact factor: 4.286

6.  Platyhelminth mitochondrial DNA: evidence for early evolutionary origin of a tRNA(serAGN) that contains a dihydrouridine arm replacement loop, and of serine-specifying AGA and AGG codons.

Authors:  J R Garey; D R Wolstenholme
Journal:  J Mol Evol       Date:  1989-05       Impact factor: 2.395

7.  The mitochondrial genome of the Korean hairstreak, Coreana raphaelis (Lepidoptera: Lycaenidae).

Authors:  I Kim; E M Lee; K Y Seol; E Y Yun; Y B Lee; J S Hwang; B R Jin
Journal:  Insect Mol Biol       Date:  2006-04       Impact factor: 3.585

8.  The complete mitochondrial genome of the fall webworm, Hyphantria cunea (Lepidoptera: Arctiidae).

Authors:  Fang Liao; Lin Wang; Song Wu; Yu-Ping Li; Lei Zhao; Guo-Ming Huang; Chun-Jing Niu; Yan-Qun Liu; Ming-Gang Li
Journal:  Int J Biol Sci       Date:  2010-03-29       Impact factor: 6.580

9.  The complete mitochondrial genomes of two ghost moths, Thitarodes renzhiensis and Thitarodes yunnanensis: the ancestral gene arrangement in Lepidoptera.

Authors:  Yong-Qiang Cao; Chuan Ma; Ji-Yue Chen; Da-Rong Yang
Journal:  BMC Genomics       Date:  2012-06-22       Impact factor: 3.969

10.  Evolutionary dynamics of the mitochondrial genome in the evaniomorpha (hymenoptera)—a group with an intermediate rate of gene rearrangement.

Authors:  Meng Mao; Tracey Gibson; Mark Dowton
Journal:  Genome Biol Evol       Date:  2014-07       Impact factor: 3.416

View more
  3 in total

1.  First mitochondrial genome from Yponomeutidae (Lepidoptera, Yponomeutoidea) and the phylogenetic analysis for Lepidoptera.

Authors:  Mingsheng Yang; Bingyi Hu; Lin Zhou; Xiaomeng Liu; Yuxia Shi; Lu Song; Yunshan Wei; Jinfeng Cao
Journal:  Zookeys       Date:  2019-10-09       Impact factor: 1.546

2.  Complete mitochondrial genome of Pterodecta felderi (Lepidoptera: Callidulidae).

Authors:  Keon Hee Lee; Min Jee Kim; Jeong Sun Park; Iksoo Kim
Journal:  Mitochondrial DNA B Resour       Date:  2020-11-11       Impact factor: 0.658

3.  Phylogeographic Relationships among Bombyx mandarina (Lepidoptera: Bombycidae) Populations and Their Relationships to B. mori Inferred from Mitochondrial Genomes.

Authors:  Min-Jee Kim; Jeong-Sun Park; Hyeongmin Kim; Seong-Ryul Kim; Seong-Wan Kim; Kee-Young Kim; Woori Kwak; Iksoo Kim
Journal:  Biology (Basel)       Date:  2022-01-03
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.