Literature DB >> 27974854

Characterization of the Complete Mitochondrial Genome of Leucoma salicis (Lepidoptera: Lymantriidae) and Comparison with Other Lepidopteran Insects.

Yu-Xuan Sun1, Lei Wang1, Guo-Qing Wei1, Cen Qian1, Li-Shang Dai1, Yu Sun1, Muhammad Nadeem Abbas1, Bao-Jian Zhu1, Chao-Liang Liu1.   

Abstract

The complete mitochondrial genome (mitogenome) of Leucoma salicis (Lepidoptera: Lymantriidae) was sequenced and annotated. It is a circular molecule of 15,334 bp, containing the 37 genes usually present in insect mitogenomes. All protein-coding genes (PCGs) are initiated by ATN codons, other than cox1, which is initiated by CGA. Three of the 13 PCGs had an incomplete termination codon, T or TA, while the others terminated with TAA. The relative synonymous codon usage of the 13 protein-coding genes (PCGs) was consistent with those of published lepidopteran sequences. All tRNA genes had typical clover-leaf secondary structures, except for the tRNASer (AGN), in which the dihydrouridine (DHU) arm could not form a stable stem-loop structure. The A + T-rich region of 325 bp had several distinctive features, including the motif 'ATAGA' followed by an 18 bp poly-T stretch, a microsatellite-like (AT)7 element, and an 11-bp poly-A present immediately upstream of tRNAMet. Relationships among 32 insect species were determined using Maximum Likelihood (ML), Neighbor Joining (NJ) and Bayesian Inference (BI) phylogenetic methods. These analyses confirm that L. salicis belongs to the Lymantriidae; and that Lymantriidae is a member of Noctuoidea, and is a sister taxon to Erebidae, Nolidae and Noctuidae, most closely related to Erebidae.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27974854      PMCID: PMC5156926          DOI: 10.1038/srep39153

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Leucoma salicis is a moth that is mainly distributed in China, Korea and Japan. It is a notorious plant pest and causes considerable economic losses. It typically consumes willow and tea leaves, influencing quality and quantity of tea products1; and damages roadside and garden trees in urban areas. Traditionally, the identification of this species was based on morphological characteristics of adult moths2. However, the moth appears mainly in June to August, the rest of its life go through egg and larva stages (which has no easily identifying morphological features), requiring eggs and larvae to be reared to adult stage for identification, which is time consuming and labor intensive. Molecular methods for identification are under development, including polymerase chain reaction-restriction fragment length polymorphism (PCR–RFLP)3. Most previous work on L. salicis has focused on sex pheromone synthesis4, or the nuclear polyhedrosis virus that infects larvae5. Previous studies have not focused on the mitochondrial genome, which can provide systematically-informative information for identification, phylogenetic analysis and evolutionary studies on L. salicis. Insect mitochondrial DNA (mtDNA) is a double-stranded, circular molecule, ranging in size from 14 to 20 kb. It usually contains a conserved set of 37 genes, including seven NADH dehydrogenase (nad1-nad6 and nad4L), three cytochrome c oxidase (cox1-cox3), two ATPase (atp6 and atp8), one cytochrome b (cob), two ribosomal RNA (rrnL and rrnS), 22 transfer RNA (tRNA) genes, and an adenine (A) + thymine (T)-rich region that contains initiation sites for transcription and replication of the genome67. Due to its simple genomic organization, high rate of evolution, and almost unambiguous orthology, mtDNA is typically considered to be an informative molecular marker for species identification and in studies of phylogenetic relationships and population structure89. A better understanding of the lepidopteran mitochondrial genome requires expanded taxon sampling. Lepidoptera contains more than 160,000 described species, classified into 45–48 superfamilies10. Lymantriidae includes about 360 genera and over 2500 species, many of which are agriculturally important. Only eight species have completely-sequenced mitogenomes that are publically available in GenBank, despite the large species diversity in the family. In this study, we sequenced and annotated the complete mitogenome sequence of L. salicis, and compared it with those of other members of Lymantriidae. Our results provide novel methods for species identification of an important pest, as well as phylogenetically-informative sequence data that addresses the position of L. salicis within Noctuoidea.

Results

Geno me organization and composition

The mitogenome of L. salicis was a circular DNA molecule, 15,334 bp in length (Fig. 1). It contained the typical insect mitogenome set of 22 tRNAs, 13 PCGs (nad1-6, nad4L, cox1-3, cob, atp6 and atp8), two rRNAs (rrnS and rrnL), and the non-coding A + T-rich region (Table 1). Nucleotide composition was highly A + T biased (A: 42.07%, T: 38.57%, G: 7.22%, C: 12.14%; Table 2). Nucleotide BLAST (blastn) of the entire L. salicis mitogenome against GenBank returned sequence identities with closely related species of 79% (Lachana alpherakii), 78% (Euproctis pseudoconspersa), 78% (Gynaephora menyuanensis), and 77% (Lymantria dispar) (Table S1).
Figure 1

Map of the mitogenome of L. salicis.

tRNA genes are labeled according to the IUPAC-IUB three-letter amino acids; cox1, cox2 and cox3 refer to the cytochrome c oxidase subunits; cob refers to cytochrome b; nad1-nad6 refer to NADH dehydrogenase components; rrnL and rrnS refer to ribosomal RNAs.

Table 1

Summary of characteristics of the mitogenome of L. salicis.

GeneDirectionLocationSizeAnti codonStart codonStop codonIntergenic Nucleotides
tRNAMetF1–6666CAT0
tRNAIleF67–13468GAT−3
tRNAGlnR132–20069TTG47
ND2F248–1233986ATTTAA3
tRNATrpF1237–130973TCA−8
tRNACysR1302–137069GCA0
tRNATyrR1371–144272GTA1
COIF1453–29871535CGAT2
tRNALeu(UUR)F2993–305967TAA0
COIIF3060–3740681ATTT0
tRNALysF3741–381171CTT0
tRNAAspF3812–387867GTC0
ATP8F3879–4040162ATATAA−7
ATP6F4034–4711678ATGTAA14
COIIIF4726–5514789ATGTAA2
tRNAGlyF5517–558266TCC0
ND3F5583–59336354ATTTAA10
tRNAAlaF5947–601771TGC−1
tRNAArgF6017–608468TCG1
tRNAAsnF6086–615267GTT0
tRNASer(AGN)F6153–621967GCT17
tRNAGluF6237–630165TTC0
tRNAPheR6302–636867GAA20
ND5R6389–810317158
tRNAHisR8112–817564GTG1
ND4R8177–95161341ATGTA15
ND4LR9532–98252945
tRNAThrF9831–989767TGT0
tRNAProR9898–996568TGG2
ND6F9968–10514547ATATAA8
CytbF10523–116551133ATGTAA1
tRNASer(UCN)F11657–1172165TGA10
ND1R11732–12684953ATTTAA0
tRNALeu(CUN)R12685–1275571TAG0
lrRNAR12756–1409913440
tRNAValR14100–1416970TAC0
srRNAR14170–150098400
A + T-rich region15010–15334325
Table 2

Composition and skew in different lepidopteran mitogenomes.

SpeciesSize (bp)A%G%T%C%A + T %AT skewnessGC skewness
Whole genome
 L. salicis1533442.077.2238.5712.1480.640.043−0.254
 C. agnata1526139.587.7141.5211.281.1−0.023−0.184
 H. cunea1548140.587.5539.8112.0680.390.009−0.229
 G. menyuanensis1577040.886.7740.611.7581.480.003−0.268
 L. dispar1550740.387.6139.2612.579.640.014−0.243
 E. pseudoconspersa1546140.427.6139.5112.4679.930.011−0.241
 G. argentata1533739.647.5642.0510.7581.69−0.029−0.174
 A. formosae1546338.677.5340.8312.9879.49−0.027−0.265
 P. distinctalis1535441.047.4941.2210.2482.27−0.002−0.155
 L. haraldusalis1521340.477.6641.0410.8381.52−0.006−0.171
 B. thibetaria1548442.387.5537.2412.8379.620.064−0.259
 R. menciana1530141.427.8237.4513.3178.860.050−0.259
 B. mori1566643.097.3138.2611.3481.350.059−0.216
 S. morio1529940.647.5840.5311.2681.170.0013−0.195
 S. lechriaspis1536839.867.6341.3411.1781.19−0.018−0.188
 C. benjaminii1527240.087.5240.711.780.78−0.007−0.217
PCG
 L. salicis1117142.247.8937.1612.7179.390.063−0.233
 C. agnata1123839.128.3740.7911.7279.91−0.020−0.166
 H. cunea1120539.998.3538.613.0678.590.017−0.219
 G. menyuanensis1122840.377.539.4112.7279.780.012−0.258
 L. dispar1123639.528.4438.1813.6277.710.017−0.234
 E. pseudoconspersa1118739698.4338.313.5877.990.017−0.233
 G. argentata1120339.058.2941.2711.3880.33−0.027−0.157
 A. formosae11,21738.188.2839.6213.9277.8−0.018−0.254
 P. distinctalis1118940.548.1240.5310.8181.070−0.142
 L. haraldusalis11,19339.888.4740.1611.4980.04−0.003−0.151
 B. thibetaria1121241.668.3635.9414.0477.60.073−0.253
 R. menciana1122540.978.5836.1214.3377.10.063−0.251
 B. mori1117742.938.1636.6412.2879.570.079−0.201
 S. morio1117940.288.2739.5611.8979.840.009−0.179
 S. lechriaspis1125839.318.3540.4111.9379.72−0.013−0.176
 C. benjaminii1115339.448.2339.7412.5979.18−0.003−0.209
tRNA
 L. salicis149843.196.8840.729.2183.910.029−0.145
 C. agnata147741.238.1940.2210.3681.450.012−0.117
 H. cunea147441.867.8739.8910.3881.750.024−0.138
 G. menyuanensis150441.297.3841.769.5783.05−0.006−0.129
 L. dispar146641.417.9839.510.9180.90.024−0.155
 E. pseudoconspersa146641.418.1940.1810.2381.580.015−0.111
 G. argentata146941.328.2440.2310.2181.550.013−0.107
 A. formosae145740.437.9640.3611.2680.780.001−0.172
 P. distinctalis153642.198.1439.789.981.970.029−0.098
 L. haraldusalis145141.087.8641.429.6582.49−0.004−0.102
 B. thibetaria147842.087.8539.2410.8381.330.035−0.160
 R. menciana148541.088.0839.9310.9181.010.014−0.149
 B. mori147042.047.8939.5210.5481.560.031−0.144
 S. morio146340.68.241.0110.1881.61−0.005−0.108
 S. lechriaspis151641.037.9241.099.9682.12−0.001−0.114
 C. benjaminii146740.98.0440.4910.5781.390.005−0.136
rRNA
 L. salicis218441.395.0440.812.7782.190.007−0.434
 C. agnata211240.015.0744.6510.2784.66−0.055−0.339
 H. cunea223442.084.9242.7510.2584.83−0.008−0.351
 G. menyuanensis231141.894.2842.8410.9984.73−0.011−0.439
 L. dispar214042.524.8141.8210.4284.350.008−0.368
 E. pseudoconspersa222542.564.5442.1110.7984.670.005−0.408
 G. argentata216540.64.7645.139.5285.73−0.053−0.333
 A. formosae216338.934.7244.8511.5183.77−0.071−0.418
 P. distinctalis217441.315.3444.029.3485.33−0.032−0.272
 L. haraldusalis2121442.24.6743.339.8185.534.664−0.355
 B. thibetaria224145.524.7739.5810.1385.10.070−0.360
 R. menciana214743.044.8440.7111.4183.740.028−0.404
 B. mori216143.734.5841.0910.684.820.031−0.397
 S. morio215241.734.8343.0810.3684.8−0.016−0.364
 S. lechriaspis216041.714.9543.849.4985.56−0.025−0.314
 C. benjaminii213241.74.8843.769.6685.46−0.024−0.329
AT RICH
 L. salicis32534.462.4657.235.8591.69−0.248−0.408
 C. agnata33446.711.546.715.0993.410.000−0.545
 H. cunea35745.661.1249.33.9294.96−0.038−0.556
 G. menyuanensis44943.652.4549.674.2393.32−0.065−0.266
 L. dispar37144.742.4349.63.2394.34−0.052−0.141
 E. pseudoconspersa38843.562.3250.263.8793.81−0.071−0.250
 G. argentata34043.241.4752.063.2495.29−0.093−0.376
 A. formosae48242.952.949.794.3692.74−0.074−0.201
 P. distinctalis34946.131.15493.7295.13−0.030−0.528
 L. haraldusalis31045.810.9750.322.996.13−0.047−0.499
 B. thibetaria35044.292.5748.294.8692.57−0.043−0.308
 R. menciana35743.73.3647.345.691.04−0.040−0.250
 B. mori49444.741.8250.612.8395.34−0.062−0.217
 S. morio31644.32.5348.424.7592.72−0.044−0.305
 S. lechriaspis44140.362.4952.384.7692.74−0.130−0.313
 C. benjaminii29346.423.0745.734.7892.150.007−0.218

Protein-coding genes and codon usage

The PCG region formed 72.9% of the L. salicis mitogenome, and was 11,172 bp long. Nine of 13 PCGs (nad2, cox1, cox2, atp8, atp6, cox3, nad3, nad6 and cob) were encoded on the H-strand, while the remaining four (nad5, nad4, nad4L and nad1) were encoded on the L-strand. Each PCG was initiated by a canonical ATN codon, except for cox1 (Table 1), which was initiated by a CGA codon. Ten of 13 PCGs used a typical TAA termination codon; but cox1 and cox2 terminated with a single T and nad4 terminated with TA (Table 1). Relative synonymous codon usage (RSCU) analysis of PCGs in L. salicis revealed that the codons encoding Asn, Ile, Leu (UUA, UUG), Lys, Tyr and Phe were the most frequently present, while those encoding Cys and Arg were rare (Fig. 2). In the PCGs of the eight moth species examined, codon distributions and amino acid content were largely consistent among species (Fig. 3). Codons with A or T in the third position were overused in comparison to other synonymous codons: for example, the codons for valine GTC and GTG were rare, while the synonymous codons GTT and GTA were prevalent (Fig. 4). All used codons were present in the PCGs of the L. salicis mitogenome, except for CGC and GGC. This is similar to codon usage in Hyphantria cunea, Spilonota lechriaspis, and Gabala argentata, which respectively lack CGG and CGC, GCG and CGG, and CGG and CGC.
Figure 2

Comparison of codon usage within the mitochondrial genome of members of the Lepidoptera.

Lowercase letters (a,b,c,d and e) above species names represent the superfamily to which the species belongs (a: Noctuoidea, b: Geometroidea, c: Bombycoidea, d: Pyraloidea, e: Tortricoidea).

Figure 3

Codon distribution in members of the Lepidoptera.

CDspT = codons per thousand codons.

Figure 4

Relative Synonymous Codon Usage (RSCU) of the mitochondrial genome of five superfamilies in the Lepidoptera.

Codon families are plotted on the x-axis. Codons indicated above the bar are not present in the mitogenome.

Ribosomal RNA and transfer RNA genes

The large (rrnL) and small (rrnS) ribosomal RNA subunit genes of L. salicis were located between the tRNA(CUN)/tRNA and the tRNA/A + Trich regions, respectively (Fig. 1, Table 1). The rrnL gene was 1,344 bp long, while rrnS was 840 bp long. A + T content of the rRNA genes was 83.91%. AT and GC skews were positive (0.029) and negative (−0.144), respectively. The L. salicis mitogenome included 22 tRNA genes, ranging from 64 bp (tRNA) to 73 bp (tRNA) long. Of these, 14 genes were encoded on the H-strand and eight on the L-strand (Table 1). The tRNA genes were highly A + T biased (82.19%) with a positive AT-skew (0.007) (Table 2). All the tRNAs possessed a typical clover-leaf secondary structure, except tRNA(AGN), which lacks the dihydrouridine (DHU) arm and forms a simple loop (Fig. 5). Ten of the tRNA genes were each found to have 11 G-U mismatches in their respective secondary structures, which form a weak bond. Ten U-U mismatches were present in the respective amino acid acceptor stems of tRNA, tRNA, tRNA(UUR), tRNA, tRNA, tRNA(CUN), and tRNA (Fig. 5). All tRNA secondary structures of the tRNA genes were calculated using the tRNAscan-SE program.
Figure 5

Predicted secondary structures of the 22 tRNA genes of the L. salicis mitogenome.

Overlapping and intergenic spacer regions

We identified four overlapping gene sequences, varying from 1 bp to 8 bp, making up 19 bp in total. The longest overlapping region was 8 bp between tRNA and tRNA; there was a 7 bp overlap between atp8 and atp6; 3 bp overlap between tRNA and tRNA, and 1 bp between tRNA and tRNA (Table 1). Intergenic spacers were spread over 18 regions, and ranged in length from 1 bp to 47 bp. The longest (47 bp) contained an A + T-rich region and occurred between tRNA and nad2. The 10 bp spacer region between tRNA (UCN) and nad1 included an ‘ATACTAA’ motif (Fig. 6A).
Figure 6

(A) Alignment of the intergenic spacer region between tRNA (UCN) and nad1 of several Lepidopteran insects. (B) Features present in the A + T-rich region of L. salicis. The ‘ATATG’ motif is shaded. The poly-A stretch is double underlined, and the poly-T stretch is underlined. The single microsatellite T/A repeat sequence is indicated by dotted underlining.

The A + T-rich region

The 325 bp long A + T-rich region of L. salicis was located between the rrnS and tRNA genes (Table 1). A + T content in the A + T-rich region was 91.69%, and both AT (−0.248) and GC (−0.408) skews were negative (Table 2). The A + T-rich region did not contain long repeats, though some short repeating sequences scattered over the entire region were present: an ‘ATAGA’ motif followed by an 18 bp poly-T stretch, a microsatellite-like (AT)7 and a poly-A element upstream of the tRNA gene (Fig. 6B).

Phylogenetic relationships

We established phylogenetic relationships among 32 insects (Table 3), based on nucleotide sequences of 13 PCGs, using Maximum Likelihood (ML), Neighbor Joining (NJ) and Bayesian Inference (BI) methods. Species clustered by family (Fig. 7A, B and C). Within Lymantriidae, L. salicis was most closely related to G. menyuanensis. Lymantriidae clustered with Erebidae, while Noctuidae clustered with Nolidae. Noctuoidea was most closely related to Bombycoidea in ML and NJ trees, while in the BI tree Bombycoidea was most closely related to Geometroidea. Papilionoidea and Tortricoidea branched together in ML and NJ methods, but were separated from each other in the BI tree.
Table 3

Details of the lepidopteran mitogenomes used in this study.

SuperfamilyFamilySpeciesSize (bp)GenBank No.
BombycoideaBombycidaeBombyx mori15666KM875545.1
  Rondotia menciana15301KC881286.1
 SaturniidaeActias selene15,236NC_018133
  Antheraea pernyi15,566AY242996
  Antheraea yamamai15,338NC_012739
 SphingidaeSphinx morio15299KC470083.1
  Manduca sexta15,516NC_010266
NoctuoideaLymantriidaeLymantria dispar15507GU994783.1
  Gynaephora menyuanensis15770KC185412.1
  Euproctis pseudoconspersa15461KJ716847.1
  Leucoma salicis15334This study
 NoctuidaeCtenoplusia agnata15261KC414791.1
  Agrotis ipsilon15,377KF163965
 NolidaeEutelia adulatricoides15,360KJ185131
  Gabala argentata15,337KJ410747
 ErebidaeAmata formosae15463KC513737
  Hyphantria cunea15481GU592049.1
 NotodontidaePhalera flavescens15,659NC_016067
  Ochrogaster lunifer15,593NC_011128
GeometroideaGeometridaeApocheima cinerarium15,722KF836545
  Biston thibetaria15,484KJ670146.1
PyraloideaCrambidaeChilo suppressalis15,395NC_015612
  Diatraea saccharalis15,490NC_013274
  Paracymoriza distinctalis15354KF859965.1
 PyralidaeLista haraldusalis15213NC_024535
TortricoideaTortricidaeSpilonota lechriaspis15368HM204705.1
  Grapholita molesta15,717NC_014806
PapilionoideaPapilionidaePapilio maraho16,094NC_014055
  Teinopalpus aureus15,242NC_014398
 NymphalidaeApatura ilia15,242NC_016062
  Apatura metis15,236NC_015537
  Fabriciana nerippe15,140NC_016419
  Argynnis hyperbius15,156NC_015988
HepialoideaHepialidaeThitarodes renzhiensis16,173NC_018094
  Ahamus yunnanensis15,816NC_018095
HesperioideaHesperiidaeChoaspes benjaminii15272JX101620.1
Figure 7

(A) Tree showing the phylogenetic relationships among 32 species, constructed using Maximum Likelihood with 1000 bootstrap replicates. (B) Neighbor Joining (NJ) tree, with 1000 bootstrap replicates. (C) Tree constructed using Bayesian Inference (BI) MCMC consensus tree, with posterior probabilities shown at nodes. Drosophila melanogaster (NC_025936) and Locusta migratoria (NC_002084) were used as outgroups.

Discussion

At the family level, the length of the L. salicis mitogenome (15,334 bp) is marginally smaller than that of Euproctis pseudoconspersa (15,461 bp), but it falls within the range (15,140–16,173 bp) of other known lepidopteran mitogenomes. Gene order and orientation are the same as in previously-sequenced Lymantriidae. Nucleotide BLAST (blastn) result of the entire mitogenome against closely related species revealed that L. salicis has a high similarity with the Lymantriidae species (77% in L. dispar–79% in L. alpherakii). The conserved regions lie in 22 tRNAs and 13 PCGs, while A + T-rich region varies in these species. These remarkable characteristics have been reported in other lepidopteran species7 and could be used as potential markers for identification at genus and species level in recent molarcular techniques. The highly A + T biased nucleotide composition is within the range of previously sequenced lepidopterans (79.64% in L. dispar–81.48% in G. menyuanensis). The positive AT skew (0.043) observed here, indicating the presence of more As than Ts, is similar to that seen in many lepidopterans, including L. dispar (0.014), Rondotia menciana (0.050), and Biston thibetaria (0.064) (Table 2). It is slightly higher than that of other sequenced mitogenomes in Noctuoidea, including Ctenoplusia agnata (−0.023), G. menyuanensis (0.003) and E. pseudoconspersa (0.011). A similar trend has been observed in other lepidopteran superfamilies such as Bombycoidea, where AT skew varies from 0.001 (Sphinx morio) to 0.059 (Bombyx mori)11. In all sequenced lepidopteran mitogenomes, GC skew ranges from −0.268 in G. menyuanensis to −0.155 in Paracymoriza distinctalis (Table 2). The L. salicis mitogenome is moderately skewed (−0.254), showing the presence of more Cs than Gs. The AT skew value (0.063) of the protein-coding gene region in the L. salicis mitogenome is higher than that of several previously sequenced mitogenomes. Its negative GC skew (−0.234) is similar to that seen in other animals. Cox1 is thought to initiate with CGA, as found in other lepidopteran insects1213. Cox1 and cox2 terminate with a single T, while nad4 terminates with TA. Similar results have been documented in several sequenced lepidopteran mitogenomes, including Artogeia melete14, Phthonandria atrilineata15, Ochrogaster lunifer16, H. cunea17 and Amata emma18. The common termination codon TAA is usually created via post-transcriptional polyadenylation19. The relative synonymous codon usage of the 13 protein-coding genes (PCGs) in L. salicis is consistent with those of published lepidopteran sequences. Similarly, codons with A or T in the third codon position being overrepresented relative to other synonymous codons, is consistent with previous observations of lepidopterans9; likewise the absence or underrepresentation of high-GC codons1820. The A + T content (83.91%) of rRNA genes is similar to that seen in Lymantriidae (83.05% in G. menyuanensis). The positive AT (0.029) and negative GC (−0.144) skew seen in the L. salicis mitogenome has also been reported in several sequenced lepidopterans (Table 2). For example, H. cunea has a positive AT (0.024) and negative GC (−0.137) skew17; and L. dispar also has positive AT (0.023) and negative GC (−0.155) skew. The secondary structure of L. salicis tRNA(AGN) lacks the dihydrouridine (DHU) arm and forms a simple loop. This has also been observed in several other animal mitogenomes21, including those of insects152223. Ten tRNA genes have 11 mismatches in their secondary structures; most of these are located in the acceptor, DHU and anticodon stems. In addition, tRNA and tRNA (UCN) contain an A-A mismatch in the anticodon stem. Unmatched base pairs observed in tRNA sequences can be corrected by RNA-editing mechanisms that are well known for arthropod mtDNA24. Four overlapping sequences occur in the mitogenome of L. Salicis. The 7 bp overlap between atp8 and atp6 has been documented in several other lepidopteran mitogenomes2526. The 10 bp intergenic spacer region containing an ‘ATACTAA’ motif, between tRNA (UCN) and nad1, has also been documented in at least nine other species, suggesting that this region is highly conserved among most of the lepidopteran mtDNAs sequenced to date27. The length of the A + T-rich region of L. salicis (325 bp) is shorter than those of G. menyuanensis (449), L. dispar (371), H. cunea (357) and B. thibetaria (350), and longer than those of Lista haraldusalis (310) and Choaspes benjaminii (293). Extra tRNA-like structures are often found in the A + T-rich region of lepidopteran mitogenomes. For example Antheraea yamamai has tRNA(UCN)-like and tRNA-like sequences, each with correct anticodon structure and forming a clover-leaf structure, which suggests that they may be functional, though each has several mismatches in both aminoacyl and anticodon stem regions28. Extra tRNA-like structures have not been seen in L. salicis. The presence of multiple tandem-repeat elements is described as being characteristic of insect A + T-rich regions29. Antheraea pernyi has a repeat element of 38 bp tandemly repeated six times25; and Cnaphalocrocis medinalis has a duplicated 25 bp repeat element2530. Long conspicuous repeats were not observed in the A + T-rich region of L. salicis, though shorter repeating sequences, an ‘ATAGA’ motif and other features were. These characteristic features have each been found in previously sequenced lepidopteran species273132. In general, the L. salicis mitogenome contains several features in nucleotide composition, structure of tRNAs and PCGs as well as in the A + T rich region. Particularly in advanced technologies like PCR–RFLP methods3 and DNA barcodes33, these similarities and differences between L. salicis and other insects could be used as potential markers in species identification, especially the differences. Phylogenetic relationships were established using Maximum Likelihood (ML) Neighbor Joining (NJ) and Bayesian Inference (BI) methods. Species clustered in families, and results were broadly consistent with previous work, e.g. Dong et al.26 and Dai et al.34. Results obtained from our analyses also supported the classification proposed by Fibiger and Lafontaine35, including within Lymantriidae a clade comprised of E. pseudoconspersa, L. salicis, L. dispar and G. menyuanensis. The present analysis showed that within Lymantriidae, L. salicis was most closely related to G. menyuanensis, which is consistent with a recent study on E. pseudoconspersa26. Interestingly, L. dispar is more closely related to G. menyuanensis than E. pseudoconspersa in ML and NJ trees (Fig. 7A and B), whereas in the BI consensus tree L. dispar and E. pseudoconspersa branch together with 0.6406 posterior probabilities (Fig. 7C). We conclude from the above results that differences between BI, ML and NJ methods generate different results on the relationship among different Noctuoidea species. Because most previous classifications of Lymantriidae species have been based on morphological features, the precise position of Lymantriidae within the Noctuoidea is still unclear. Kitching has suggested that the Lymantriidae are the sister group to a paraphyletic Pantheidae, sharing apomorphies such as the presence of secondary setae in first instar larvae36. Zahiri et al. reclassified the Noctuoidea on the basis of molecular analyses, making the group currently named Lymantriinae a subfamily of Erebidae37. Our results suggest that Lymantriidae can be regarded as a sister group to other families (Erebidae, Nolidae and Noctuidae) in the Noctuoidea, being most closely related to Erebidae that is consistent with previous study of Fibiger and Lafontaine (2005) on higher Noctuoidea classification. They placed the Lymantriidae from a position in front of the Nolidae to a position after Arctiidae to reflect the close association of the arctiids and lymantriids, and moved the Nolidae, Arctiidae and Lymantriidae in front of the upgraded family Erebidae so that their close relationship with the “quadrifids’ is better reflected35. It is concluded that further studies are needed on sequencing and characterization of mitogenomes of the family Lymantriidae that will provide insight to classification of Noctuoidea. At the level of superfamilies, Noctuoidea was closely related to Bombycoidea in our ML and NJ analyses, while in the BI tree, Bombycoidea was closely related to Geometroidea. Papilionoidea and Tortricoidea branched together in ML and NJ trees, but in the BI tree they formed separate branches, more in line with previous studies. Hepialoidea was the sister group to all other superfamilies, as found previously by Salvato et al.16 and Chai et al.38. While several previous studies have been undertaken on mitogenomes of Noctuoidea, relatively little is known about Lymantriidae specifically. Further taxon sampling within Lymantriidae and related families is required to resolve the placement of Lymantriidae in Noctuoidea.

Materials and Methods

Sample collection and mitochondrial DNA extraction

L. salicis larvae were collected from willow trees within the campus of Anhui Agricultural University, Hefei, China. Total genomic DNA was extracted using the Aidlab Genomic DNA Extraction Kit (Aidlab Co., Beijing, China) according to the manufacturer’s instructions. Quality of extracted DNA was assessed by electrophoresis on a 1% agarose gel stained with ethidium bromide.

Primer design, PCR amplification and sequencing

The full mitochondrial genome of L. salicis was PCR amplified in thirteen overlapping fragments, based on primers that were designed from known mitogenomes of Lymantriidae, and synthesized by Invitrogen Co. Ltd. Shanghai, China (Table 4). All PCRs were performed in a 50 μL reaction volume, including 35 μL sterilized distilled water, 5 μL 10 × Taq buffer (Mg2 + ), 4 μL dNTP (25 mM), 1.5 μL DNA, 2 μL of each primer (10 μM) and 0.5 μL (1 unit) Taq (TaKaRa Co., Dalian, China). PCR conditions were as follows: 4 min at 94 °C, followed by 35 cycles of 30 s at 94 °C, 40 s at 46–58 °C (Table 4), and 1–3 min (depending on putative length of the fragments) at 72 °C; and then a final extension step of 72 °C for 10 min.
Table 4

Details of the primers used to amplify the mitogenome of L. salicis.

Primer pairPrimer sequence (5′–3′)Annealing temperature
F1TAAAAATAAGCTAAATTTAAGCTT52 °C
R1TATTAAAATTGCAAATTTTAAGGA
F2AAACTAATAATCTTCAAAATTAT46 °C
R2AAAATAATTTGTTCTATTAAAG
F3ATTCTATATTTCTTGAAATATTAT46 °C
R3CATAAATTATAAATCTTAATCATA
F4TGAAAATGATAAGTAATTTATTT48 °C
R4AATATTAATGGAATTTAACCACTA
F5TAAGCTGCTAACTTAATTTTTAGT53 °C
R5CCTGTTTCAGCTTTAGTTCATTC
F6CCTAATTGTCTTAAAGTAGATAA48 °C
R6TGCTTATTCTTCTGTAGCTCATAT
F7TAATGTATAATCTTCGTCTATGTAA50 °C
R7ATCAATAATCTCCAAAATTATTAT
F8ACTTTAAAAACTTCAAAGAAAAA53 °C
R8TCATAATAAATTCCTCGTCCAATAT
F9GTAAATTATGGTTGATTAATTCG53 °C
R9TGATCTTCAAATTCTAATTATGC
F10CCGAAACTAACTCTCTCTCACCT58 °C
R10CTTACATGATCTGAGTTCAAACCG
F11CGTTCTAATAAAGTTAAATAAGCA55 °C
R11AATATGTACATATTGCCCGTCGCT
F12TCTAGAAACACTTTCCAGTACCTC52 °C
R12AATTTTAAATTATTAGGTGAAATT
F13TAATAGGGTATCTAATCCTAGTT48 °C
R13ACTTAATTTATCCTATCAGAATAA
All PCR products were visualized by electrophoresis on a 1.0% TAE agarose gel, and purified using a DNA gel extraction kit (Transgen Co., Beijing, China). The purified PCR fragments were ligated into the T-vector (TaKaRa Co., Dalian, China) and transformed into Escherichia coli DH5α, using the manufacturer’s protocol. Recombinants were cultured overnight at 37 °C on Luria-Bertani (LB) solid medium containing Ampicillin (AMP), isopropylthiogalactoside (IPTG) and 5-bromo-4-chloro-3-indolyl-D-galactopyranoside (X-Gal). White colonies carrying insert DNA were selected, cultured overnight in liquid media, and vector inserts were directly sequenced by Sangon Biotech Co., (Shanghai, China).

Sequence assembly and gene annotation

The complete mtDNA sequence was assembled using the SeqManII program from the Lasergene software package (DNAStar Inc., Madison, USA). Sequence annotation was performed using the NCBI’s web interface for BLAST (http://blast.ncbi.nlm.nih.gov/Blast). Nucleotide sequences of the PCGs were translated into putative proteins based on insect sequences available in GenBank. Initiation and termination codons were identified using an alignment created in ClustalX version 2.0, with other lepidopteran sequences as references. To describe base composition, we analyzed skew as described by Junqueira39: AT skew = [A − T]/[A + T], GC skew = [G − C]/[G + C]. The relative synonymous codon usage (RSCU) was obtained using MEGA 540. The tRNA genes were verified using the program tRNAscan-SE with default settings41, in addition to using the alignment to visually identify sequences with the appropriate anticodons capable of folding into the typical clover-leaf secondary structure. In the A + T-rich region, tandem repeats were found with the Tandem Repeats Finder program (http://tandem.bu.edu/trf/trf.html)42.

Phylogenetic analysis

A total of 29 sets of 13 PCG sequences were used to perform phylogenetic analysis, including those of L. salicis. Those from other taxa were downloaded from GenBank, with Drosophila melanogaster (U37541.1)43 and Locusta migratoria (JN858212)44 sequences used as an outgroup. Alignments of the 13 concatenated PCGs were conducted using ClustalX version 2.0. Maximum likelihood (ML) phylogenetic analysis was performed using MEGA 5.0 with Tamura-Nei model40. Neighbor Joining (NJ) distance analysis was performed using PAUP4b1045, and Bayesian Inference (BI) MCMC phylogenetic analysis was performed using MrBayes 3.246. The ML analysis was pseudosampled with 1000 bootstrapped datasets. The NJ analysis was done with 1000 bootstrap replicates. The BI analysis used four chains MCMC, running for 1,000,000 generations, with trees being sampled every 1000 generations. The consensus tree was visualized using FigTree v1.4.0 (http://tree.bio.ed.ac.uk/software/figtree/).

Additional Information

How to cite this article: Sun, Y.-X. et al. Characterization of the Complete Mitochondrial Genome of Leucoma salicis (Lepidoptera: Lymantriidae) and Comparison with Other Lepidopteran Insects. Sci. Rep. 6, 39153; doi: 10.1038/srep39153 (2016). Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
  36 in total

1.  A novel type of RNA editing occurs in the mitochondrial tRNAs of the centipede Lithobius forficatus.

Authors:  D V Lavrov; W M Brown; J L Boore
Journal:  Proc Natl Acad Sci U S A       Date:  2000-12-05       Impact factor: 11.205

2.  Genetic variability of four natural isolates of the Stilpnotia salicis multiple-enveloped nuclear polyhedrosis virus.

Authors:  L Strokovskaya; J Ziemnicka; J Michalik
Journal:  Acta Biochim Pol       Date:  1996       Impact factor: 2.149

3.  Molecular phylogeny of silkmoths reveals the origin of domesticated silkmoth, Bombyx mori from Chinese Bombyx mandarina and paternal inheritance of Antheraea proylei mitochondrial DNA.

Authors:  K P Arunkumar; Muralidhar Metta; J Nagaraju
Journal:  Mol Phylogenet Evol       Date:  2006-04-27       Impact factor: 4.286

4.  Complete mitochondrial genome of Chilo suppressalis (Walker) (Lepidoptera: Crambidae).

Authors:  Jiao Yin; Ai-Min Wang; Gui-Yun Hong; Ya-Zhong Cao; Zhao-Jun Wei
Journal:  Mitochondrial DNA       Date:  2011-06

Review 5.  Mitochondrial transcription initiation. Variation and conservation.

Authors:  G S Shadel; D A Clayton
Journal:  J Biol Chem       Date:  1993-08-05       Impact factor: 5.157

6.  Mitochondrial genome of the cotton bollworm Helicoverpa armigera (Lepidoptera: Noctuidae) and comparison with other Lepidopterans.

Authors:  Jiao Yin; Gui-Yun Hong; Ai-Min Wang; Ya-Zhong Cao; Zhao-Jun Wei
Journal:  Mitochondrial DNA       Date:  2010-10

7.  The complete mitochondrial genome of the fall webworm, Hyphantria cunea (Lepidoptera: Arctiidae).

Authors:  Fang Liao; Lin Wang; Song Wu; Yu-Ping Li; Lei Zhao; Guo-Ming Huang; Chun-Jing Niu; Yan-Qun Liu; Ming-Gang Li
Journal:  Int J Biol Sci       Date:  2010-03-29       Impact factor: 6.580

8.  Molecular species identification of Central European ground beetles (Coleoptera: Carabidae) using nuclear rDNA expansion segments and DNA barcodes.

Authors:  Michael J Raupach; Jonas J Astrin; Karsten Hannig; Marcell K Peters; Mark Y Stoeckle; Johann-Wolfgang Wägele
Journal:  Front Zool       Date:  2010-09-13       Impact factor: 3.172

9.  MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space.

Authors:  Fredrik Ronquist; Maxim Teslenko; Paul van der Mark; Daniel L Ayres; Aaron Darling; Sebastian Höhna; Bret Larget; Liang Liu; Marc A Suchard; John P Huelsenbeck
Journal:  Syst Biol       Date:  2012-02-22       Impact factor: 15.683

10.  The complete mitochondrial genome of the bag-shelter moth Ochrogaster lunifer (Lepidoptera, Notodontidae).

Authors:  Paola Salvato; Mauro Simonato; Andrea Battisti; Enrico Negrisolo
Journal:  BMC Genomics       Date:  2008-07-15       Impact factor: 3.969

View more
  10 in total

Review 1.  Integrins in the Immunity of Insects: A Review.

Authors:  Saima Kausar; Muhammad Nadeem Abbas; Isma Gul; Yu Liu; Bo-Ping Tang; Iram Maqsood; Qiu-Ning Liu; Li-Shang Dai
Journal:  Front Immunol       Date:  2022-06-09       Impact factor: 8.786

2.  Comparative mitogenomics of Drosophilidae and the evolution of the Zygothrica genus group (Diptera, Drosophilidae).

Authors:  Maiara Hartwig Bessa; Francine Cenzi de Ré; Rafael Dias de Moura; Elgion Lucio Loreto; Lizandra Jaqueline Robe
Journal:  Genetica       Date:  2021-10-05       Impact factor: 1.082

3.  Mitochondrial Genomes of Two Bombycoidea Insects and Implications for Their Phylogeny.

Authors:  Zhao-Zhe Xin; Xiao-Yu Zhu; Ying Wang; Hua-Bin Zhang; Dai-Zhen Zhang; Chun-Lin Zhou; Bo-Ping Tang; Qiu-Ning Liu
Journal:  Sci Rep       Date:  2017-07-26       Impact factor: 4.379

4.  Sequencing and characterization of the complete mitochondrial genome of Japanese Swellshark (Cephalloscyllium umbratile).

Authors:  Ke-Cheng Zhu; Yin-Yin Liang; Na Wu; Hua-Yang Guo; Nan Zhang; Shi-Gui Jiang; Dian-Chang Zhang
Journal:  Sci Rep       Date:  2017-11-10       Impact factor: 4.379

5.  A supertree of Northern European macromoths.

Authors:  Robert B Davis; Erki Õunap; Toomas Tammaru
Journal:  PLoS One       Date:  2022-02-18       Impact factor: 3.240

6.  Unfolding the mitochondrial genome structure of green semilooper (Chrysodeixis acuta Walker): An emerging pest of onion (Allium cepa L.).

Authors:  Soumia P S; Dhananjay V Shirsat; Ram Krishna; Guru Pirasanna Pandi G; Jaipal S Choudhary; Naiyar Naaz; Karuppaiah V; Pranjali A Gedam; Anandhan S; Major Singh
Journal:  PLoS One       Date:  2022-08-30       Impact factor: 3.752

7.  Comparative analysis of mitochondrial genomes of geographic variants of the gypsy moth, Lymantria dispar, reveals a previously undescribed genotypic entity.

Authors:  Abdelmadjid Djoumad; Audrey Nisole; Reza Zahiri; Luca Freschi; Sandrine Picq; Dawn E Gundersen-Rindal; Michael E Sparks; Ken Dewar; Don Stewart; Halim Maaroufi; Roger C Levesque; Richard C Hamelin; Michel Cusson
Journal:  Sci Rep       Date:  2017-10-27       Impact factor: 4.379

8.  Characterization of the complete mitochondrial genome of Orthaga olivacea Warre (Lepidoptera Pyralidae) and comparison with other Lepidopteran insects.

Authors:  Liangli Yang; Junjun Dai; Qiuping Gao; Guozhen Yuan; Jiang Liu; Yu Sun; Yuxuan Sun; Lei Wang; Cen Qian; Baojian Zhu; Chaoliang Liu; Guoqing Wei
Journal:  PLoS One       Date:  2020-03-06       Impact factor: 3.240

9.  The complete mitochondrial genome of Dysgonia stuposa (Lepidoptera: Erebidae) and phylogenetic relationships within Noctuoidea.

Authors:  Yuxuan Sun; Yeshu Zhu; Chen Chen; Qunshan Zhu; Qianqian Zhu; Yanyue Zhou; Xiaojun Zhou; Peijun Zhu; Jun Li; Haijun Zhang
Journal:  PeerJ       Date:  2020-03-16       Impact factor: 2.984

10.  Mitochondrial genome sequencing and phylogeny of Haemagogus albomaculatus, Haemagogus leucocelaenus, Haemagogus spegazzinii, and Haemagogus tropicalis (Diptera: Culicidae).

Authors:  Fábio Silva da Silva; Ana Cecília Ribeiro Cruz; Daniele Barbosa de Almeida Medeiros; Sandro Patroca da Silva; Márcio Roberto Teixeira Nunes; Lívia Carício Martins; Jannifer Oliveira Chiang; Poliana da Silva Lemos; Gabriel Muricy Cunha; Renato Freitas de Araujo; Hamilton Antônio de Oliveira Monteiro; Joaquim Pinto Nunes Neto
Journal:  Sci Rep       Date:  2020-10-12       Impact factor: 4.379

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.