Literature DB >> 29021958

Complete mitochondrial genome sequence of fruit-piercing moth Eudocima phalonia (Linnaeus, 1763) (Lepidoptera: Noctuoidea).

Kuppusamy Sivasankaran1, Pratheesh Mathew1, Sekar Anand1, Stanislaus Antony Ceasar1,2, Soosaimanikam Mariapackiam1, Savarimuthu Ignacimuthu1.   

Abstract

The complete mitochondrial genome of the fruit piercing moth Eudocima phalonia (Linnaeus, 1763) (Lepidoptera: Noctuoidea) was sequenced and characterized (Genbank Accession No: KY196412). The complete mitogenome is a circular molecule of 15,575 bp length, consisting of 13 protein-coding genes (PCGs), two ribosomal RNA genes (rrnS and rrnL), 22 transfer RNA (tRNA) genes and an A + T-rich region (D-loop). The nucleotide composition of the genome is highly A + T biased, accounting for 80.67% of nucleotides. All tRNAs have putative secondary structures that are characteristic of mitochondrial tRNA. Most of the PCGs were initiated by typical ATN codons. Five genes were initiated by unusual codons. Cox1 gene was initiated by an unusual CGA codon and terminated by the typical stop codon GAA. Six genes ended with a single T. The A + T-rich region of 336 bp consisted of repetitive sequences, including two ATAGA motifs, a 19 bp poly-T stretch and three microsatellite-like regions ((TA)4, (TA)6 and two (TA)7). Moreover, three large tandem (one 40 bp and two 25 bp) repeated elements were identified in A + T-rich region. Phylogenetic analysis using PCGs revealed that Superfamily Noctuoidea is a monophyletic group.

Entities:  

Keywords:  Eudocima phalonia; Lepidoptera; Mitochondrial genome; Protein coding genes

Year:  2017        PMID: 29021958      PMCID: PMC5633087          DOI: 10.1016/j.gdata.2017.09.004

Source DB:  PubMed          Journal:  Genom Data        ISSN: 2213-5960


Introduction

The genomes and genes of mitochondria have been widely used as informative molecular markers in studies of comparative and evolutionary genomics, reconstruction of phylogenetic relationships, population genetics and evolutionary biology [1], [2], [3], [4], [5]. Insect mitochondrial genome (mtDNA) is a circular molecule, ranging from 14 to 19 kb in length which encodes 37 genes including 13 protein-coding genes (PCGs), 2 ribosomal RNA (rRNA) genes and 22 transfer RNA (tRNA) genes. In addition, it contains a control region known as the adenine (A) + thymine (T)-rich region which is common in all insect mtDNA. This region is involved in the initiation of both transcription and replication [6]. The order Lepidoptera (butterflies and moths) is one of the largest groups of insects, accounting for > 157,424 described species. It represents one of the evolutionarily most successful lineages of phytophagous insects [7]. The superfamily Noctuoidea within the order Lepidoptera comprises 43,000 described species [8]. Besides, this order includes a number of biological model organisms, many severe pest species, and many best known and most popular invertebrates, giving emphasis to the fact that studies of lepidopteran phylogeny and evolution are of both scientific and public interest. Despite the huge taxonomic diversity, the current information on the lepidopteran mitogenome is very limited. Only a few reports are available for the characterization of the mitogenomes of Lepidoptera species [9]. Only 57 lepidopteran mitogenomes have been sequenced so far, including 27 butterflies and nearly 30 moths; only 27 are from superfamily Noctuoidea. Sequencing and characterization of mitogenome of more species in Noctuoidea will provide further insight into the deep level evolutionary relationships among these agriculturally important insects. In this study, we present the complete mitogenome sequence of Eudocima phalonia (Linnaeus, 1763) (Genbank Accession No: KY196412) and compare it with those of other Noctuoidea species. E. phalonia, commonly known as the fruit-piercing moth, is an important pest of citrus, several commercial fruit crops and beverages [10], [11]. E. phalonia uses its proboscis to perforate the fruits and suck sap to feed. Larval stage has no such necessity because host plants usually vary for larva and adult stages. Caterpillars are predominantly foliage feeders of numerous wild plants, shrubs and vines within the families Menispermaceae and Fabaceae. The adult species have sexual dimorphism, the males with more frequently lineated forewings and the females with more irregularly marked and mottled. This species is distributed in India, Thailand, Myanmar, Vietnam, Nepal, China, Taiwan, Japan, Korea, South of Russian Far East (migrant), Philippines, Indonesia (New Guinea), Micronesia, Australia, New Zealand, Central Africa (Gabon, Zaire, Congo) [12].

Materials and methods

Insect collection and total DNA extraction

Adult E. phalonia moths were collected from Kodaikanal (10° 23′ 5367″ N 77° 49′ 2933″ E) Western Ghats, Tamil Nadu, India. The total DNA was isolated from single individual (thoracic region) using DNA extraction Kit (Genotypic Technology Pvt. Ltd. Bangalore, India). The voucher specimen was deposited at Entomology Research Institute, Loyola College, Chennai (Vouchers Number: ERILMC-049).

Mitogenome sequencing

Whole mitogenome sequencing (WmtGS) libraries were prepared with Illumina-compatible NEXTflex DNA sequencing kit (BIOO Scientific, Austin, Texas, U.S.A.). Briefly, about 100 ng of mitochondrial enriched DNA was sheared using Covaris S2 sonicator (Covaris, Woburn, Massachusetts, USA) to generate approximate fragment size distribution from 200 bp to 400 bp. The fragment size distribution was checked on Agilent 2200 Tape Station with D1000 DNA screen tapes and reagents (Agilent Technologies, Palo Alto, CA, USA) and subsequently purified using HighPrep magnetic beads (Magbio Genomics Inc., USA). The purified fragments were end-repaired, adenylated and ligated to Illumina multiplex barcode adaptors as per NEXT Flex DNA sequencing kit protocol. The adapters used in the study were Illumina Universal Adapters: 5′AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATCT-3′ and Index Adapter: 5′-GATCGGAAGAGCACACGTCTGAACTCCAGTCAC [INDEX] ATCTCGTATGCCGTCTTCTGCTTG-3′. Index unique sequence to identify sample-specific sequence was “TCGGCA”. The adapter-ligated DNA was purified with HighPrep beads and then amplified for 10 cycles of PCR using Illumina-compatible primers provided in the NEXTFlex DNA sequencing kit. The final PCR product (sequencing library) was purified with HighPrep beads, followed by library quality control check. The Illumina-compatible sequencing library was initially quantified by Qubit fluorometer (Thermo Fisher Scientific, MA, USA) and its fragment size distribution was analysed on Agilent Tape Station.

Sequence assembly and gene annotation

The raw sequences were assembled using Denovo Genome assembly (SPAdes assembler). The sequence annotation was executed using MITOS online tool [13]. The PCGs were recognized by sequence resemblance with that of known lepidopteran sequences. The PCG's nucleotide sequences were deciphered on the basis of the invertebrate mitochondrial genetic code. The PCGs alignment of various lepidopteran mitogenome was executed using Clustal Omega [14]. The composition skewness was calculated using the formula: AT skew = [A − T] / [A + T]; similarly GC skew = [G − C] / [G + C] [15]. The tRNA genes and their putative secondary structures were produced using MITOS software and analysed by comparison with the nucleotide sequence of other lepidopteran tRNA sequences. Tandem repeats at the A + T-rich region were identified using the online Tandem Repeats Finder tool (http://tandem.bu.edu/trf/trf.html) [16]. Relative Synonymous Codon Usage (RSCU) of PCGs was determined using codon usage calculator (www.biologicscorp.com/tools/CodonUsageCalculator).

Phylogenetic analysis

The phylogenetic relationship among the superfamily Noctuoidea was analysed using 13 PCGs and 2 rRNA of mitochondrial genome. PCGs of mitogenomes of 27 species, including one obtained through this study, were used to reconstruct the phylogenetic tree (Table 1). The sequences of remaining 26 mitogenomes were obtained from the Genbank. The mitogenomes of Drosophila yakuba (X03240), Anopheles gambiae (L20934) and Tribolium casteneum (AJ312413) were used as outgroups. The amino acid sequences of 13 mitochondrial PCGs were aligned using ClustalW and concatenated. The concatenated sets of amino acid sequences from the PCGs were used in the reconstruction of phylogenetic tree using the Model based Maximum Likelihood method using the MEGA version 6.0 program [17]. The most appropriate model Jones-Taylor-Thornton (JTT) was used to infer the phylogenetic tree with 1000 bootstrap replicates.
Table 1

List of the complete mitogenome of the superfamily Noctuoidea reported so far.

FamilySpeciesAccessionReference
NotodontidaePhalera flavescensJF440342[27]
Ochrogaster luniferAM946601[28]
ArctiidaeHyphantria cuneaGU592049[29]
Amata formosaeKC513737[25]
DoidaeDoa sp.KJ508058[20]
ErebidaeNyctemera arctata albofasciataKM244681[18]
Eudocima phaloniaKY196412This study
Catocala deuteronymphaKJ432280[9]
Asota plana lacteataKJ173908[9]
LymantriidaeLymantria disparFJ617240[23]
Gynaephora menyuanensisKC185412[26]
NoctuidaeSesamia inferensJN039362[30]
Ctenoplusia agnataKC414791[22]
Ctenoplusia limbirenaKM244665[18] unpublished
Spodoptera exiguaJX316220[36]
Spodoptera lituraKF543065[34]
Agrotis ipsilonKF163965[37]
Agrotis segetumKC894725[24]
Helicoverpa punctigeraKF977797[35]
Helicoverpa armigeraGU188273[19]
Helicoverpa zeaKJ930516[43]
Noctua pronubaKJ508057[20]
Acronicta psiKJ508060[20]
Striacosta albicostaKM488268Coates B.S. & Abel C.A., unpublished
Mythimna separataKM099034[21]
Risoba prominensKJ396197[9]
EuteliidaeEutelia adulatricoidesKJ185131[9]
ColeopteraTribolium casteneumAJ312413Friedrich and Muqim [40]
DipteraDrosophila yakubaX03240Clary et al. [41]
Anopheles gambiaeL20934Beard et al. [42]
List of the complete mitogenome of the superfamily Noctuoidea reported so far.

Results and discussion

Genome organization and base composition

The complete mitogenome of E. phalonia is a closed circular molecule consisting of 15,575 bp length. The complete mitogenome sequence was submitted to the Genbank (Accession Number: KY196412). The sizes of mitogenomes sequenced so far in superfamily Noctuoidea moths ranged from 15,261 bp in Ctenoplusia agnata to 15,671 bp in Catocala deuteronympha [9]. The length of E. phalonia mitogenome falls within the range of mitogenomes of other Noctuoid moths sequenced. The organization of the E. phalonia complete mitogenome is shown in Fig. 1. The E. phalonia mitogenome contains 13 PCGs viz. ATPase subunits 6 and 8 (atp6 and atp8), cytochrome c oxidase subunits 1–3 (cox1, cox2 and cox3), NADH dehydrogenase subunits 1–6 (nad1, nad2, nad3, nad4, nad5 and nad6) subunit 4 L of nad (nad4L) and cytochrome B (cob), 2 ribosomal RNA genes (rrnS and rrnL), 22 tRNA genes (one each for 18 amino acids and two each for leucine and serine), and a major non-coding region known as the A + T-rich region (Table 2). The order of three tRNA genes of E. phalonia mitogenome was trnM-trnI-trnQ (Fig. 1), which differed from ancestral order trnI-trnQ-trnM [44]. However, the position of the trnM of E. phalonia (15,500–15,566) was similar to that of other Noctuoidea such as Nyctemera arctata albofasciata [18], Risoba prominens [9], Helicoverpa armigera [19], Noctua pronuba [20] and Eutelia adulatricoides [9].
Fig. 1

Map of the complete mitochondrial genome of Eudocima phalonia (Lepidoptera: Noctuoidea). The length of this circular genome is 15,575 bp. The sequence starts at A + T-Rich region. The regions corresponding to 13 proteins genes (PCGs), two ribosomal RNA genes and 22 transfer RNA genes are also indicated.

Table 2

Details on gene organization of the Eudocima phalonia mitogenome.

GeneDirectionNucleotide no.SizeIGNcAnticodonStart codonStop codon
trnlF1–57570GAT
trnQR60–128692TTG
nad2F225–109787396ATTTAG
trnWF1195–12636997TCA
trnCR1256–132166− 8GCA
trnYR1330–1395668GTA
cox1F1421–2935151525ATGTAA
trnL2 (UUR)F2955–30226619TAA
cox2F3023–36886660ATGTGA
trnKF3705–37757116CTT
trnDF3776–3842670GTC
atp8F3843–40071650ATTTAA
atp6F4007–46726660ATATAA
cox3F4681–54637838ATGTAA
trnGF5469–5536685
nad3F5543–58813397ATATAA
trnAF5924–59916842TGC
trnRF5994–6057642TCG
trnNF6066–6131668GTT
trnS1F6172–62376640GCT
trnEF6241–6306663TTC
trnFR6309–6375672GAA
nad5R6386–8065168010ATATAA
trnHR8117–81836751GTG
nad4R8188–952213354AAACAT
nad4LR9550–982227327ATATAA
trnTF9846–99106523TGT
trnPR9911–9975650TGG
nad6F9986–10,50151610ATTTAA
cobF10,554–11,651109852ATTTAA
trnS2F11,693–11,7576541TGA
nad1R11,789–12,68890031ATTTAA
trnL1 (CUN)R12,716–12,7836827TAG
rrnLR12,763–14,1531391− 20
trnVR14,150–14,21667− 3TAC
rrnSR14,217–15,0378210
trnMF15,500–15,566670CAT
A + T rich region15,041–15,3763360
Map of the complete mitochondrial genome of Eudocima phalonia (Lepidoptera: Noctuoidea). The length of this circular genome is 15,575 bp. The sequence starts at A + T-Rich region. The regions corresponding to 13 proteins genes (PCGs), two ribosomal RNA genes and 22 transfer RNA genes are also indicated. Details on gene organization of the Eudocima phalonia mitogenome.

Protein-coding genes (PCGs)

The 13 PCGs of E. phalonia mitogenome include nad1–6, nad4L, three cox subunits, two ATPase subunits and one cob. The 13 PCGs are 10,809 bp in length and account for 69.39% of the whole mitogenome. Twelve of the PCGs in E. phalonia mitochondrial genome were initiated by typical ATN codons. One PCG was initiated by an unusual codon. Specifically, 3 PCGs (cox1, cox2 and cox3) started with ATG, 4 PCGs (nad3, nad4l, nad5 and atp6) started with ATA, 5 PCGs (atp8, nad1, nad2, nad6 and cob) started with ATT and one PCG (nad4) started with AAA. For the stop codons, ten PCGs (cox1, nad1, nad3, nad4L, nad5, nad6, cox3, atp8, atp6 and cob) terminated with TAA and 3 PCGs nad4, cox2, nad2 terminated with CAT, TGA, TAG respectively (Table 2). The organization of the PCGs was unchanged as seen in the other sequenced lepidopterans. The start codon for cox1 gene of the lepidopteran insects is not consistent. Change of start codon is common for cox1 gene in all lepidopterans. The A + T nucleotide composition of 13 PCGs in the mitogenome of E. phalonia is 78.39%. The Relative Synonymous Codon usage (RSCU) of PCGs of the E. phalonia was examined and the results are shown in Table 3. In the PCGS of the E. phalonia mitogenome, the codons CTG, GCG, CAG and CGG are not represented. The genome-wise A + T bias is also positively correlated in the codon usage of the E. phalonia. The six most frequently used codons in the E. phalonia mitogenome TTT (Phe), TTA (Leu), ATT (Ile), ATA (Met), TAT (Tyr) and AAT (Asn) accounted for 41.23%. These codons are all composed of A or T nucleotides thus indicating the biased usage of A and T nucleotides in the E. phalonia PCGs. Leucine (9.6%), Isoleucine (10.87%), Phenylalanine (7.18%) and serine (11.60%) were the most frequent amino acids in E. phalonia mitochondrial protein (39.25%). These amino acids are also the most commonly present in other Noctuoidea moths (Fig. 5).
Table 3

Codon usage of the protein-coding genes in Eudocima phalonia mitogenome.a

Codonn%RSCUCodonn%RSCUCodonn%RSCU
TTT(F)1985.530.77CCA(P)290.810.25GAT(D)350.970.83
TTC(F)591.640.23CCG(P)10.020.01GAC(D)70.190.17
TTA(L)2336.510.68ATT(T)782.180.40GAA(E)441.230.98
TTG(L)230.640.07ACC(T)431.200.22GAG(E)10.020.02
CTT(L)451.260.13ACA(T)651.810.33TGT(C)200.550.64
CTC(L)160.450.05ACG(T)90.250.05TGC(C)110.300.36
CTA(L)280.780.08GCT(A)481.340.61TGA(W)802.230.92
CTG(L)00.00.00GCC(A)40.110.05TGG(W)70.190.08
ATT(I)3459.640.89GCA(A)270.750.34CGT(R)80.220.22
ATC(I)441.230.11GCG(A)00.000.00CGT(R)10.020.03
ATA(M)2095.840.93TAT(Y)2105.870.87CGA(R)270.750.75
ATG(M)160.440.07TAC(Y)310.860.13CGG(R)00.000.00
GTT(V)320.890.39TAAb2145.980.99AGT(S)371.030.09
GTC(V)20.050.02TAGb30.080.01AGC(S)350.970.08
GTA(V)481.340.58CAT(H)601.670.81AGA(S)701.950.17
GTG(V)10.020.01CAC(H)140.390.19AGG(S)230.640.06
TCT(S)1213.380.29CAA(Q)531.481.00GGT(G)310.860.25
TCC(S)330.920.08CAG(Q)00.000.00GGC(G)20.050.02
TCA(S)882.460.21AAT(N)2807.820.80GGA(G)852.370.69
TCG(S)80.220.02AAC(N)701.950.20GGG(G)50.130.04
CCT(P)531.480.46AAA(K)1724.800.98
CCC(P)310.860.27AAG(K)40.110.02

A total of 3577 codons were analysed omission of the initiation and terminations codons. RSCU, relative synonymous codon usage.

Stop codon.

Fig. 5

Relative synonymous codon usage (RSCU) in Noctuoidea mtDNAs. The species name represents the superfamily Noctuoidea.

Codon usage of the protein-coding genes in Eudocima phalonia mitogenome.a A total of 3577 codons were analysed omission of the initiation and terminations codons. RSCU, relative synonymous codon usage. Stop codon.

Intergenic spacers and overlapping sequences

The mitogenome of E. phalonia contains a total of 53 bp overlaps between genes in five locations ranging from 3 to 20 bp in length. The longest 20 bp overlap was found in two locations: trnG-nad3 and trnL1-rrnL (Table 2). The overlapping located between trnL1-rrnL was also observed in other Noctuoidea species, such as E. adulatricoides (Mell) [9], Mythimna separata (Walker) [21], Ctenoplusia limbirena (Guenee) [18], Ctenoplusia agnata (Staudinger) [22], Lymantria dispar (Linnaeus) [23], Agrotis segetum (Denis & Schiffermuller) [24], Amata formosae [25] and Doa sp. [20]. The intergenic spacer sequence of E. phalonia mitogenome ranged from 2 to 97 bp, totalling 656 bp in length which was distributed in 26 regions (Table 2.). This intergenic spacers were longer than those of other Noctuoidea moths, including Doa sp. (566 bp over 23 regions), N. arctata albofasciata (594 bp over 25 regions), R. prominens (555 bp over 21 regions), H. armigera (618 bp over 25 regions), C. limbirena (599 bp over 24 regions), C. agnata (573 bp over 26 regions), N. pronuba (624 bp over 26 regions), E. adulatricoides (620 bp over 22 region), A. segetum (642 bp over 25 regions), Agrotis ipsilon (630 bp over 25 regions) and A. formosae (583 bp over 23 regions); however they were shorter than Asota plana [9] (678 bp over 25 regions), Gynaephora menyuanensis [26] (776 bp over 26 regions), L. dispar (703 bp over 26 regions), Acronicta psi (659 bp over 26 regions), Catocala deuteronympha (740 bp over 28 regions), Phalera flavescens [27] (747 bp over 26 regions), Ochrogaster lunifer [28] (764 bp over 27 regions), Hyphantria cunea [29] (705 bp over 26 regions) and Striacosta albicosta (762 bp over 25 regions). The longest intergenic spacers of 96 and 97 bp were located between trnQ-nad2 and nad2-trnW respectively in E. phalonia. In Ctenoptilum vasava, similar intergenic spacer length of 97 bp was found between nad2 and trnW genes [5]. However the first spacer of 96 bp found in E. phalonia through this study is not present in other lepidopterans. The 33 bp intergenic spacer region between trnS2 and nad1 contained the ‘ATACTAA’ motif. The 7 bp motif is a common feature among the 17 species selected from superfamily Noctuoidea, showing that this region is conserved and present in maximum of mitogenomes (Fig. 3b).
Fig. 3

a) Motifs and tandem repeats found in the A + T-rich region of Eudocima phalonia mitochondrial genome. These are indicated by specific colours and highlights. Two motifs (ATAGA) are shown in bold with green highlights. A 19 bp poly stretch (poly T) is shown in red colour with grey highlight. Microsatellites ((TA)4, (TA)6, two (TA)7) are shown in red. Three 40 bp tandem repeats are shown in blue, yellow and green highlights. Two 25 bp tandem repeats are shown in violet and grey highlights.

b) Sequence alignment of the intergenic spacer region between trnS2 (TGA) and ND1 of 17 Noctuoidea moth species. The aligned nucleotides indicate the conserved motif ‘ATACTAA’.

Genome composition and skewness

The nucleotide composition of the mitogenome of E. phalonia was found to be as follows: A = 6205 (39.83%), T = 6362 (40.84%), G = 1175 (7.54%) and C = 1833 (11.76%). As witnessed in other lepidopterans, the nucleotide composition of the E. phalonia mitogenome is heavily biased toward A + T (80.67%) (Table 4.); this was higher than other lepidopterans species such as O. lunifer (77.83%) A. psi (79.08%), S. albicosta (79.32%), A. formosae (79.48%) and N. arctata albofasciata (79.64%); it was slightly lower than Gynaephora menyuanensis (81.47%), Helicoverpa punctigera (81.34%), Agrotis ipsilon (81.24%), Spodoptera litura (81.03%), C. agnata (81.08%), C. limbirena (81.01%), N. pronuba (81.06%), R. prominens (81.05), Catocala deuteronympha (81.11%) M. separata (81.00%) and Helicoverpa zea (81.00%). The AT skew and GC skew were calculated for all Noctuoidea mitogenomes; they are presented in Table 4. The AT skew for the E. phalonia mitogenome was marginally negative (− 0.012), representing a higher amount of T and A nucleotides. Comparable results were observed in P. flavescens (− 0.009), A. formosae (− 0.027), N. arctata albofasciata (− 0.013), A. ipsilon (− 0.005), A. segetum (− 0.003), C. agnata (− 0.030), C. limbirena (− 0.035), E. adulatricoides (− 0.005), Sesamia inferens [30] (− 0.001), N. pronuba (− 0.018), M. separata (− 0.011), R. prominens (− 0.007) and Catocala deuteronympha (− 0.021) for AT skewness. The AT skew of tRNA and rRNA genes in the E. phalonia mitogenome were 0.005 and − 0.023 respectively. The AT skew at A + T-rich region of E. phalonia mitogenome was − 0.067, showing a partiality for T over A nucleotide. The GC skew values were hostile in all sequenced lepidopteran mitogenomes, showing a higher content of C. The GC skew of E. phalonia for 2 rRNA genes was − 0.369, which was higher than that detected for tRNA, PCGs and the A + T-rich regions.
Table 4

Nucleotide compositions and skewness in superfamily Noctuoidea mitogenomes.

SpeciesSize (bp)A %G%T%C%A + T%AT skewGC skew
E. phalonia15,57539.837.5440.8411.7680.67− 0.012− 0.218
P. flavescens15,65940.077.8640.8011.2580.87− 0.009− 0.177
O. lunifer15,59340.097.4337.7414.5977.830.030− 0.325
H. cunea15,48140.577.5539.8112.0580.380.009− 0.229
A. formosae15,46338.667.5240.8212.9779.48− 0.027− 0.0.267
Doa sp.15,22840.417.7640.0111.6480.420.004− 0.2
N. arctata albofasciata15,43139.277.3440.3711.6679.64− 0.013− 0.227
L. dispar15,56940.587.5739.2912.5579.870.016− 0.247
G. menyuanensis15,77040.876.8840.6011.7581.470.003− 0.261
H. armigera15,34740.547.6840.4311.3380.970.001− 0.192
H. punctigera15,38240.697.5840.6511.0681.340.000− 0.186
H. zea15,34340.597.5840.4011.4181.000.002− 0.202
A. ipsilon15,37740.387.7140.8611.0381.24− 0.005− 0.177
A. segetum15,37840.207.7940.4911.5080.69− 0.003− 0.192
S. litura15,37441.037.6040.0011.3681.030.012− 0.198
S. exigua15,36540.877.6740.0511.3880.920.010− 0.194
C. agnata15,26139.577.7041.5111.1981.08− 0.030− 0.184
C. limbirena15,30639.067.8441.9511.1581.01− 0.035− 0.174
E. adulatricoides15,36040.207.8040.6511.3280.85− 0.005− 0.184
S. inferens15,41340.067.6140.1712.1580.23− 0.001− 0.229
N. pronuba15,31539.797.8041.2711.0081.06− 0.018− 0.170
A. psi15,35040.897.8238.1912.9279.080.034− 0.168
Striacosta albicosta15,55340.147.8739.1812.7979.320.012− 0.238
M. separata15,32940.027.6640.9811.3381.00− 0.0110.193
R. prominens15,34340.247.8040.8111.1381.05− 0.007− 0.175
Catocala deuteronympha15,67139.687.2641.4311.6181.11− 0.021− 0.230
Asota plana15,41640.087.4940.2612.1680.34− 0.002− 0.237



PCGs
E. phalonia10,80938.908.5539.4913.0478.39− 0.007− 0.207
P. flavescens11,21133.7310.9545.2310.0878.07− 0.146− 0.041
O. lunifer11,26632.4712.0843.2612.1975.73− 0.142− 0.004
H. cunea11,20533.598.1045.0010.4279.59− 0.1450.027
A. formosae10,78237.928.5139.3914.1677.31− 0.0190.249
Doa sp.10,74739.708.7338.5512.6078.650.009− 0.181
N. arctata albofasciata10,76139.078.3339.8612.7278.93− 0.010− 0.208
L. dispar11,23633.2211.2644.6210.9077.84− 0.1460.016
G. menyuanensis11,22834.6310.1845.1510.0479.78− 0.1320.007
H. armigera10,79139.798.6439.2012.3878.950.0060.177
H. punctigera10,78839.848.5339.5812.0479.420.0030.170
H. zea10,88739.848.4739.2312.4379.080.007− 0.189
A. ipsilon10,82239.538.6639.8611.9379.39− 0.004− 0.158
A. segetum10,78539.298.8739.2512.5778.540.00040.172
S. litura11,20640.308.3839.2612.0679.560.012− 0.180
S. exigua10,76640.088.6938.9212.2879.000.0146− 0.171
C. agnata10,78238.818.5840.6011.9979.41− 0.022− 0.165
C. limbirena10,76438.198.8340.9012.0679.09− 0.034− 0.154
E. adulatricoides10,80039.258.8239.7412.1878.99− 0.006− 0.159
S. inferens10,90539.358.5038.9613.1778.310.005− 0.215
N. pronuba10,74439.088.7840.2811.8479.36− 0.015− 0.148
A. psi10,72540.228.8636.6914.2076.910.045− 0.231
Striacosta albicosta11,21039.468.7937.9913.7677.460.019− 0.220
M. separata11,21139.438.3840.1712.0379.59− 0.009− 0.179
R. prominens10,50638.978.8440.1512.0279.12− 0.102− 0.152
Catocala deuteronympha10,88438.808.2340.2812.6679.08− 0.187− 0.212
Asota plana10,62639.188.5938.7813.4377.960.005− 0.219



tRNA
E. phalonia146240.908.2740.4910.3281.390.005− 0.110
P. flavescens148541.627.8140.619.9782.220.012− 0.121
O. lunifer166641.787.3339.8611.0481.630.023− 0.202
H. cunea147441.867.8739.8910.3881.750.024− 0.138
A. formosae146640.457.9840.2411.3280.690.002− 0.173
Doa sp.145641.558.1739.6210.6481.180.023− 0.131
N. arctata albofasciata144040.908.0540.4810.5581.380.005− 0.134
L. dispar146941.667.9639.3511.0381.010.029− 0.162
G. menyuanensis150441.297.3841.769.5783.05− 0.006− 0.129
H. armigera147141.408.1540.3810.0681.780.012− 0.104
H. punctigera147841.678.1840.189.9481.850.018− 0.097
H. zea147541.498.1340.2010.1681.690.015− 0.111
A. ipsilon147541.158.1340.4710.2381.620.008− 0.114
A. segetum147140.728.2240.8510.1981.57− 0.001− 0.107
S. litura147342.237.9439.5810.2581.810.032− 0.127
S. exigua146841.628.0340.0510.2881.670.019− 0.122
C. agnata147241.108.2240.2810.3981.380.010− 0.116
C. limbirena146640.388.1841.3310.0981.71− 0.011− 0.104
E. adulatricoides147541.358.0640.4710.1081.820.010− 0.111
S. inferens147840.868.2540.6610.2181.520.002− 0.106
N. pronuba146840.538.5140.6610.2881.190.001− 0.094
A. psi140242.227.9839.378.2781.590.034− 0.131
Striacosta albicosta148640.588.4840.4410.5081.020.002− 0.106
M. separata147341.078.1540.7310.0581.810.004− 0.104
R. prominens146440.778.2641.129.8381.89− 0.004− 0.086
Catocala deuteronympha147841.277.8440.5210.3581.790.009− 0.137
Asota plana145740.288.2340.7010.7780.98− 0.005− 0.133



rRNA
E. phalonia221241.504.7443.4910.2684.99− 0.023− 0.367
P. flavescens219841.314.7344.049.9285.35− 0.032− 0.354
O. lunifer215741.964.8240.1913.0382.150.022− 0.460
H. cunea223442.084.9242.7510.2584.83− 0.008− 0.351
A. formosae216338.934.7244.8511.5183.77− 0.071− 0.418
Doa sp.214641.614.9842.5410.8584.15− 0.01− 0.370
N. arctata albofasciata205840.914.8143.3410.9384.25− 0.028− 0.388
L. dispar215042.794.7941.8110.6084.600.012− 0.377
G. menyuanensis231141894.2842.8410.9984.73− 0.011− 0.439
H. armigera215541.624.9143.3410.1184.96− 0.020− 0.345
H. punctigera216141.874.8143.359.9485.22− 0.017− 0.347
H. zea218441.574.8543.779.7985.34− 0.025− 0.337
A. ipsilon213341.495.0643.459.9884.94− 0.023− 0.327
A. segetum215541.484.9143.5710.0285.05− 0.024− 0.341
S. litura221842.654.7342.3410.2884.990.004− 0.370
S. exigua216542.354.6642.6310.3484.98− 0.003− 0.378
C. agnata215639.935.1044.5710.3884.50− 0.054− 0.341
C. limbirena208939.395.1245.2810.1984.67− 0.069− 0.331
E. adulatricoides216941.534.9742.8710.6084.40− 0.015− 0.360
S. inferens214641.514.7542.4511.2783.96− 0.0110.406
N. pronuba214440.954.8944.1210.0285.07− 0.037− 0.347
A. psi214341.624.9941.9011.4783.52− 0.003− 0.393
Striacosta albicosta214742.064.8441.9211.1883.980.002− 0.396
M. separata219841.314.7344.049.9285.35− 0032− 0.354
R. prominens217842.514.7742.889.8285.39− 0.004− 0.345
Catocala deuteronympha219340.084.6944.6410.5784.72− 0.053− 0.385
Asota plana217241.434.8343.1810.5484.61− 0.020− 0.371



A + T-rich region
E. phalonia33643.452.9749.703.8693.15− 0.067− 0.130
P. flavescens54142.142.2249.725.9191.86− 0.083− 0.454
O. lunifer31944.501.6048.905.0093.40− 0.047− 0.524
H. cunea35745.661.1249.303.9294.96− 0.038− 0.556
A. formosae48442.972.8949.794.3392.76− 0.073− 0.2
Doa sp.33243.070.6053.612.7196.68− 0.109− 0.636
N. arctata albofasciata40143.391.9950.374.2393.76− 0.074− 0.36
L. dispar43545.291.6150.802.3096.09− 0.057− 0.176
G. menyuanensis44943.652.4549.674.2393.32− 0.065− 0.266
H. armigera32944.371.2150.753.6495.12− 0.067− 0.5
H. punctigera32845.121.2151.212.4396.33− 0.063− 0.333
H. zea32945.281.2151.062.4396.35− 0.059− 0.033
A. ipsilon34446.221.4548.833.4895.05− 0.0270.411
A. segetum34546.080.8648.404.6394.48− 0.024− 0.684
S. litura32646.632.1547.243.9993.87− 0.006− 0.300
S. exigua33542.982.6850.443.8893.42− 0.079− 0.181
C. agnata33446.701.4946.705.0893.40− 0.545
C. limbirena42246.912.3646.204.5093.110.007− 0.473
E. adulatricoides34146.042.6346.624.6992.66− 0.006− 0.28
S. inferens31443.311.2752.542.8695.85− 0.095− 0.384
N. pronuba33044.242.1249.094.5493.33− 0.051− 0.363
A. psi35444.632.5447.745.0892.37− 0.033− 0.333
Striacosta albicosta38543.121.3049.875.7192.99− 0.073− 0.629
M. separata37244.622.4249.733.2394.35− 0.054− 0.143
R. prominens34344.022.3349.591.1193.58− 0.0590.333
Catocala deuteronympha39042.302.8247.946.9290.24− 0.062− 0.421
Asota plana33145.611.2048.944.2294.55− 0.035− 0.55
Nucleotide compositions and skewness in superfamily Noctuoidea mitogenomes.

Transfer RNA genes

The E. phalonia mitochondrial genome had 22 tRNA genes that were varied in length from 57 to 68 bp with A + T content of 81.39%. Among these, 21 tRNA genes presented the putative secondary structure detected for mitochondrial tRNAs of other species (Fig. 2); however tRNASer1 presented unusual secondary structure lacking a dihydrouridine (DHU). This feature has been witnessed in hexapods and metazoan mitogenomes [21], [31], [32]. A total of 20 mismatched base pairs were observed to occur in 15 of 22 tRNAs genes such as in tRNA [1 U-G], tRNA [1 U-G], tRNA [1 U-G], tRNA [1 U-G], tRNA [1G-U], tRNA [1 U-G], tRNA [1G-U], tRNA [1 U-U], tRNA [1 U-G], tRNA [1 G-U], tRNA [1 G-U and 1 U-G], tRNA [1 G-U], tRNA [2 G-U], tRNA [2 U-G and 2 G-U] and tRNA [1 G-U]. In these mismatches, 9 were U-G and 10 were G-U pairs, which formed a weak bond. The tRNA gene contained a U-U mismatch. The mismatches were commonly located in the acceptor, DHU and anticodon arms, but tRNASer1 gene showed the U-G mismatch on the TѱC stem.
Fig. 2

Putative secondary structures of 22 tRNAs found in mitochondrial genome of Eudocima phalonia. The putative secondary structures were produced using MITOS online tool.

Putative secondary structures of 22 tRNAs found in mitochondrial genome of Eudocima phalonia. The putative secondary structures were produced using MITOS online tool.

Ribosomal RNA genes

As in other insect mitogenomes, two rRNA genes (rrnS and rrnL) were found in E. phalonia. The rrnL was located between positions 12,673 and 14,153 with a total of 1391 bp in length. The total A + T content of the 2 rRNA genes was 84.39% as analysed in this study. The rrnS gene was located between positions 14,217 and 15,037 with 821 bp in length. The A + T content of the rrnS gene was 85.99%.

The A + T-rich region

The A + T-rich region of E. phalonia had 336 bp (position 15,041–15,376) with 93.15% AT and was positioned in the middle of the rrnS and trnM genes. The length of A + T-rich region of E. phalonia was higher than that of H. zea (329 bp), H. punctigera (326), H. armigera (327 bp), C. agnata (333 bp), S. exigua (335 bp), S. litura (325 bp), A. segetum (331 bp); it was shorter than that of Thitarodes renzhiensis (1366 bp), Thitarodes yunnanensis (977 bp), Bombyx mandarina (494), A. pernyi (552), and B. mori (496) Catocala deuteronympha (424 bp), N. arctata albofasciata (400 bp), E. adulatricoides (341 bp) and M. separata (372 bp). The A + T-rich region consisted of highest A + T content (93.15%) of the entire mitogenome of E. phalonia, which was within the range reported for other lepidopteran insects (Table 4). The A + T-rich region is commonly known to regulate transcription and replication in both vertebrate and invertebrate mitochondrial genomes [33]. Sequence analysis of the E. phalonia A + T-rich region revealed the presence of several conserved regions including two “ATAGA” motifs and a 19-bp poly-T stretch (from 15,065 bp to 15,083 bp) at the downstream of the rrnS gene. The poly-T stretch is commonly conserved in ditrysian lepidopteran mitochondrial genomes [20] and, has been represented as the origin of minority or light strand DNA replication [6]. Moreover, we have found microsatellite-like (TA)4, (TA)6 and two (TA)7 elements within the A + T-rich region of the E. phalonia mitochondrial genome. The occurrence of multiple tandem repeat elements is a distinctive feature of the insect A + T-rich region. We have discovered three large, 40 bp tandem repeat and two 25 bp repeat elements in E. phalonia mitochondrial genome (Fig. 3). Similar observations were also reported (Supplementary file 1) in other lepidopterans, even though the function of this region remains to be validated. A total of 27 moth species, representing seven Noctuoidea families (Notodontidae, Arctiidae, Doidae, Lymantriidae, Erebidae, Noctuidae and Euteliidae) were included for phylogenetic analysis. In this study, the phylogenetic trees were reconstructed using two datasets with maximum likelihood method (Fig. 4a, b). The results showed three major evolutionary groups; the first evolutionary group was family Notodontidae, which is a well-supported branch (BP ≥ 99). This evolutionary group consisted of two genera, Phalera and Ochrogaster. The second evolutionary group consisted of families Erebidae, Lymantriidae and Arctiidae with bootstrap proportion (BP ≥ 80). The third evolutionary group (BP ≥ 74) contained family Noctuidae as a pest group (Fig. 4a). This evolutionary group contained five subfamilies Noctuinae, Heliothinae, Hadeninae, Acronictinae and Plusiinae. Within the third evolutionary group subfamily Acronictinae was separated from subfamily Heliothinae with low bootstrap value (BP ≥ 42). The second and third evolutionary groups revealed that the superfamily Noctuoidea is a monophyletic group.
Fig. 4

A) Phylogenetic tree of superfamily Noctuoidea moths. The phylogeny was reconstructed using 13 PCGs of the 27 species. B) The phylogeny was reconstructed using 13 PCGs of the 27 species plus 2 rRNAs.

a) Motifs and tandem repeats found in the A + T-rich region of Eudocima phalonia mitochondrial genome. These are indicated by specific colours and highlights. Two motifs (ATAGA) are shown in bold with green highlights. A 19 bp poly stretch (poly T) is shown in red colour with grey highlight. Microsatellites ((TA)4, (TA)6, two (TA)7) are shown in red. Three 40 bp tandem repeats are shown in blue, yellow and green highlights. Two 25 bp tandem repeats are shown in violet and grey highlights. b) Sequence alignment of the intergenic spacer region between trnS2 (TGA) and ND1 of 17 Noctuoidea moth species. The aligned nucleotides indicate the conserved motif ‘ATACTAA’. A) Phylogenetic tree of superfamily Noctuoidea moths. The phylogeny was reconstructed using 13 PCGs of the 27 species. B) The phylogeny was reconstructed using 13 PCGs of the 27 species plus 2 rRNAs. Relative synonymous codon usage (RSCU) in Noctuoidea mtDNAs. The species name represents the superfamily Noctuoidea. Mitogenomes have been used predominantly to analyse phylogeny and evolution. Not many mitochondrial genome sequences are available just now. At present there are only 27 species from superfamily Noctuoidea moths whose complete mitogenomes have been sequenced and deposited in Genbank. The complete mitochondrial genome of E. phalonia with a circular molecule 15,575 bp was successfully determined; it is longer than those of H. cunea (15,481), A. formosae (14,463), N. arctata albofasciata (15,431), H. armigera (15,347), H. punctigera (15,382), A. ipsilon (15,377), A. segetum (15,378), S. litura (15,374), S. exigua (15,365), C. agnata (15,261), C. limbirena (15,306), E. adulatricoides (15,360), S. inferens (15,413), N. pronuba (15,315), A. psi (15,350), M. separata (15,329), R. prominens (15,343) and shorter than those of Catocala deuteronympha (15,671), G. menyuanensis (15,770), P. flavescens (15,659) and O. lunifer (15,593). The gene order and orientation of the mitochondrial genes of E. phalonia were similar to other lepidopteran moths. The position of trnM gene was found from 1 bp to 68 bp in other Noctuoid moths [9], [19], [20], [22], [23], [24], [25], [26], [27], [28], [29], [34], [35], [36], [37]. However, in E. phalonia trnM gene was found from 15,500 bp to 15,566 bp. The A + T content of the E. phalonia was lower than the other Noctuoidea moths such as H. punctigera, A. ipsilon, S. litura, C. agnata, N. pronuba and R. prominens. These Noctuoid mitogenomes, including E. phalonia, present negative AT- and GC skew values. The incomplete stop codon of a single T has been found in E. phalonia two PCG genes. The incomplete stop codon had been commonly found in numerous invertebrate mitochondrial genes [38], [45], [46]. The relative synonymous codon usage shows wide resemblance with other lepidopteran mitogenomes [28]. Complete mitochondrial genomes are effective markers for deep-level phylogenetic analysis in the Lepidoptera. In this study Lymantriidae (L. dispar and G. menyuanensis), Arctiidae (H. cunea and A. formosae) and Erebidae (E. phalonia, N. arctata albofasciata and Catocala deuteronympha) were clustered in one node of the phylogenetic tree comprising Lymantriidae, Arctiidae and Quadrifine Noctuidae; these are consistent with the morphological data [39]. Sequencing and characterization of mitogenomes of more Noctuoid moths will help establish the deep level phylogeny of this particular superfamily. The following are the supplementary data related to this article.

Supplementary file 1

Alignment of A + T-rich regions.

Transparency document

Transparency document.
  36 in total

1.  MEGA6: Molecular Evolutionary Genetics Analysis version 6.0.

Authors:  Koichiro Tamura; Glen Stecher; Daniel Peterson; Alan Filipski; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2013-10-16       Impact factor: 16.240

2.  Tandem repeats finder: a program to analyze DNA sequences.

Authors:  G Benson
Journal:  Nucleic Acids Res       Date:  1999-01-15       Impact factor: 16.971

3.  Mitochondrial genome of the cotton bollworm Helicoverpa armigera (Lepidoptera: Noctuidae) and comparison with other Lepidopterans.

Authors:  Jiao Yin; Gui-Yun Hong; Ai-Min Wang; Ya-Zhong Cao; Zhao-Jun Wei
Journal:  Mitochondrial DNA       Date:  2010-10

4.  The complete mitochondrial genome of the fall webworm, Hyphantria cunea (Lepidoptera: Arctiidae).

Authors:  Fang Liao; Lin Wang; Song Wu; Yu-Ping Li; Lei Zhao; Guo-Ming Huang; Chun-Jing Niu; Yan-Qun Liu; Ming-Gang Li
Journal:  Int J Biol Sci       Date:  2010-03-29       Impact factor: 6.580

5.  The complete mitochondrial genome of the tobacco hornworm, Manduca sexta, (Insecta: Lepidoptera: Sphingidae), and an examination of mitochondrial gene variability within butterflies and moths.

Authors:  Stephen L Cameron; Michael F Whiting
Journal:  Gene       Date:  2007-12-11       Impact factor: 3.688

6.  MITOS: improved de novo metazoan mitochondrial genome annotation.

Authors:  Matthias Bernt; Alexander Donath; Frank Jühling; Fabian Externbrink; Catherine Florentz; Guido Fritzsch; Joern Pütz; Martin Middendorf; Peter F Stadler
Journal:  Mol Phylogenet Evol       Date:  2012-09-07       Impact factor: 4.286

7.  The complete mitochondrial genome of the beet armyworm Spodoptera exigua (Hübner) (Lepodiptera: Noctuidae).

Authors:  Qiu-Ling Wu; Ya-Jun Gong; Yun Gu; Shu-Jun Wei
Journal:  Mitochondrial DNA       Date:  2012-09-07

8.  The complete mitochondrial genomes of two ghost moths, Thitarodes renzhiensis and Thitarodes yunnanensis: the ancestral gene arrangement in Lepidoptera.

Authors:  Yong-Qiang Cao; Chuan Ma; Ji-Yue Chen; Da-Rong Yang
Journal:  BMC Genomics       Date:  2012-06-22       Impact factor: 3.969

9.  Characterization of the complete mitochondrial genome of the giant silkworm moth, Eriogyna pyretorum (Lepidoptera: Saturniidae).

Authors:  Shao-Tong Jiang; Gui-Yun Hong; Miao Yu; Na Li; Ying Yang; Yan-Qun Liu; Zhao-Jun Wei
Journal:  Int J Biol Sci       Date:  2009-05-22       Impact factor: 6.580

10.  The complete mitochondrial genome of the bag-shelter moth Ochrogaster lunifer (Lepidoptera, Notodontidae).

Authors:  Paola Salvato; Mauro Simonato; Andrea Battisti; Enrico Negrisolo
Journal:  BMC Genomics       Date:  2008-07-15       Impact factor: 3.969

View more
  3 in total

1.  The complete mitochondrial genome of Vanessa indica and phylogenetic analyses of the family Nymphalidae.

Authors:  Youxue Lu; Naiyi Liu; Liuxiang Xu; Jie Fang; Shuyan Wang
Journal:  Genes Genomics       Date:  2018-06-14       Impact factor: 1.839

2.  A supertree of Northern European macromoths.

Authors:  Robert B Davis; Erki Õunap; Toomas Tammaru
Journal:  PLoS One       Date:  2022-02-18       Impact factor: 3.240

3.  Comparative mitochondrial genome analysis of Eudocima salaminia (Cramer, 1777) (Lepidoptera: Noctuoidea), novel gene rearrangement and phylogenetic relationship within the superfamily Noctuoidea.

Authors:  Muzafar Riyaz; Rauf Ahmad Shah; Ignacimuthu Savarimuthu; Sivasankaran Kuppusamy
Journal:  Mol Biol Rep       Date:  2021-06-09       Impact factor: 2.316

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.