Literature DB >> 29755260

Characterization of the complete mitochondrial DNA of Theretra japonica and its phylogenetic position within the Sphingidae (Lepidoptera, Sphingidae).

Jun Li1, Rui-Rui Lin1, Yao-Yao Zhang1, Kun-Jie Hu1, Ya-Qi Zhao1, Yan Li1, Zhuo-Ran Huang1, Xu Zhang1, Xue-Xia Geng1, Jian-Hua Ding1.   

Abstract

In the present study, the complete mitogenome of Theretra japonica was sequenced and compared with other sequenced mitogenomes of Sphingidae species. The mitogenome of T. japonica, containing 37 genes (13 protein-coding genes, 22 tRNA genes, and two rRNA genes) and a region rich in adenine and thymine (AT-rich region), is a circular molecule with 15,399 base pairs (bp) in length. The order and orientation of the genes in the mitogenome are similar to those of other sequenced mitogenomes of Sphingidae species. All 13 protein-coding genes (PCGs) are initiated by ATN codons except for the cytochrome C oxidase subunit 1 gene (cox1) which is initiated by the codon CGA as observed in other lepidopteran insects. Cytochrome C oxidase subunit 2 gene (cox2) has the incomplete termination codon T and NADH dehydrogenase subunit 1 gene (nad1) terminates with TAG while the remainder terminates with TAA. Additionally, the codon distributions of the 13 PCGs revealed that Ile and Leu2 are the most frequently used codon families and codons CGG, CGC, CCG, CAG, and AGG are absent. The 431 bp AT-rich region includes the motif ATAGA followed by a 23 bp poly-T stretch, short tandem repeats (STRs) of TC and TA, two copies of a 28 bp repeat 'ATTAAATTAATAAATTAA TATATTAATA' and a poly-A element. Phylogenetic analyses within Sphingidae confirmed that T. japonica belongs to the Macroglossinae and showed that the phylogenetic relationship of T. japonica is closer to Ampelophaga rubiginosa than Daphnis nerii. Phylogenetic analyses within Theretra demonstrate that T. japonica, T. jugurtha, T. suffusa, and T. capensis are clustered into one clade.

Entities:  

Keywords:  Lepidoptera; Sphingidae; Theretra japonica; mitogenome

Year:  2018        PMID: 29755260      PMCID: PMC5945705          DOI: 10.3897/zookeys.754.23404

Source DB:  PubMed          Journal:  Zookeys        ISSN: 1313-2970            Impact factor:   1.546


Introduction

The () moths are commonly known as hawk moths, sphinx moths, or hornworms and include 1,463 species (Nieukerken et al. 2011). , known as a pest, is widely distributed in Korea, Japan, Russia, and China. Its larva eats leaves and is harmful to many important ornamental plants, such as , , , and etc. (Zhu and Wang 1997; Shirotsuka and Yano 2012). Mitochondrial DNA sequences have been widely used to study the molecular evolution of insects due to protein-coding genes (PCGs) sequence conservatism, maternal inheritance, and rapid evolution (Cameron 2014). In , however, only the complete mitochondrial DNA sequences of (KU934302) (Kim et al. 2016), (KC470083) (Kim et al. 2013), (EU286785) (Cameron and Whiting 2008), (KT153024) (Xin et al. 2017), (https://doi.org/10.1139/gen-2016-0058) (Dai et al. 2017), and (https://doi.org/10.1371/journal.pone.0178773.s001) (Sun et al. 2017) have been reported up to now. Among these six species, , , and belong to the subfamily , while and belong to the . More mitogenome sequences from will be helpful to discover the interfamilial phylogenetic relationships. is taxonomically classified into the subfamily according to its morphology (Zhu and Wang 1997), but its mitogenome has not yet been reported, nor a phylogenetic analysis based on this. In this study polymerase chain reaction (PCR) amplification, DNA sequencing, and overlapped fragments assembling methods were used to determine the complete mitogenome of . The characteristics of the mitogenome were also analyzed and a phylogeny was constructed. These will be helpful to understand the evolutionary position of within .

Materials and methods

Specimens sampling and DNA extraction

The specimen was collected from Xiangshan mountain, Huaibei city, Anhui province, China (), and then was preserved in -20 °C refrigerator. Total genomic DNA was extracted from the abdomen of the moth (voucher number TJ20171011) using Ezup Column Animal Genomic DNA Purification Kit (Sangon Biotech, China) following the manufacturer’s instructions. The extracted DNA samPageBreakples were stored at -20°C. The specimen and the template DNA are respectively deposited in Specimens Room within the Human and Animal Genetics Laboratory, School of Life Sciences, Huaibei Normal University.

PCR amplification and DNA sequencing

The mitochondrial DNA fragments were amplified by PCR method and the total genomic DNAs were used as template. PCR primers were designed according to the conservative sequences of mitochondrial DNA of insects and showed in Table 1. The overlapping fragments were amplified using PrimeSTAR® GXL DNA Polymerase (Takara, China) according to the manufacturer’s instructions. PCR reaction mixture (25 μL in total) included 5 μL 5× PrimeSTAR GXL Buffer, 2 μL dNTP mixture (2.5 mM each), 2.5 μL primer (10 μM) each, 1 μL PrimeSTAR GXL DNA Polymerase, 1 μL template DNA (100 ng/μL) and 11 μL double distillated water. PCR reaction was performed in Eppendoff Mastercycler gradient PCR instrument under the following conditions: 30 sec at 98 °C; followed by 30 cycles of 15 sec at 98 °C, 15 sec at 40–55 °C and 2–8 min at 68 °C; and at last 10 min at 68 °C. PCR productions were confirmed by 1% (w/v) agarose gel electrophoresis and sequenced at least three times.
Table 1.

Details of the primers used to amplify the mitochondrial DNA of .

Primer name Orientation Annealing position (bp) Nucleotide sequence (5'-3') PCR length
Q1FF1314-1336AAACTAATAATCTTCAAAATTAT
Q1RR6236-6213AATATTAATGGAATTTAACCACTA4923
Q2FF6193-6216TAAGCTGCTAACTTAATTTTTAGT
Q2RR9637-9617GTTTCAATAATCCGAACTCAT3445
Q3FF8601-8618CGTCTATGCAATCGCTCA
Q3RR12319-12302GCATTACTTGGAGGGTTG3719
Q4FF11600-11620TCCCTATGTTATTACAGGACA
Q4RR14809-14791CCAGCAGTTGCGGTTATAC3210
Q5FF14637-14659TAATAGGGTATCTAATCCTAGTT
Q5RR1400-1378ATATAAAATTGCAAATTTTAAGG2163
Details of the primers used to amplify the mitochondrial DNA of .

Sequences assembly, annotation and analysis

The overlapping fragments were assembled into a complete linear mitochondria DNA sequence using the DNAStar package (DNAStar Inc. Madison, WI, USA), and the mitogenome was annotated using MITOS2 (Bernt et al. 2013). The PCGs and ribosomal RNA (rRNA) genes were verified by NCBI BLAST. The transfer RNA (tRNA) genes were verified by tRNAscan-SE2.0 (Lowe and Chan 2016; Lowe and Eddy 1997). Barcoding analysis was performed in Bold Systems v4 using cox1 as the marker following the recomPageBreakmendations of Botera-Castro et al (2016). The protein sequences were translated with the invertebrate mitochondrial genetic code. Nucleotides composition and codon usage were counted using MEGA 7.0. The bias of nucleotide composition was measured as AT skew (AT skew = (A-T)/(A+T)) and GC skew (GC skew = (G-C)/(G+C)) respectively.

Phylogenetic analyses

To clarify the phylogenetic position of within the , all published complete mitogenomes of members of the were collected and their 13-protein amino acid (AA) sequences were incorporated together for alignment and phylogenetic tree construction. Sequences were aligned using ClustalX 2.1 (Larkin et al. 2007) and phylogenetic trees were constructed using the Neighbor-Joining (NJ) and Maximum likelihood analysis (ML) methods with bootstrap test of 1000 replications by MEGA 7.0 (Kumar et al. 2016). (, AF149768) and (, AY242996) were utilized as outgroups. The gaps or missing data subsets were completely deleted. Before constructing the ML phylogeny, Mega 7.0 was utilized to find the best model (mtREV + F + G). The NJ phylogeny was constructed using Poisson model and bootstrap for 1000 times. The parameter of Rates among Sites was set as Gamma distributed (G) and value as 13. The parameter for Pattern among Lineages was set as homogeneous. The cox1 barcodes (481 barcodes) were gathered for the genus in BOLD system v4 to construct phylogeny. Those barcodes without gaps or missing nucleotides (total 658 bp in size) were selected to construct ML phylogenetic tree, and finally 285 barcodes (41 species) were utilized. The cox1 sequence of (AF149768) was used as outgroup. The best model was GTR + G + I. To infer nodal support, bootstrapping was conducted 1000 times.

Results and discussion

Genome organization and nucleotides composition

The complete mitogenome sequence of (MG655620) is 15,399 base pairs (bp) in length, shorter than but longer than the other 5 species of . It contains 13 PCGs, 22 tRNAs genes, two rRNAs genes, and an AT-rich region with a length of 431 bp (Fig. 1, Suppl. material 2). Among the 37 genes, 23 genes are encoded by the majority-coding strand (J-strand) and 14 genes are encoded by the minority-coding strand (N-strand). The gene order and orientation are consistent to the other species.
Figure 1.

The schematic illustration for mitogenome of . Gene order and positions are shown. cox1, cox2, and cox3 refer to the cytochrome c oxidase subunits; cob refers to cytochrome b; nad1-nad6 refers to NADH dehydrogenase components; rrnL and rrnS refer to ribosomal RNAs. The bold lines on outer or inner ring represent that the genes lie in the majority-coding strand (J-strand) or the minority-coding strand (N-strand).

The schematic illustration for mitogenome of . Gene order and positions are shown. cox1, cox2, and cox3 refer to the cytochrome c oxidase subunits; cob refers to cytochrome b; nad1-nad6 refers to NADH dehydrogenase components; rrnL and rrnS refer to ribosomal RNAs. The bold lines on outer or inner ring represent that the genes lie in the majority-coding strand (J-strand) or the minority-coding strand (N-strand). The nucleotide composition in J-strand of mitogenome is as follows: 6,331 bp (41.11%) A, 6,043 bp (39.24%) T, 1,883 (12.23%) C, and 1,142 bp (7.42%) for G. A+T accounts for 80.36%, which is slightly higher than PageBreak (80.29%) but lower than (81.79%), (81.17%), (81.79%), (81.49%), and (81.5%) (Sun et al. 2017; Xin et al. 2017; Dai et al. 2016). The AT skew and GC skew of J-strand are 0.023 and -0.245 respectively (Table 2). The GC skews of the seven species are all negative and the AT skews are positive except for (-0.005) and (-0.001) (Sun et al. 2017; Xin et al. 2017; Dai et al. 2016).
Table 2.

Base composition of protein-coding, tRNA and rRNA genes, and A+T rich region of mitogenome.

Genes or regions Size (bp) Base composition (%) A+T (%) AT skewness GC skewness
A T C G
nad2 101437.8747.149.665.3385.01-0.109-0.289
cox1 153632.4238.6115.6313.3571.03-0.087-0.079
cox2 68537.9639.2713.149.6477.23-0.017-0.154
atp8 16544.8544.249.091.8289.090.007-0.667
atp6 67836.2841.8914.167.6778.17-0.072-0.297
cox3 79234.2239.5214.5211.7473.74-0.072-0.106
nad3 35436.4443.7912.996.7880.23-0.092-0.314
nad5 175832.8248.815.9212.4681.63-0.1960.356
nad4 133233.6348.426.3811.5682.06-0.1800.289
nad4L 29130.9352.583.7812.7183.06-0.2590.542
nad6 53140.4945.208.665.6585.69-0.056-0.210
cob 114934.6440.8214.4510.1075.46-0.082-0.177
nad1 93630.0248.087.4814.4278.10-0.2310.317
Total1122134.5044.3810.5310.5978.88-0.1250.003
tRNA 146541.7739.328.0510.8581.090.0300.148
rRNA 204841.5042.245.0811.1883.74-0.0090.375
AT-rich region43141.5042.245.0811.1893.04-0.007-0.400
Complete mitogenome1539941.1139.2412.237.4280.360.023-0.245
In the mitogenome of , there are 13 gene overlaps and 15 intergenic spacers (Suppl. material 2). The 13 gene overlaps range from 1 to 17 bp in size and the longest is present between trnF and nad5. The 15 intergenic spacers range from 1 to 88 bp in size, and the longest is present between trnQ and nad2, which is also founded PageBreakin the other six species. However, the intergenic spacer between trnQ and nad2 of is longer than that of the other six species, which range from 51bp in to 56 bp in . Base composition of protein-coding, tRNA and rRNA genes, and A+T rich region of mitogenome.

Protein-coding genes and codon usage

The mitogenome contains 13 PCGs as expected with a total of 11,221 bp in size. All the PCGs are initiated with ATN codons, except for cox1, which uses CGA as the initiation codon. Most PCGs are terminated with TAA codon while nad1 uses TGA as termination codon. And yet, cox2 has an incomplete termination codon ‘T’. The incomplete termination codon ‘T’ or ‘TA’ could become TAA by posttranscriptional polyadenylation (Ojala et al. 1981). In the 13 PCGs, only the AT skewness of atp8 is positive (0.007) while the others are negative. The GC skewness of nad1, nad4, nad4L, and nad5 are positive and all lie in the N-strand. The amino acids (AAs) components and their codon usage in the PCGs of mitogenome were also analyzed. The results reveal that two codon families (Ile and Leu2) are more than 100 codons per thousand codons (CDpT), six codon families (Asn, Gly, Met, Phe, Ser2, and Tyr) are between 50 CDpT and 100 CDpT, and the other fourteen codon families are less than 50 CDpT (Fig. 2). AAs codon usage is assessed by values of the relative synonymous codon usage (RSCU) and five codons PageBreak(CGG, CGC, CCG, CAG, and AGG) are absent in the PCGs of (Fig. 3). It is found that the codons with high G and C content are likely not to be favored in lepidopteran insects (Dai et al. 2016).
Figure 2.

The amino acids usage in the mitogenome of . CDpT = codons per thousand codons.

Figure 3.

The relative synonymous codon usage (RSCU) in the mitogenome of . The codons listed up the columns are absent in .

The amino acids usage in the mitogenome of . CDpT = codons per thousand codons. The relative synonymous codon usage (RSCU) in the mitogenome of . The codons listed up the columns are absent in .

Ribosomal RNA genes and transfer RNA genes.

The 22 rRNA genes of range from 64 bp to 71 bp in size and totally comprise 1,465 bp of the whole mitogenome. Of these genes, 14 are encoded in J-strand and 8 in N-strand just as other moths. The predicted secondary structures of the tRNAs are shown in Figure 4. All the tRNA genes except for trnS1 could be folded into the typical cloverleaf secondary structure (Fig. 4). The A+T content of the 22 tRNAs is 83.74%, with a positive AT skew (0.030) and GC skew (0.148).
Figure 4.

The cloverleaf secondary structure of transfer RNA of .

The cloverleaf secondary structure of transfer RNA of . Just as the other species there are two rRNA genes in with a total length of 2,048 bp. The large ribosomal gene (rrnL) locates between trnL1 and trnV, with a length of 1,284 bp whereas the small ribosomal gene (rrnS) locates between trnV and the A+T-rich region with a length of 764 bp. The AT skew is slightly negative (−0.009), but the GC skew is strongly positive (0.375). The total A+T content of the rRNA genes (83.74%) is higher than that of total tRNA genes (81.09%) and total PCG genes (79.10%).

A+T rich region

The A+T rich region locates between rrnS and trnM in with 431 bp in length and serves as the initiation of mitochondrial replication in both vertebrates and invertebrates (Cameron 2014). This region contains the highest A+T content (93.04%) in the mitogenome of . As the other lepidopteran mitogenomes, the A+T rich region of has some conserved structures including the motif ‘ATAGA’ followed by a 23 bp poly-T stretch, a short tandem repeats (STRs) of TC and TA, two copies of a 28 bp repeat ‘ATTAAATTAATAAATTAATATATPageBreakTAATA’, and a poly-A element upstream of the trnM (Fig. 5). The poly-T element may be associated with mitochondrial transcription controlling and replication initiation.
Figure 5.

Features of the A+T-rich region of . The ATATG motif is shaded. The polyT stretch is underlined while the poly-A stretch is double underlined. The TA and GC repeats sequence are indicated by dotted underlining. The 28 bp repeats ‘ATTAAATTAATAAATTAATATATTAATA’ are labeled with wave underlining.

Features of the A+T-rich region of . The ATATG motif is shaded. The polyT stretch is underlined while the poly-A stretch is double underlined. The TA and GC repeats sequence are indicated by dotted underlining. The 28 bp repeats ‘ATTAAATTAATAAATTAATATATTAATA’ are labeled with wave underlining.

Barcoding analysis and phylogenetic analysis

By using the identified cox1 barcode and aligning it with the three barcodes of deposited in Bold system v4 we found that the similarities are 100% with GBMIN88089-17 (KX440686, 541 bp) and SOWD617-06 (JN678612, 658 bp), and 99.85% with GBMIN88088-17 (KT988392, 658 bp) respectively. The only mutation of GBMIN88088-17 takes place at the third position nucleotide of a codon for Leucine which mutates from TTA to TTG, but the mutation is synonymous because it does not change its AA code (Fig. 6). SOWD617-06 was collected from Sichuan province, China (Wilson et al. 2011) and the positions where the two others were collected are unspecified.
Figure 6.

Barcoding analysis of .

Barcoding analysis of . Phylogenetic analyses were firstly based on the sequences of 13 PCGs of seven mitogenomes using NJ and ML Methods. The phylogenetic trees constructed by NJ and ML are consistent with high intermediate bootstrap values (Fig. 7). The includes three subfamilies, , , and , but no complete mitogenome from the has been published so far. Among these species of , , and belong to and the other four belong to . and are clustered together into a monophyletic group and is phylogenetically closer to than . The phylogenetic tree of these seven species is (( + ) + ) + (() + ( + )) which is also partly supported by Kawahara et al (2009). ML phylogenetic tree constructed for genus demonstrates that , , , and are clustered into one clade (Suppl. material 1).
Figure 7.

Phylogenetic analysis. Phylogenetic tree constructed using NJ and ML methods based on the amino acid sequences of 13 PCGs of 7 species with (: ) and (: ) as outgroups. The support values at the nodes represent bootstrap values for NJ and ML respectively.

Phylogenetic analysis. Phylogenetic tree constructed using NJ and ML methods based on the amino acid sequences of 13 PCGs of 7 species with (: ) and (: ) as outgroups. The support values at the nodes represent bootstrap values for NJ and ML respectively.
  16 in total

1.  Coincidental intraguild predation by caterpillars on spider mites.

Authors:  Kanako Shirotsuka; Shuichi Yano
Journal:  Exp Appl Acarol       Date:  2012-01-29       Impact factor: 2.132

Review 2.  Insect mitochondrial genomics: implications for evolution and phylogeny.

Authors:  Stephen L Cameron
Journal:  Annu Rev Entomol       Date:  2013-10-16       Impact factor: 19.686

3.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

Authors:  T M Lowe; S R Eddy
Journal:  Nucleic Acids Res       Date:  1997-03-01       Impact factor: 16.971

4.  MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets.

Authors:  Sudhir Kumar; Glen Stecher; Koichiro Tamura
Journal:  Mol Biol Evol       Date:  2016-03-22       Impact factor: 16.240

5.  The complete mitochondrial genome of the tobacco hornworm, Manduca sexta, (Insecta: Lepidoptera: Sphingidae), and an examination of mitochondrial gene variability within butterflies and moths.

Authors:  Stephen L Cameron; Michael F Whiting
Journal:  Gene       Date:  2007-12-11       Impact factor: 3.688

6.  MITOS: improved de novo metazoan mitochondrial genome annotation.

Authors:  Matthias Bernt; Alexander Donath; Frank Jühling; Fabian Externbrink; Catherine Florentz; Guido Fritzsch; Joern Pütz; Martin Middendorf; Peter F Stadler
Journal:  Mol Phylogenet Evol       Date:  2012-09-07       Impact factor: 4.286

7.  Thrice better than once: quality control guidelines to validate new mitogenomes.

Authors:  Fidel Botero-Castro; Frédéric Delsuc; Emmanuel J P Douzery
Journal:  Mitochondrial DNA A DNA Mapp Seq Anal       Date:  2015-09-04       Impact factor: 1.514

8.  Phylogeny and biogeography of hawkmoths (Lepidoptera: Sphingidae): evidence from five nuclear genes.

Authors:  Akito Y Kawahara; Andre A Mignault; Jerome C Regier; Ian J Kitching; Charles Mitter
Journal:  PLoS One       Date:  2009-05-28       Impact factor: 3.240

9.  tRNAscan-SE On-line: integrating search and context for analysis of transfer RNA genes.

Authors:  Todd M Lowe; Patricia P Chan
Journal:  Nucleic Acids Res       Date:  2016-05-12       Impact factor: 16.971

10.  Mitochondrial Genomes of Two Bombycoidea Insects and Implications for Their Phylogeny.

Authors:  Zhao-Zhe Xin; Xiao-Yu Zhu; Ying Wang; Hua-Bin Zhang; Dai-Zhen Zhang; Chun-Lin Zhou; Bo-Ping Tang; Qiu-Ning Liu
Journal:  Sci Rep       Date:  2017-07-26       Impact factor: 4.379

View more
  1 in total

1.  The complete mitogenome sequence of the hawk moth, Theretra latreillii subsp. lucasii (Lepidoptera: Sphingidae) from Zhejiang Province, China.

Authors:  Qiaoying Lu; Hongwei Yao; Jinming Zhang; Hongxing Xu; Caiying Jiang
Journal:  Mitochondrial DNA B Resour       Date:  2021-06-07       Impact factor: 0.658

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.