Literature DB >> 30655597

The first complete mitochondrial genome of marigold pest thrips, Neohydatothrips samayunkur (Sericothripinae) and comparative analysis.

Vikas Kumar1, Kaomud Tyagi2, Shantanu Kundu1, Rajasree Chakraborty1, Devkant Singha1, Kailash Chandra1.   

Abstract

Complete mitogenomes from the order Thysanoptera are limited to representatives of the subfamily Thripinae. Therefore, in the present study, we sequenced the mitochondrial genome of Neohydatothrips samayunkur (15,295 bp), a member of subfamily Sericothripinae. The genome possesses the canonical 13 protein-coding genes (PCGs), 22 transfer RNA genes (tRNAs), and two ribosomal RNA genes (rRNAs) as well as two putative control regions (CRs). The majority strand was 77.42% A + T content, and 22.58% G + C with weakly positive AT skew (0.04) and negative GC skew (-0.03). The majority of PCGs start with ATN codons as observed in other insect mitochondrial genomes. The GCG codon (Alanine) was not used in N. samayunkur. Most tRNAs have the typical cloverleaf secondary structure, however the DHU stem and loop were absent in trnV and trnS1, while the TΨC loop was absent in trnR and trnT. The two putative control regions (CR1 and CR2) show 99% sequence similarity indicated a possible duplication, and shared 57 bp repeats were identified. N. samayunkur showed extensive gene rearrangements, with 11 PCGs, 22 tRNAs, and two rRNAs translocated when compared to the ancestral insect. The gene trnL2 was separated from the 'trnL2-cox2' gene block, which is a conserved, ancestral gene order found in all previously sequenced thrips mitogenomes. Both maximum likelihood (ML) and Bayesian inference (BI) phylogenetic trees resulted in similar topologies. The phylogenetic position of N. samayunkur indicates that subfamily Sericothripinae is sister to subfamily Thripinae. More molecular data from different taxonomic groups is needed to understand thrips phylogeny and evolution.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30655597      PMCID: PMC6336932          DOI: 10.1038/s41598-018-37889-6

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Introduction

The order Thysanoptera (thrips) includes nine families in two suborders, the Terebrantia and Tubulifera. The family Thripidae is the largest of the Terebrantia and is further subdivided into four subfamilies; Dendrothripinae, Panchaetothripinae, Sericorthripinae, and Thripinae[1]. The members of Sericothripinae have a worldwide distribution and are usually associated with flowers[2,3]. This subfamily currently includes 168 species in three genera, Neohydatothrips, Hydatothrips, and Sericothrips. The marigold thrips, Neohydatothrips samayunkur is a pest of marigold (Tagetes spp.) with a worldwide distribution[4-6]. Recently, N. samayunkur has also been suspected as a vector for tospoviruses[7]. Integration of molecular data with morphology is required for fast and accurate species identification and to understand phylogenetic relationships[2]. The mitochondrial genes cox1 and 16S rRNA have been found to be useful in the identification of thrips species and to infer phylogenetic relationships[8-10], however, phylogenetic relationships below the family level in thrips are still unclear and require more molecular data[2-4]. Insects typically have a single circular mitochondrial genome, 14–19 kb in size, with 37 genes, including 13 protein-coding genes (PCGs), large and small ribosomal RNA genes (rRNAs), 22 transfer RNA genes (tRNAs) and variable number of A + T rich control regions (CRs). The characteristic features of the animal mitochondrial genomes are (i) conserved gene content, (ii) conserved genome size and organization, (iii) lack of extensive recombination, (iv) maternal inheritance, and (v) an accelerated rate of nucleotide substitution[11-13]. Therefore, this small molecule has been widely used in insect phylogenetic and evolutionary studies[14-16]. To date, the highly rearranged mitogenomes of five thrips species (Anaphothrips obscurus, Frankliniella intonsa, Frankliniella occidentalis, Scirtothrips dorsalis and Thrips imaginis) are available[17-21]. However, the availability of thrips mitogenomes is limited to the subfamily Thripinae. In this study, we sequenced the complete mitochondrial genome of N. samayunkur, a member of subfamily Sericothripinae using next-generation sequencing (NGS) technology and compared it to other thrips mitogenomes, analysing genome organization, codon usage patterns, tRNA secondary structure and strand asymmetry. Phylogenetic relationships were inferred by analysing the 13 PCGs from published thrips mitogenomes using maximum likelihood (ML) and Bayesian inference (BI).

Results and Discussion

Genome structure, organization and composition

The complete sequence of the mitochondrial genome of N. samayunkur (accession number MF991901) is 15,295 base pairs (bp) in length. This is longer than those of A. obscurus, F. intonsa, F. occidentalis, and S. dorsalis South Asia (SA1), but smaller than the genomes of T. imaginis and S. dorsalis East Asia (EA1) (Table S1). As in other insect species, the mitochondrial genome of N. samayunkur included 37 genes: 13 PCGs, large and small rRNAs and 22 tRNAs but two putative CRs (Fig. 1). There are 204 bp intergenic nucleotides in total, across 23 locations, with individual spacer length of 1 to 41 bp. The longest intergenic spacer (41 bp) was located between the trnS2 and cox1 gene, with an extremely high AT content (85.37%). Three pairs of genes overlap with lengths ranging from 1 to 24 bp. Thirty genes are on the majority strand and seven on the minority (Table 1). Nucleotide composition was 77.42% A + T content and 22.58% G + C content (Table 2); similar to other thrips mitogenomes. A + T content was highest at 80.87%, in tRNAs, followed by rRNAs (79.31%), PCGs (77.15%), and CRs (71.12%). The mitogenome showed weakly positive AT (0.04) and negative GC (−0.03) skews (Table 2).
Figure 1

The mitochondrial genome of the marigold thrips, N. samayunkur. Direction of gene transcription is indicated by arrows in entire complete genome. PCGs are shown as purple arrows, rRNA genes as sea green arrows, tRNA genes as blue arrows and CR regions as red rectangles. The GC content is plotted using a black sliding window, as the deviation from the average GC content of the entire sequence. GC-skew is plotted using a colored sliding window (green and orchid color), as the deviation from the average GC-skew of the entire sequence. The figure was drawn using CGView online server (http://stothard.afns.ualberta.ca/cgview_server/) with default parameters. The species photograph was taken by the second author (KT) using Leica Microscope DM1000 with Leica software application suite (LAS EZ) and edited manually in Adobe Photoshop CS 8.0.

Table 1

List of annotated mitochondrial genes of Neohydatothrips samayunkur and its characteristic features.

GeneStrandLocationSize (bp)Anti codonStart codonStop codonIGN
cox1 +1–15361536ATATAA0
nad3 +1537–1890354ATGTAA25
cox2 +1916–2584669ATATAA10
trnD +2595–266167GAC0
trnR +2662–272362CGA3
trnG +2727–278862GGA7
trnK +2796–285863AAA13
cox3 +2872–3702831ATATAA14
trnN +3717–378266AAC−3
trnE +3780–384263GAA0
trnI +3843–390967ATC4
cytb +3914–50141101ATTTAA1
trnY 5016–507964TAC17
nad2 +5097–6087991ATCT(AA)0
trnW +6088–615265TGA0
nad1 +6153–7089937ATAT(AA)−24
trnM +7066–712661ATG8
trnF +7135–720672TTC6
rrnS +7213–79357231
atp8 7937–8152216ATTTAA13
atp6 +8166–8777612ATGTAG0
trnL1 +8778–884164CTA5
trnT +8847–890660ACA−1
trnQ +8906–897469CAA10
trnP 8985–905066CCA8
trnA +9059–912163GCA10
nad5 9132–108171686ATATAA0
trnH 10818–1087760CAC2
nad4 10880–121781299ATTTAA2
nad4L 12181–12459279ATGTAA0
CR2 +12460–127593000
trnV +12760–1281960GTA0
CR1 +12820–134476280
nad6 +13448–13918471ATTTAA1
rrnL +13920–1500410851
trnS1 +15006–1506964TCA0
trnC +15070–1512960TGC0
trnL2 +15130–1519465TTA2
trnS2 +15197–1525458AGA0

The protein coding and ribosomal RNA genes are represented by standard nomenclature, tRNAs are represented as trn followed by the IUPAC-IUB single letter amino acid codes. (+) values in strand represent as heavy (H) and (−) values represent as light (L). IGN represents (+) values as intergenic nucleotides and (−) values as overlapping regions. CR represents the control region.

Table 2

Composition and skew in different Thysanoptera mitogenomes included for comparative analysis.

SpeciesSize(bp)A%G%T%C%GC%AT%AT skewGC skew
Whole mtgenome
N.samayunkur 15,29540.2510.9837.1711.6022.5877.420.04−0.03
T. imaginis 15,40743.8510.4732.7212.9623.4376.570.15−0.11
F. intonsa 15,21541.2411.0634.6813.0124.0775.930.09−0.08
F. occidentalis 14,88940.9811.3536.6211.0622.4177.590.060.01
S. dorsalis EA115,34339.1211.6136.6212.6424.2675.740.03−0.04
S. dorsalis SA115,20439.8311.1837.5611.4222.6077.400.03−0.01
A. obscurus 14,89038.3811.2739.7510.6021.8778.13−0.020.03
PCG
N. samayunkur 10,98239.5910.7937.5612.0622.8577.150.03−0.06
T. imaginis 10,92242.7510.1532.8914.2124.3675.640.13−0.17
F. intonsa 11,00939.9511.3934.5814.0825.4774.530.07−0.11
F. occidentalis 10,85239.8211.6236.7211.8423.4676.540.04−0.01
S. dorsalis EA110,95438.0611.9236.5313.4825.4174.590.02−0.06
S. dorsalis SA110,97338.9411.3637.6712.0323.3876.620.02−0.03
A. obscurus 11,16737.3611.4639.9311.2522.7177.29−0.030.01
tRNA
N. samayunkur 1,40143.119.8537.769.2819.1380.870.070.03
T. imaginis 1,49243.839.4536.6610.0519.5080.500.09−0.03
F. intonsa 1,39243.5310.7035.789.9920.6979.310.100.03
F. occidentalis 1,38042.3910.5837.399.6420.2279.780.060.05
S. dorsalis EA11,42640.5311.0137.5210.9421.9578.050.040.00
S. dorsalis SA11,42941.3610.4338.2110.0120.4379.570.040.02
A. obscurus 1,43039.7910.6339.869.7220.3579.650.000.04
rRNA
N. samayunkur 1,80844.9711.7334.358.9620.6979.310.130.13
T. imaginis 1,87647.6510.7732.149.4320.2079.800.190.07
F. intonsa 1,69947.1511.3032.029.5420.8479.160.190.08
F. occidentalis 1,84845.9412.1833.937.9520.1379.870.150.21
S. dorsalis EA11,77543.2111.8934.999.9221.8078.200.110.09
S. dorsalis SA11,77745.3611.6534.448.5520.2079.800.140.15
A. obscurus 1,81243.1611.7036.598.5520.2579.750.080.16
Control region
N. samayunkur 92833.8414.8737.2814.0128.8871.120.05−0.03
T. imaginis 90047.5616.6725.2210.5627.2272.780.310.22
F. intonsa 94241.727.8638.2212.2120.0679.940.04−0.22
F. occidentalis 59540.347.9043.708.0715.9784.03−0.04−0.01
S. dorsalis EA11,77543.2111.8934.999.9221.8078.200.110.09
S. dorsalis SA176735.339.2643.5511.8621.1278.88−0.10−0.12
A. obscurus 14525.528.9762.762.7611.7288.28−0.420.53
The mitochondrial genome of the marigold thrips, N. samayunkur. Direction of gene transcription is indicated by arrows in entire complete genome. PCGs are shown as purple arrows, rRNA genes as sea green arrows, tRNA genes as blue arrows and CR regions as red rectangles. The GC content is plotted using a black sliding window, as the deviation from the average GC content of the entire sequence. GC-skew is plotted using a colored sliding window (green and orchid color), as the deviation from the average GC-skew of the entire sequence. The figure was drawn using CGView online server (http://stothard.afns.ualberta.ca/cgview_server/) with default parameters. The species photograph was taken by the second author (KT) using Leica Microscope DM1000 with Leica software application suite (LAS EZ) and edited manually in Adobe Photoshop CS 8.0. List of annotated mitochondrial genes of Neohydatothrips samayunkur and its characteristic features. The protein coding and ribosomal RNA genes are represented by standard nomenclature, tRNAs are represented as trn followed by the IUPAC-IUB single letter amino acid codes. (+) values in strand represent as heavy (H) and (−) values represent as light (L). IGN represents (+) values as intergenic nucleotides and (−) values as overlapping regions. CR represents the control region. Composition and skew in different Thysanoptera mitogenomes included for comparative analysis.

Protein-coding genes

All 13 PCGs used ATN start codons (five with ATA, four with ATT, three with ATG and one with ATC) as is observed in most of the insect mitochondrial genomes[22,23]. The stop codon TAA was used by 10 PCGs, and TAG for atp6, while an incomplete stop codon is present in nad1 and nad2. Comparative analysis of start and stop codons among thrips showed the unique features of N. samayunkur: ATT start codon in cytb and nad6, ATC in nad2, and ATG in atp6. The complete stop codon TAA was used by atp8 in N. samayunkur, while it was terminated by an incomplete stop codon T(AA) in other thrips species (S2 Table). The detection of an incomplete stop codon in atp8 gene may be due to misannotation, as atp8-atp6 is a conserved ancestral gene block with no tRNA between them[24]. The entire length of PCGs of N. samayunkur was 10,982 bp. Overall A + T content of 13 PCGs was 77.15% in N. samayunkur, while it ranges from 74.53% to 77.29% across thrips. Codon usage in N. samayunkur shows a significant bias towards A/T rich codons. Relative synonymous codon usage analysis of N. samayunkur revealed that the codon GCG (Alanine) was not present at all. The most frequently utilized amino acids were Lysine (K), Phenylalanine (F), Leucine (L), Isoleucine (I), Tyrosine (Y), and Serine (S) as in other insects (S3 Table).

Ribosomal and transfer RNA genes

N. samayunkur has two rRNAs as in other insects. The large ribosomal gene (16S) was 1085 bp long, and located between nad6 and trnS1; the small (12S) was 723 bp long, located between trnF and atp8 (Table 2). A + T content of two rRNAs was 79.31%, while it ranges from 79.16% (S. dorsalis EA1) to 79.87% (F. occidentalis) observed in other thrips. N. samayunkur contained a complete set of 22 tRNAs (total length 1,401 bp) individually ranging from 58 to 72 bp in length. Collectively tRNAs have the highest A + T content 80.87% of any gene group (78.05% to 80.87% in thrips) (Table 2). Most tRNAs have the typical cloverleaf secondary structure except trnV, trnS1, trnR, and trnT. The DHU stem and loop were absent in trnV and trnS1 while TΨ C loop absent in trnR and trnT (Fig. S1). The absence of DHU stem and loop in trnV is consistent across all thrips species sequenced to date.

Control regions

Control regions (CRs) in mitogenomes play an important role in transcription and replication[25]. A CR was found with following conserved elements; a poly T stretch at the 5′ end, a TA(A)n-like stretch, a stem and loop structure, a TATA motif, and a GAT motif[26-28]. The N. samayunkur mitogenome contains two putative control regions, CR1 (628 bp) and CR2 (300 bp), located between trnV and nad6, and trnV and nad4L respectively. CR2 had 99% sequence similarity with CR1, indicating a possible duplication. Three near tandem repeats (57 bp) were identified in CR1, while one repeat sequence was present in CR2 (Fig. 2). Most thrips species have been documented to have multiple CRs except A. obscurus. Three CRs are present in F. intonsa, F. occidentalis, and S. dorsalis SA1, two in T. imaginis, S. dorsalis EA1, and one in A. obscurus[18-21]. A location of CR1 upstream of nad5 gene has been suggested to be ancestral condition of thrips[17], however, the CR locations in N. samayunkur (subfamily Sericothripinae) differ from those of other thrips.
Figure 2

Comparison of the nucleotide sequences of two putative control regions of N. samayunkur. Four types of sequences were recognized in the control regions: tandem repeats, Poly T-stretches, A + T-rich sequences, TA(A)n motif, TATA motif, GAT motif and stem and loop. The figure was edited in Adobe Photoshop CS 8.0.

Comparison of the nucleotide sequences of two putative control regions of N. samayunkur. Four types of sequences were recognized in the control regions: tandem repeats, Poly T-stretches, A + T-rich sequences, TA(A)n motif, TATA motif, GAT motif and stem and loop. The figure was edited in Adobe Photoshop CS 8.0.

Gene arrangement

The mitogenome gene arrangements have been characterized by following patterns, transpositions, inversions, and inverse transpositions[11,29,30]. Tandem duplication–random loss (TDRL) is the most widely accepted process to explain transpositions[11]. The gene arrangement of N. samayunkur was assessed by comparing the common intervals with the ancestral insect gene order as an outgroup[17,31]. CREx[32] analysis identified eight gene rearrangement events in N. samayunkur, including four inversions plus four TDRLs, assignable to two sets of alternative scenarios (Fig. S2). CREx detected inversions of atp8, trnF, trnC and gene block nad1-rrnS in both scenarios. N. samayunkur is a highly rearranged mitogenome with rearrangements of 11 PCGs, 22tRNAs, and two rRNAs as compared with the ancestral insect (Fig. 3). The majority of rearrangements were transpositions, while nine rearrangements (nad1, atp8, trnF, trnL1, trnQ, trnV, trnC, rrnS, and rrnL) were inverse transpositions. Further, when N. samayunkur was compared to other thrips species, the following derived gene blocks: trnG-cox3, trnN-trnE, trnY-nad1, trnF-atp6, and nad5-nad4L were conserved in all thrips species. Within the conserved gene block trnF-atp6, atp8 was subsequently inverted in N. samayunkur. The following tRNAs were inverted in thrips species as compared to the ancestral insect: trnY in S. dorsalis SA1, trnP in both S. dorsalis, trnS1 in T. imaginis, and trnF in all species except S. dorsalis SA1. The gene trnL2 was transposed in N. samayunkur away from the gene block trnL2-cox2, which is conserved in most insects including thrips (Fig. 3). The gene block trnD-cox3 is conserved in five thrips species including N. samayunkur, while trnD and trnR were translocated in T. imaginis and interrupted by the CR2 in S. dorsalis EA1. The gene block nad4L-nad5 is ancestral in insects and conserved in all thrips species. The conserved gene blocks trnY-nad1 and atp6-trnF are separated by trnM and trnA in most of the thrips species except N. samayunkur (trnM alone) and A. obscurus (trnA alone).
Figure 3

Linearized view of complete mitochondrial genome organization and gene rearrangement, transposition, inversion, and inverse transposition in N. samayunkur compared with the ancestral insect gene order. The green color blocks show the conserve gene blocks. The grey color blocks showed the pseudo genes of T. imaginis. Genes nomenclature: atp6 and atp8; ATP synthase subunits 6 and 8; cytb: cytochrome b; cox1–3: cytochrome c oxidase subunits 1–3; nad1–6 and nad4L: NADH dehydrogenase subunits 1–6 and 4L; rrnS and rrnL: small and large subunit ribosomal RNA (rRNA) genes; Transfer RNA genes are denoted by a one-letter symbol according to the IPUCIUB single-letter amino acid codes. CR indicates the control regions. The figure was edited in Adobe Photoshop CS 8.0.

Linearized view of complete mitochondrial genome organization and gene rearrangement, transposition, inversion, and inverse transposition in N. samayunkur compared with the ancestral insect gene order. The green color blocks show the conserve gene blocks. The grey color blocks showed the pseudo genes of T. imaginis. Genes nomenclature: atp6 and atp8; ATP synthase subunits 6 and 8; cytb: cytochrome b; cox1–3: cytochrome c oxidase subunits 1–3; nad1–6 and nad4L: NADH dehydrogenase subunits 1–6 and 4L; rrnS and rrnL: small and large subunit ribosomal RNA (rRNA) genes; Transfer RNA genes are denoted by a one-letter symbol according to the IPUCIUB single-letter amino acid codes. CR indicates the control regions. The figure was edited in Adobe Photoshop CS 8.0.

Strand asymmetry

AT and GC skews on the majority strand are used to measure strand asymmetry[33-35]. Most insects have positive AT skew (A > T) and negative GC skew (C > G). A reversal of the strand asymmetry (T > A and G > C) has been observed in a few species, and is proposed to be caused by the inversion of the replication origin within the control region[26-28,35]. N. samayunkur showed weakly positive AT skew (0.04) and negative GC skew (−0.03), similar to most other thrips species (Table 2). AT skew in other thrips species, ranges from −0.02 (A. obscurus) to 0.15 (T. imaginis), while GC skew varies from −0.11 (T. imaginis) to 0.03 (A. obscurus). Insect species with reversal of strand asymmetry have faster rate of gene rearrangements, however, species with faster rate of gene rearrangements do not always show reversal of strand asymmetry[35]. Two species of thrips (A. obscurus and F. occidentalis) showed weekly positive GC skew value. However, there is no inversion of replication related elements in CR indicating that strand asymmetry is not reversed in thrips.

Phylogenetic analysis

Both Maximum likelihood (ML) and Bayesian Inference (BI) phylogenetic trees resulted in similar topologies (Fig. S3, Fig. 4). BI posterior probabilities (PP) were higher than ML bootstrap support (BS) values. It has been suggested that PP and BS values are not directly comparable and interchangeable, as PP gives higher nodal support than BS[36]. Species within the same genus, F. intonsa and F. occidentalis were grouped together and closely related to T. imaginis. The two cryptic species of S. dorsalis (EA1 and SA1) also clustered together and were closely related to the Frankliniella + Thrips clade. The four genera in the subfamily Thripinae: Anaphothrips, Frankliniella, Scirtothrips, and Thrips grouped together. N. samayunkur shows a sister relatioship to the Thripinae clade in the present phylogeny. Although, gene order is extensively rearranged among thrips mitogenomes, the branching pattern inferred by MLGO[37] is congruent with the PCGs based ML and BI phylogeny (Fig. 5). Previous studies showed a close relationship between T. imaginis with S. dorsalis[17], however, our study found that T. imaginis was closer to Frankliniella than to Scirtothrips, congruent with morphological understanding of these taxa[38].
Figure 4

Phylogenetic tree inferred from nucleotide sequences of 13 PCGs using Bayesian Inference method in MrBayes v3.2. The tree is drawn to scale with bayesian posterior probability values indicated along with the branches. The figure was edited in Adobe Photoshop CS 8.0.

Figure 5

Phylogenetic tree inferred from MLGO web server based on gene arrangements.

Phylogenetic tree inferred from nucleotide sequences of 13 PCGs using Bayesian Inference method in MrBayes v3.2. The tree is drawn to scale with bayesian posterior probability values indicated along with the branches. The figure was edited in Adobe Photoshop CS 8.0. Phylogenetic tree inferred from MLGO web server based on gene arrangements. Previous studies described a close relationships between Scirtothrips (subfamily Thripinae) and Neohydatothrips (subfamily Sericothripinae) using both morphological[6] and molecular data[8,9]. These two genera share the following morphological characters:[6] presence of closely spaced rows of microtrichia on lateral thirds of abdominal tergites; median pair of tergal setae close together; campaniform sensilla absent on tergite IX, tergite X not split longitudinally. The suborder Terebrentia is typically classified into eight families with four subfamilies[39]; however, an alternative view proposes 28 families based on highly conserved taxonomic characters and elevates the subfamily Sericothripinae to family rank[40]. According to Bhatti, the proposed family Sericothripidae can be separated from other families of suborder Terebrantia by the presence of sublateral callosities on antecostal line on tergites II to VII and sternites II to VI (II to VIII in male), prominent anteriorly directed sublateral apodeme on each side on female sternites VII, one pair of cervical sclerites, annular rows of microtrichia on femora and tibiae, short and straight hind coxal apodeme, metathoracic furca not elongate and lyre-shaped, metasternellum somewhat to strongly enlarged, forming a transversely striate area on each side of the mid line, mesothorax with sternal coxal process small and inconspicuous; trochantin small and inconspicuous, absence of sclerites anterior to mesoacrotergite, one anteriorly directed apodeme on each side of sternite I. Moreover, previous studies clearly stated that the relationships between Thripinae (1779 species in 234 genera) and Sericothripinae (168 species in 3 genera) were unclear due to the absence of molecular data[8,41]. The present phylogenetic analysis contradicts a close relationship between Scirtothrips and Neohydatothrips. The mitogenomes of two cryptic species of S. dorsalis (SA1 and EA1) vary considerably with respect to gene rearrangement and chromosome size[20]. S. dorsalis was described from “castor and chillies” at Coimbatore, India[42]. It is a polyphagous pest and a vector of tospoviruses with a global distribution. Earlier studies indicated that this species is a complex, consisting of many morphologically indistinguishable species[43,44]. Recently, nine cryptic species of S. dorsalis were delimited using multilocus molecular data[45], however, these cryptic species has never been morphologically treated to validate and describe these species. Tagging specimens with the correct species name is a major problem as it is difficult to ascertain which of these cryptic species represent the true S. dorsalis. To date, molecular phylogenetic studies of thrips is in its early stages due to lack of large scale data and taxonomic sampling.The generation of comprehensive molecular data on families/subfamilies is still needed.

Materials and Methods

Sample collection and DNA extraction

Adult specimens of N. samayunkur were collected from Odisha State, India. The studied species are common pests of crops, thus no prior permission was required for collection. Specimens were morphologically identified by the second author (K.T.) with available taxonomic keys[2,10], and preserved in absolute ethyl alcohol at −30 °C in Centre for DNA Taxonomy, Molecular Systematics Division, Zoological Survey of India, Kolkata. Genomic DNA was extracted using DNeasy (QIAGEN) following the manufacturer’s standard protocol. Concentration of DNA was determined using a Qubit fluorometer with a dsDNA high-sensitivity kit (Invitrogen), and by agarose gel (0.8%) electrophoresis.

Mitogenome sequencing and assembly

The whole genome library of genomic DNA was sequenced using the Illumina Hiseq2500 (2 × 150 base paired-end reads) (Illumina, USA) platform which yielded ~23 million reads. The paired-end library was constructed according to standard protocols for the TruSeq DNA Library Preparation kit (https://support.illumina.com/downloads/truseq). Raw sequencing reads were trimmed and quality filtered using the NGS-Toolkit[46] to removing adapter contamination and low-quality reads (N’s or more than 70% of bases with a quality score < 20). High quality reads were filtered by using the Burrows-Wheeler Alignment (BWA) tool[47] and assembled with SPAdes 3.9.0[48], using default parameters, and the S. dorsalis mitochondrial genome (NC_025241.1) as a reference. Aligned reads were used for de novo mitochondrial genome assembly.

Genome annotation, visualization, and comparative analysis

The assembled mitogenome was annotated using the MITOS web-server (http://mitos.bioinf.uni-leipzig.de/index.py)[49]. PCGs and rRNAs were confirmed manually by BLASTn, BLASTp and ORF Finder in NCBI[50,51] (https://www.ncbi.nlm.nih.gov/orffinder/). Nucleotide sequences from protein coding genes (PCGs) were translated into putative proteins on the basis of the invertebrate mitochondrial genetic code. Initiation and termination codons were identified in ClustalX[52] using other thrips reference mitogenome sequences. MEGA6[53] was used for the alignment of homologous sequences across thrips species. The complete annotated mitogenome was submitted to NCBI GenBank using Sequin tool (http://www.ncbi.nlm.nih.gov/Sequin/). The circular map of the N. samayunkur mitogenome was illustrated by the CGView online server (http://stothard.afns.ualberta.ca/cgview_server/) with default parameters[54]. MEGA6 was used for estimation of nucleotide composition, codon usages, relative synonymous codon usage (RSCU) and composition of skewness with the following formula: AT skew = (A − T)/(A + T) and GC skew = (G − C)/(G + C)[55]. Secondary structures of transfer RNA (tRNA) genes were predicted by MITOS and further confirmed using tRNAscan-SE (http://lowelab.ucsc.edu/tRNAscan-SE/)[56] and ARWEN 1.2[57]. RNAstructure version 6.0.1 was used to predict possible secondary structure within CRs[58]. Homology between CR1 and CR2 in N. samayunkur was determined through the ClustalW sequence alignment tool implemented in MEGA6. Gene arrangements pathways in N. samayunkur were evaluated by CREx (Common Interval Rearrangement Explorer)[32]. Six complete mitogenomes of five thrips species were retrieved from GenBank on 1st November 2017 for phylogenetic inference (S1 Table). The A. bakeri mitogenome was used as an out group[31]. Each PCG was aligned individually using the MAFFT algorithm in the TranslatorX[59] online platform under the L-INS-i strategy based on codon-based multiple alignment. Poorly aligned nucleotides (1652 bp) were removed from the protein alignment using GBlocks (within TranslatorX) with default settings. The resulting alignments were concatenated by using Sequence Matrix1.7.8[60]. Concatenated dataset (9330 bp) was used for Bayesian inference (BI) and maximum likelihood (ML) analysis. PartitionFinder version 2.1.1[61], with the greedy algorithm was used to find the best substitution models and partition schemes. Partitions were predefined for the codon positions for each PCGs (13 genes X 3 codons = 39 partitions). The BI analysis was performed using Mr. Bayes 3.2[62] with HKY + I + G, TVM + G, TRN + G, GTR + I + G, HKY + G, GTR + I, TVM + I + G model estimated by PartitionFinder (S4 Table). Two runs each with four chains (three heated and one cold) for 500,000 generations, and trees were sampled every 100 generations. A consensus tree was acquired and visualized after excluding the first 25% trees as burn-in. The ML analysis was performed using the IQ-TREE[63] Web Server in W-IQ-TREE[64] (http://iqtree.cibiv.univie.ac.at/) with 1,000 replicates of ultrafast likelihood bootstrap[65]. The phylogenetic tree was visualized and edited using FigTree v1.4.2 (http://tree.bio.ed.ac.uk/software/figtree/)[66]. Phylogenetic relationships of studied taxa were also estimated based on gene arrangement patterns in the MLGO web server[37] (S5 Table). Supplementary info
  47 in total

1.  Mitogenomics: digging deeper with complete mitochondrial genomes.

Authors: 
Journal:  Trends Ecol Evol       Date:  1999-10       Impact factor: 17.712

2.  Overcredibility of molecular phylogenies obtained by Bayesian phylogenetics.

Authors:  Yoshiyuki Suzuki; Galina V Glazko; Masatoshi Nei
Journal:  Proc Natl Acad Sci U S A       Date:  2002-11-25       Impact factor: 11.205

3.  How do insect nuclear and mitochondrial gene substitution patterns differ? Insights from Bayesian analyses of combined datasets.

Authors:  Chung-Ping Lin; Bryan N Danforth
Journal:  Mol Phylogenet Evol       Date:  2004-03       Impact factor: 4.286

4.  Phylogeny of Arthropoda inferred from mitochondrial sequences: strategies for limiting the misleading effects of multiple changes in pattern and rates of substitution.

Authors:  Alexandre Hassanin
Journal:  Mol Phylogenet Evol       Date:  2005-11-14       Impact factor: 4.286

5.  Evidence for multiple reversals of asymmetric mutational constraints during the evolution of the mitochondrial genome of metazoa, and consequences for phylogenetic inferences.

Authors:  Alexandre Hassanin; Nelly Léger; Jean Deutsch
Journal:  Syst Biol       Date:  2005-04       Impact factor: 15.683

6.  Deviations from Chargaff's second parity rule in organellar DNA Insights into the evolution of organellar genomes.

Authors:  Christoforos Nikolaou; Yannis Almirantis
Journal:  Gene       Date:  2006-06-28       Impact factor: 3.688

7.  Molecular identification key for pest species of Scirtothrips (Thysanoptera: Thripidae).

Authors:  Paul F Rugman-Jones; Mark S Hoddle; Laurence A Mound; Richard Stouthamer
Journal:  J Econ Entomol       Date:  2006-10       Impact factor: 2.381

8.  Asymptotically increasing compliance of genomes with Chargaff's second parity rules through inversions and inverted transpositions.

Authors:  Guenter Albrecht-Buehler
Journal:  Proc Natl Acad Sci U S A       Date:  2006-11-08       Impact factor: 11.205

9.  The highly rearranged mitochondrial genome of the plague thrips, Thrips imaginis (Insecta: Thysanoptera): convergence of two novel gene boundaries and an extraordinary arrangement of rRNA genes.

Authors:  Renfu Shao; Stephen C Barker
Journal:  Mol Biol Evol       Date:  2003-03       Impact factor: 16.240

10.  GenBank.

Authors:  Dennis A Benson; Ilene Karsch-Mizrachi; David J Lipman; James Ostell; David L Wheeler
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

View more
  4 in total

1.  The complete mitochondrial genome of the endangered Assam Roofed Turtle, Pangshura sylhetensis (Testudines: Geoemydidae): Genomic features and phylogeny.

Authors:  Shantanu Kundu; Vikas Kumar; Kaomud Tyagi; Kailash Chandra
Journal:  PLoS One       Date:  2020-04-23       Impact factor: 3.240

2.  The Complete Mitochondrial Genome of endemic giant tarantula, Lyrognathus crotalus (Araneae: Theraphosidae) and comparative analysis.

Authors:  Vikas Kumar; Kaomud Tyagi; Rajasree Chakraborty; Priya Prasad; Shantanu Kundu; Inderjeet Tyagi; Kailash Chandra
Journal:  Sci Rep       Date:  2020-01-09       Impact factor: 4.379

3.  Novel gene rearrangement in the mitochondrial genome of Anastatus fulloi (Hymenoptera Chalcidoidea) and phylogenetic implications for Chalcidoidea.

Authors:  Jiequn Yi; Han Wu; Jianbai Liu; Jihu Li; Yinglin Lu; Yifei Zhang; Yinjie Cheng; Yi Guo; Dunsong Li; Yuxing An
Journal:  Sci Rep       Date:  2022-01-25       Impact factor: 4.996

4.  Substantial rearrangements, single nucleotide frameshift deletion and low diversity in mitogenome of Wolbachia-infected strepsipteran endoparasitoid in comparison to its tephritid hosts.

Authors:  Sharon Towett-Kirui; Jennifer L Morrow; Markus Riegler
Journal:  Sci Rep       Date:  2022-01-10       Impact factor: 4.379

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.