Literature DB >> 24885120

Transcriptomic analyses reveal the adaptive features and biological differences of guts from two invasive whitefly species.

Xiao-Dong Ye, Yun-Lin Su, Qiong-Yi Zhao, Wen-Qiang Xia, Shu-Sheng Liu, Xiao-Wei Wang1.   

Abstract

BACKGROUND: The gut of phloem feeding insects is critical for nutrition uptake and xenobiotics degradation. However, partly due to its tiny size, genomic information for the gut of phloem feeding insects is limited.
RESULTS: In this study, the gut transcriptomes of two species of invasive whiteflies in the Bemisia tabaci complex, Middle East Asia Minor 1 (MEAM1) and Mediterranean (MED), were analyzed using the Illumina sequencing. A total of 12,879 MEAM1 transcripts and 11,246 MED transcripts were annotated with a significant Blastx hit. In addition, 7,000 and 5,771 gut specific genes were respectively identified for MEAM1 and MED. Functional analyses on these gut specific genes demonstrated the important roles of gut in metabolism of insecticides and secondary plant chemicals. To reveal the molecular difference between guts of MEAM1 and MED, a comparison between gut transcriptomes of the two species was conducted and 3,910 pairs of orthologous genes were identified. Based on the ratio of nonsynonymous and synonymous substitutions, 15 genes were found evolving under positive selection. Many of those genes are predicted to be involved in metabolism and insecticide resistance. Furthermore, many genes related to detoxification were expressed at an elevated level in the gut of MED compared to MEAM1, which might be responsible for the MED's higher resistance to insecticides and environmental stresses.
CONCLUSION: The sequencing of MED and MEAM1 gut transcriptomes and extensive comparisons of MEAM1 and MED gut transcripts provide substantial sequence information for revealing the role of gut in whiteflies.

Entities:  

Mesh:

Year:  2014        PMID: 24885120      PMCID: PMC4035086          DOI: 10.1186/1471-2164-15-370

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

Many hemipteran insects feed on phloem sap, which is composed of rich content of sucrose, relatively poor component of amino acids, and small quantities of proteins and inorganic substances [1]. The gut of these sap feeders is often supposed to have strong capacities in dealing with the unbalanced diet. Efficient absorption of limited organic nutrients, such as amino acids, high activity of phloem sugar hydrolysis, and maintenance of osmotic pressure at an appropriate level are speculated to be the main function of guts [2-4]. Compared to the sap of other plant cells, the fluid of phloem is a good diet with less toxic substances [5]. However, defense secondary metabolites and proteins, such as alkaloids, glucosinolates, glucosides, chitinase and protein inhibitors, are detectable in the phloem, and have been shown to exert negative effects on phloem feeders [6]. As a major organ of the insect digestive system, the gut is likely involved in detoxification of harmful substances in phloem during digestion and assimilation [7-9]. In addition, insect guts play important roles in pesticide resistance and xenobiotics metabolism [10-12]. Insecticide resistance can arise by over expression of detoxification enzymes such as cytochrome P450 monooxygenases (P450), UDP-glucuronosyltransferase (UGT), glutathione-S-transferases (GST) [13-17]. These proteins can convert insecticides and toxic compounds into less toxic or nontoxic chemicals [18]. Therefore, insect guts are critical in plant-insect interaction and insecticide resistance. The whitefly Bemisia tabaci (Gennadius) (Hemiptera: Aleyrodidae) is now known as a complex of genetically distinct cryptic species that often differ in host range, insecticide resistance, capacity for virus transmission, and the symbionts they harbor [19-24]. Worldwide, more than 35 putative cryptic species of the complex have so far been identified [24-28]. Bemisia tabaci impairs plant by sucking phloem sap and transmitting over 100 species of plant viruses in the genus Begomovirus during feeding [29]. Within the species complex, the Mediterranean (MED, previously known as the Q ‘biotype’) and Middle East-Asia Minor 1 (MEAM1, previously known as the B ‘biotype’) species are highly invasive and have caused considerable economic damages to many important crops. MEAM1 invaded China in the mid-1990s, and displaced the native whiteflies of the B. tabaci complex rapidly and became the dominant whitefly species in many regions of invasion [19, 27]. In 2003, MED was first detected in Yunnan province of China [30], and since then has been rapidly spreading to many provinces and replacing other species in the B. tabaci complex including the earlier invader MEAM1 [27, 31–33]. Compared to some native whitefly species, many genes involved in drug metabolism and detoxification pathways were highly expressed in the invasive species, which may contribute to their invasion and displacement of other indigenous species [34]. In addition, several studies have revealed that the greater abundance of MED relative to MEAM1 in Israel and southern Spain were associated with its higher levels of resistance to pyriproxyfen and neonicotinoids [35, 36]. Comparison of MEAM1 and MED transcriptomes demonstrated that the sequence divergence of pesticide resistance genes may cause functional differences in corresponding enzymes and result in the biological variations [37]. However, no genomic information is yet available for the whitefly gut despite the importance of gut in plant-insect interaction and insecticide resistance, Due to the fact that the body length of an adult whitefly is only ca. 1 mm and the size of its gut is much smaller, the collection of microgram amounts of total whitefly gut RNA is extremely difficult. To overcome this obstacle, we utilized the cDNA amplification method described in previous studies to obtain large amount of MEAM1 and MED gut nucleotide samples [34, 37, 38]. The amplified gut cDNA was used for library construction and Illumina sequencing. After sequence assembly, a total of 33,412 MEAM1 gut transcripts and 27,443 MED gut transcripts were obtained. Through the analysis of the transcriptome data, genomic features and putative functions of the whitefly gut were revealed. Furthermore, the divergences of MED and MEAM1 gut genes were analyzed and presented for the first time. This study provides a valuable source of molecular information for future functional studies on whitefly guts and will facilitate the research of guts in whitefly-plant interactions and insecticide resistance.

Result and discussion

Illumina sequencing, reads assembly and functional annotation

The amplified cDNA samples of MEAM1 and MED guts were separately sequenced using the Illumina HiSeq 2000 platform. Initially, about 30 and 29 million raw reads were obtained from the libraries of MEAM1 and MED guts, respectively (Table 1). The raw reads were filtered by removing those with adaptors and ambiguous nucleotides. After that, approximately 27 million clean reads were obtained for each sample. Subsequently, MEAM1 and MED gut transcriptomes were de novo assembled using the short reads assembling program – Trinity [39]. A total of 65,213 and 60,357 Inchworm contigs were assembled for MEAM1 and MED respectively. After paired-end joining and sequence clustering, 33,412 MEAM1 gut transcripts with the mean size of 625 nucleotides and 27,443 MED gut transcripts with the mean size of 632 nucleotides were acquired (Table 1). The lengths of these transcripts ranged from 300 to over 3,000 nucleotides. For functional annotation, all the transcripts of the two transcriptomes were searched against NCBI nr nucleotide database using BLASTx. For MEAM1 gut transcriptome, 12,879 transcripts got significant BLAST hits (E-value < 1.0E−5) (Additional file 1), and for MED gut, 11,246 transcripts got significant BLAST hits (E-value < 1.0E−5) (Additional file 2).
Table 1

Summary for the MEAM1 and MED gut transcriptomes

FeaturesMEAM1 gutMED gut
Total number of raw reads30,066,09629,326,438
Total number of clean reads27,222,86426,782,986
Total clean nucleotides (nt)2,450,057,7602,410,468,740
Average read length (nt)9090
Total number of Inchworm contigs65,21360,357
Mean length of Inchworm contigs354330
Total transcripts33,41227,443
Mean length of transcript units (nt)625632
Summary for the MEAM1 and MED gut transcriptomes

Assignment of transcripts to Gene Ontology (GO) terms and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways

To further reveal their functions, GO assignments were used to classify the MED and MEAM1 gut transcripts. Based on sequence homology, a total of 7,127 MEAM1 and 6,305 MED gut sequences were categorized into 58 functional groups at level two under the ‘Biological process’, ‘Cellular component’ and ‘Molecular function’ divisions (Additional file 3). However, the profiles of the 58 GO groups in the two species had some differences. While ‘virion’ and ‘virion part’ were only present in the MEAM1 gut, ‘channel regulator activity’ and ‘carbohydrate utilization’ were unique to the MED gut. The top three groups of each division of these two transcriptomes were the same. Specifically, the top three groups in ‘Biological process’ were ‘cellular process’, ‘metabolic process’ and ‘biological regulation’; the top three groups in ‘Cellular component’ were ‘cell’, ‘cell part’ and ‘organelle’; and the top three groups in ‘Molecular function’ were ‘catalytic activity’, ‘binding’ and ‘transporter activity’. In order to find out which biological pathways are active in whitefly guts, the MEAM1’s 33,412 transcripts and MED’s 27,443 transcripts were assigned to the reference pathways in KEGG. Consequently, 5,943 MEAM1 gut transcripts were mapped to 256 pathways and 5,254 MED gut transcripts were mapped to 253 pathways. Among these pathways, ‘Spliceosome’ (325), ‘RNA transport’ (295) and ‘Ubiquitin mediated proteolysis’ (264) (Figure 1A) were the most common pathways in MEAM1. As to the MED gut, ‘Spliceosome’ has the highest percentage of transcripts (307 transcripts), followed by ‘RNA transport’ (276 transcripts) and ‘Lysosome’ (233 transcripts) (Figure 1B).
Figure 1

Distribution of transcripts among the KEGG pathways. (A) Transcripts from the MEAM1 gut. (B) Transcripts from the MED gut. The top ten pathways with highest numbers of transcripts mapped to were shown. Abbreviation for pathways: Spliceosome (Spl), RNA transportor (RNAT), Ubiquitin mediated proteolysis (UMP), Purine metabolism (PuM), Lysosome (Lys), Endocytosis (Endo), Starch and sucrose metabolism (SSM), Protein processing in endoplasmic reticulum (PPER), Regulation of actin cytoskeleton (RAC), Focal adhesion (FoA), Bile secretion (BiS), Pyrimidine metabolism (PyM).

Distribution of transcripts among the KEGG pathways. (A) Transcripts from the MEAM1 gut. (B) Transcripts from the MED gut. The top ten pathways with highest numbers of transcripts mapped to were shown. Abbreviation for pathways: Spliceosome (Spl), RNA transportor (RNAT), Ubiquitin mediated proteolysis (UMP), Purine metabolism (PuM), Lysosome (Lys), Endocytosis (Endo), Starch and sucrose metabolism (SSM), Protein processing in endoplasmic reticulum (PPER), Regulation of actin cytoskeleton (RAC), Focal adhesion (FoA), Bile secretion (BiS), Pyrimidine metabolism (PyM).

KEGG pathway enrichment analysis of gut genes

Enrichment analysis is an effective way to identify the KEGG pathways that frequently occur in a tissue with the whole body transcriptome as background [40, 41]. In MED, a total of 11 gut enriched KEGG pathways (P-value < 5.0E−3) were identified (Table 2). Whereas in MEAM1, there were 25 gut enriched pathways (Additional file 4). Even though the numbers of enriched pathways differ, the functions of the enriched pathways appeared similar between MED and MEAM1 guts. Pathways like ‘Metabolism of cofactors and vitamins’, ‘Carbohydrate metabolism’ and ‘Digestive system’ were enriched in both MEAM1 and MED guts. This is consistent with the principal function of sap-sucking insect’s gut – uptake of nutrients. In addition, ‘Membrane transport’ and ‘Transport and catabolism’ were also enriched. These transport-related pathways have been shown to be important during the secretion of digestive enzymes and the formation of gradient between the gut luminal sap and the perimicrovillar space [3, 42, 43]. Interestingly, ‘Xenobiotics biodegradation and metabolism’ pathway was highly enriched in MED, but not in MEAM1 (Table 2 and Additional file 4). In addition, pathways such as ‘Drug metabolism - other enzymes’, ‘Drug metabolism - cytochrome P450’, ‘Metabolism of xenobiotics by cytochrome P450’ were also enriched in MED gut (Table 2), which is consistent with the high insecticide resistance of MED whiteflies [44-47].
Table 2

Statistically enriched KEGG pathways in MED guts

KEGGE-valueGut genes1 WB genes2
Xenobiotics biodegradation and metabolism0.00E + 00465518
 Drug metabolism - cytochrome P4506.61E-10156161
 Metabolism of xenobiotics by cytochrome P4501.18E-08151164
 Drug metabolism - other enzymes9.30E-07158193
Metabolism of cofactors and vitamins4.18E-13429555
 Retinol metabolism3.97E-11141131
 Porphyrin and chlorophyll metabolism1.06E-06124141
Digestive system8.97E-13634906
 Bile secretion0.00E + 00198117
 Mineral absorption8.40E-043737
 Fat digestion and absorption2.18E-034958
 Carbohydrate digestion and absorption4.32E-034047
Transcription1.99E-11456622
 Spliceosome7.64E-10307396
Carbohydrate metabolism2.98E-0710551793
 Ascorbate and aldarate metabolism1.47E-07110113
 Pentose and glucuronate interconversions2.68E-05125157
Membrane transport5.67E-058598
 ABC transporters5.67E-058598
Excretory system3.04E-04204305
 Vasopressin-regulated water reabsorption1.96E-047994
Metabolism of Terpenoids and Polyketides3.09E-046676
 Insect hormone biosynthesis3.47E-042924
Transport and catabolism1.52E-036831212
 Lysosome2.77E-03233377
Folding, sorting and degradation2.16E-036741201
 Proteasome7.76E-045563
Lipid metabolism4.01E-03518914
 Steroid hormone biosynthesis1.73E-07124134
Others
 Tight junction1.52E-05156204
 Cardiac muscle contraction3.43E-0377105
 Other types of O-glycan biosynthesis3.30E-05113139
 beta-Alanine metabolism3.81E-033742
 Pyrimidine metabolism3.14E-03170265
 Phototransduction1.71E-062511
 Olfactory transduction1.26E-033434
 Ribosome biogenesis in eukaryotes3.47E-04150213

1The number of gut genes in each of the KEGG pathways.

2The number of whole-body (WB) genes in each of the KEGG pathways.

Statistically enriched KEGG pathways in MED guts 1The number of gut genes in each of the KEGG pathways. 2The number of whole-body (WB) genes in each of the KEGG pathways.

Gut specific genes

In order to reveal the specific function of whitefly guts, the orthologous genes between whitefly gut and whole body transcriptomes were identified using OrthoMCL [48]. The gut genes that cannot be classified into any orthologous groups were considered as gut specific genes. In our analysis, the MEAM1 gut specific genes were identified against the MEAM1 whole body transcriptome [37] and MED gut specific genes against the MED whole body transcriptome [49]. As a result, a total of 7,000 and 5,771 specific genes were identified for MEAM1 and MED guts respectively. The proportion of MEAM1 and MED gut specific genes to the whole gut transcriptome were nearly equal (20.95% and 21.03%). Next, these gut specific genes were classified through KEGG annotation (Table 3). The results showed that ‘Alpha-glucosidase’, ‘Facilitated trehalose transporter’ and ‘MFS transporter’ terms contained the most gut specific genes. This is consistent with the function of whitefly gut in sucrose hydrolysis and nutrient absorption. In addition, a number of detoxification-related genes such as cytochrome P450, GST and glucuronosyltransferase were also found specifically expressed in the gut (Table 3).
Table 3

The KO classification of gut specific genes

KO IDNumber of genesKO definition
MED gut
 K0069942glucuronosyltransferase [EC:2.4.1.17]
 K0136335cathepsin B [EC:3.4.22.1]
 K0118731alpha-glucosidase [EC:3.2.1.20]
 K0922828KRAB domain-containing zinc finger protein
 K1095527intestinal mucin-2
 K1500223cytochrome P450, family 6 [EC:1.14.-.-]
 K1425821facilitated trehalose transporter
 K1441018acid phosphatase [EC:3.1.3.2]
 K0079916glutathione S-transferase [EC:2.5.1.18]
 K0110416protein-tyrosine phosphatase [EC:3.1.3.48]
 K0814514MFS transporter, SP family, solute carrier family 2
 K0328311heat shock 70 kDa protein 1/8
 K0611511spectrin beta
 K1038011ankyrin
 K1497211PAX-interacting protein 1
 K0565810ATP-binding cassette, subfamily B (MDR/TAP)
MEAM1 gut
 K0118739alpha-glucosidase [EC:3.2.1.20]
 K0922833KRAB domain-containing zinc finger protein
 K0069922glucuronosyltransferase [EC:2.4.1.17]
 K1500222cytochrome P450, family 6 [EC:1.14]
 K0136321cathepsin B [EC:3.4.22.1]
 K1425821facilitated trehalose transporter
 K1095518intestinal mucin-2
 K0079915glutathione S-transferase [EC:2.5.1.18]
 K0110414protein-tyrosine phosphatase [EC:3.1.3.48]
 K0814513MFS transporter, SP family, solute carrier family 2
 K1038013ankyrin
 K0564311ATP-binding cassette, subfamily A (ABC1), member 3
 K0805211neurofibromin 1
 K1178911HIV-1 Vpr-binding protein
 K1038210dystonin
 K1059410E3 ubiquitin-protein ligase HERC1 [EC:6.3.2.19]
 K1420910solute carrier family 36 (proton-coupled amino acid transporter)

Note: ‘The KO IDs with more than 10 gut genes were shown in this table’.

The KO classification of gut specific genes Note: ‘The KO IDs with more than 10 gut genes were shown in this table’. To validate whether these gut genes are specifically expressed, several genes were randomly picked out to analyze their expression levels. Total RNA was separately extracted from the gut and the rest of the body of MEAM1 and MED whiteflies. Quantitative real-time PCR (qPCR) results showed that all the selected genes were highly or specifically expressed in the gut (Figure 2). The expression of these genes was significantly low or almost non-detectable in the rest of whitefly body.
Figure 2

Expression of gut specific genes. In each assay, the expression level was normalized to the lowest expression level, which was arbitrarily set at one. The number in parentheses represents the ID number of transcripts in the MEAM1 and MED gut transcriptomes.

Expression of gut specific genes. In each assay, the expression level was normalized to the lowest expression level, which was arbitrarily set at one. The number in parentheses represents the ID number of transcripts in the MEAM1 and MED gut transcriptomes.

Orthologous genes between the MED and MEAM1 gut transcriptomes

To compare the sequence divergence of the MEAM1 and MED gut genes, a bidirectional best hit approach was used to identify orthologous genes [37, 50, 51]. By this way, 14,472 pairs of putative orthologous genes between MEAM1 and MED guts were identified (Additional file 5). To filter out potential paralogs, only pairs of sequences mapped unambiguously to the same protein in Swissprot database with an E-value < 1E−5 were selected as orthologous genes. As a result, 3,987 pairs of orthologous genes were identified between MEAM1 and MED gut transcriptomes (Additional file 5). The coding sequence (CDS) and untranslated region (UTR) of each sequence pair were identified based on the predicted coding region. After filtering the sequences shorter than 75 bp, a total of 3,910 pairs of orthologous CDS were identified (Additional file 5 and Additional file 6). The average length of the 3,910 orthologous genes was 677.7 bp with an average similarity of 99.25%.

The sequence divergence between MEAM1 gut and MED gut orthologous genes

The sequence divergence of orthologous genes between MEAM1 and MED guts was analyzed to reveal their molecular variation. As shown in Table 4, the overall difference between 5’UTRs of MEAM1 and MED gut orthologous genes was 1.69% and the difference between 3’UTR of MEAM1 and MED orthologous genes was 1.59%. When it comes to the CDS, the overall divergence among the 3,910 orthologous gene pairs was 0.75%. The lower divergence rate at CDS might be due to the high selection pressure. In coding regions, the nucleotides can be further classified as nondegenerative (nd) sites (any substitutions result in amino acid change) and four fold degenerate (4d) sites (no changes produce amino acid replacement). From a total of 2,649.92 kb of CDS, a total of 1,551.93 kb were nd sites, whereas 375.10 kb were 4d sites (Table 4). As any nucleotide substitutions at nd sites will produce amino acid changes, the nd sites are under extensive functional constraints in the evolution process. Indeed, at 4d sites, the divergence between MEAM1 and MED was 2.48%, 14.6 times of that at nd sites (0.17%) (Table 4). Therefore, the difference at 4d sites might be the main source of sequence divergence between MEAM1 and MED gut genes. In order to verify the sequence divergence, 5 pairs of genes with known point mutations were randomly selected for cloning and Sanger sequencing from both MEAM1 and MED guts (Additional file 7). Our results showed that: i) for both species, the sequences from the de novo assembled transcriptome and Sanger sequencing are identical; ii) all of the differences between MEAM1 and MED revealed from the bioinformatic analysis were confirmed with Sanger sequencing. These results suggest that the transcriptome sequences of MEAM1 and MED guts and their divergence are reliable (Additional file 7).
Table 4

Sequence divergence between MED and MEAM1 gut transcriptomes

% Differences
%GCLociMeanSECompared kb
5’UTRsa 39.286851.690.09109.69
CDSb 43.4639100.750.012649.92
 nd sitesc 44.2139100.170.011551.93
 4d sitesd 37.7439102.480.05375.10
3’ UTRs32.7910131.590.07336.37

aUTRs: untranslated regions.

bCDS: coding sequence.

cnd sites: non-degenerative sites.

d4d sites: fourfold-degenerate sites where no changes cause any amino acid replacement.

Sequence divergence between MED and MEAM1 gut transcriptomes aUTRs: untranslated regions. bCDS: coding sequence. cnd sites: non-degenerative sites. d4d sites: fourfold-degenerate sites where no changes cause any amino acid replacement.

Analysis of sequences with weak amino-acid similarity

The 3,910 sequence pairs between MEAM1 and MED guts had a mean homology of 99.25% and ranged from 78.38% to 100% (Additional file 6). The functions of sequence pairs with weak amino-acid similarity were analyzed to reveal the proteins responsible for the differences between the two species. Many of the highly diverged genes were related to sugar metabolism such as gene Pair 2223 (sequence identity: 93.69%) and gene Pair 3793 (sequence identity: 95.30%), both of which encode alpha-glucosidase (Additional file 6). In addition, gene Pair 3748 encoding a sugar transporter also showed high sequence divergence (sequence identity: 96.12%). We also noticed that genes related to ‘Xenobiotics metabolism’ were highly diverged, such as carboxylesterase clade E (95.13%), two GSTs (95.19%, 96.26%) and two UGTs (95.45%, 95.56%) (Additional file 6). Furthermore, the 3,910 MEAM1 and MED gut orthologous genes were classified by KEGG pathways and the average divergences of genes in each pathway were calculated. Interestingly, the most divergent pathways were those related to metabolism, such as ‘Steroid hormone biosynthesis’, ‘Drug metabolism - cytochrome P450’ and ‘Metabolism of xenobiotics by cytochrome P450’ (Table 5). These data indicate that these pathways of xenobiotics biodegradation and metabolism may result in the difference of some major biological characteristics such as insecticide resistance. Besides, ‘Starch and sucrose metabolism’ and ‘Ascorbate and aldarate metabolism’ also had a low average identity, suggesting that MEAM1 and MED guts may have different metabolic capacities as well. Furthermore, the high divergence of genes in ‘Nicotinate and nicotinamide metabolism’ pathway might be responsible for the differences in host plant utilization between MEAM1 and MED whiteflies (Table 5).
Table 5

Average identities of orthologous genes between MEAM1 and MED guts in different KEGG pathways

Pathway IDPathway descriptionNumber of gene pairsAverage identity
ko00140Steroid hormone biosynthesis300.9767
ko00982Drug metabolism - cytochrome P450380.9770
ko00053Ascorbate and aldarate metabolism250.9777
ko00980Metabolism of xenobiotics by cytochrome P450370.9778
ko00040Pentose and glucuronate interconversions300.9790
ko00514Other types of O-glycan biosynthesis300.9806
ko00983Drug metabolism - other enzymes410.9811
ko00860Porphyrin and chlorophyll metabolism360.9816
ko00830Retinol metabolism380.9819
ko00760Nicotinate and nicotinamide metabolism110.9827
ko00350Tyrosine metabolism130.9830
ko00500Starch and sucrose metabolism710.9833
ko00563Glycosylphosphatidylinositol(GPI)-anchor biosynthesis130.9846
ko04974Protein digestion and absorption260.9850
ko04976Bile secretion520.9853
Average identities of orthologous genes between MEAM1 and MED guts in different KEGG pathways

Synonymous and nonsynonymous sites

To classify genes undergoing purifying and positive selections, the substitution rates of synonymous (Ks) and nonsynonymous (Ka) sites between MEAM1 and MED gut ortholog pairs were calculated. A total of 1,080 ortholog pairs generated both Ka and Ks (Figure 3 and Additional file 8). The mean value of Ka of these 1,080 sequences was 0.0056 and the mean value of Ks was 0.0392. The average Ka/Ks ratio was 0.1951, which is similar to the average Ka/Ks ratio of MEAM1 and MED whole body (0.225) [37]. In the 1,080 selected orthologous gene pairs, the majority of genes’ Ka/Ks ratios were less than 1 (98.61%, 1065/1080), indicating purifying selection for these genes [52]. At the same time, 15 gene pairs’ Ka/Ks values were larger than 1 (Table 6), indicating that these genes are under strong positive selection. Some of the genes are related to energy metabolism and xenobiotic metabolism, such as NADH dehydrogenase, ATP synthase, mannosyl-oligosaccharide 1 and UGT, suggesting that these processes may be critical to the difference between MEAM1 and MED guts.
Figure 3

Distribution of Ka and Ks. Sequences with Ka/Ks > 1 fall below the solid line; while sequences with Ka/Ks between 0.5 -1 fall between the solid and dashed lines.

Table 6

List of gene pairs with Ka/Ks larger than one

Gene pair IDS-Suba N-Subb Kac Ksd Ka/KsProtein homolog
1245160.00620.00321.9268ER degradation-enhancing alpha-mannosidase
2260260.04510.02411.8694Paternally-expressed gene 3 protein
1570160.01240.00701.7688ATP synthase subunit s-like protein
38782120.01780.01091.6398Eukaryotic translation initiation factor
2569140.00650.00401.5995Solute carrier family 25 member 38
1312140.01430.01051.3721NADH dehydrogenase
1207150.01170.00881.3270NADH dehydrogenase
911140.00890.00721.2397Uncharacterized protein
1382250.00910.00761.2037UDP-glucuronosyltransferase
3874260.03830.03351.1414Steroid 17-alpha-hydroxylase/17,20 lyase
91130.00790.00751.0525Uncharacterized protein
385130.00610.00581.0478Uncharacterized protein
2769390.01270.01251.014439S ribosomal protein
2661130.00760.00751.014228S ribosomal protein
38236140.03290.03281.0037Glutamic acid-rich protein

aS-Sub Synonymous substitutions.

bN-Sub Nonsynonymous substitutions.

cKa Nonsynonymous substitution rate.

dKs Synonymous substitution rate.

Distribution of Ka and Ks. Sequences with Ka/Ks > 1 fall below the solid line; while sequences with Ka/Ks between 0.5 -1 fall between the solid and dashed lines. List of gene pairs with Ka/Ks larger than one aS-Sub Synonymous substitutions. bN-Sub Nonsynonymous substitutions. cKa Nonsynonymous substitution rate. dKs Synonymous substitution rate.

Differential expression of the orthologous genes between MEAM1 and MED guts

To compare the level of gene expression between MEAM1 and MED gut orthologous genes, the number of reads mapped to each orthologous gene pairs was extracted. Among the 3,910 orthologous genes between MEAM1 and MED guts, a total of 64 genes were down-regulated and 304 were up-regulated in MEAM1 (log2 > 1, FDR value < 1.0E−3) (Additional file 9). In order to validate the data from bioinformatic analysis, the gene expression between 20 MEAM1 and MED gut genes were examined using qPCR (10 MEAM1 up-regulated genes and 10 MEAM1 down-regulated genes). Out of the 20 selected genes, 19 showed a concordant direction of change between transcriptome analysis and qPCR quantification results, which confirmed the reliability of our analyses (Additional file 10). Of the differentially expressed genes, the detected fold changes (log2 ratio) of gene expression ranged from minimum −4.62 to maximum 6.60 (Figure 4). The majority of orthologous genes were up- or down-regulated between 1.0 and 2.0 fold, whereas 41 orthologous pairs were with a value of log2 > 2, and 17 orthologous pairs with a value of log2 < −2 (Figure 4). Among all these 58 differentially expressed gene pairs, 4 detoxification-related genes (3 P450s and 1 GST) were significantly up-regulated in MED gut compared with MEAM1 gut, and none of them was up-regulated in MEAM1, which may be responsible for the difference in insecticide resistance between MEAM1 and MED.
Figure 4

Differential expressions of the orthologous genes between MEAM1 and MED guts. (A) The numbers of differentially expressed genes between MEAM1 and MED guts. (B) The log2 ratio distribution of differentially expressed genes between MEAM1 and MED gut transcriptomes.

Differential expressions of the orthologous genes between MEAM1 and MED guts. (A) The numbers of differentially expressed genes between MEAM1 and MED guts. (B) The log2 ratio distribution of differentially expressed genes between MEAM1 and MED gut transcriptomes.

Conclusions

In this study, we sequenced the transcriptomes of MEAM1 and MED guts, and obtained 33,412 MEAM1 and 27,443 MED gut transcripts. Among these genes, 12,879 MEAM1 and 11,246 MED transcripts had significant nr BLAST hits. The main function of whitefly gut was revealed by the GO and KEGG analysis of MEAM1 and MED transcriptomes. Then, 3,910 orthologous genes pairs between MED and MEAM1 guts were identified. Gene divergence analysis showed that 15 genes were under strong positive selection and many of them are related to energy and xenobiotic metabolism. In addition, some of the drug metabolism and detoxification related genes such as P450s, GSTs were expressed at a higher level in MED than in MEAM1. The higher expression and divergence of these genes may contribute to the high detoxification ability and insecticide resistance of MED. To our knowledge, this is the first substantial sequencing and comparison of gut transcriptomes in whiteflies. These analyses provide a valuable resource of molecular information for future investigations of the functions of whitefly gut genes.

Methods

Whitefly rearing, sample collection and RNA isolation

Cotton Gossypium hirsutum L. (Malvaceae) cv. Zhe-Mian 1793 was used as the host plant for rearing MEAM1 and MED whiteflies. Whiteflies were maintained in climate chambers at 27 ± 1 °C, a photoperiod of 14 h/10 h and 70 ± 10% relative humidity. Every 3–5 generations, the purity of the cultures was monitored using the random amplified polymorphic DNA polymerase chain reaction technique with the primer H16 (5’-TCTCAGCTGG-3’), which has been used widely to distinguish different species in the B. tabaci complex [53], and further confirmed by mtCOI sequencing of a few individuals. The cultures of the two species were raised from a pair of virgin adult whiteflies of MEAM1 or MED and maintained on cotton for 3 generations before they were taken for experiments. For sample collection and RNA isolation, about 200 guts were dissected from the MEAM1 and MED adult whiteflies in PBS respectively. Then total gut RNA was isolated using the Absolutely RNA Nanoprep kit (Agilent, USA) according to the manufacturer’s manual with slight modifications [38].

Sequencing library construction

The gut cDNA was prepared by using a SMARTer™ PCR cDNA Synthesis Kit (Clontech, USA) and an Advantage 2 PCR Kit (Clontech, USA) as described previously [38]. Briefly, the RNA sample was reverse transcribed to first-strand cDNA. Then, the first-strand cDNA product was used for PCR amplification. To determine the optimal number of PCR cycles, 5 μl of PCR products at 15, 18, 21, 24, 27 and 30 cycles were checked by agarose gel electrophoresis. According to the analysis, 2 μl of first-strand cDNA was amplified for 26 cycles. Then, the amplified cDNA was purified by QIAquick PCR purification kit (Qiagen, Germany). At last, library for transcriptome sequencing was prepared using Illumina kit following the manufacturer’s recommendations.

Illumina sequencing and transcript annotation

The MEAM1 and MED gut libraries were sequenced for both ends using Illumina HiSeq 2000 platform at Beijing Genomics Institute (Shenzhen, China). The total sequencing amount was 3G for each library. After removing the low quality reads, de novo transcriptome assembly was carried out with the Trinity program [39]. There are three independent modules in the Trinity program, known as “Inchworm”, “Chrysalis” and “Butterfly”. The module “Inchworm” assembled the RNA-Seq data into unique transcripts which we called Inchworm contigs. “Chrysalis” clustered the Inchworm contigs and constructed complete de Bruijn graphs for each cluster and then partitioned the full read set among these disjoint graphs. “Butterfly” processed the individual graphs in parallel, tracing the paths based on reads and pair-end information, ultimately reporting full-length transcripts for alternatively spliced isoforms. After Trinity assembly, TGICL with parameter setting “-l 40 -c 10” was used to cluster and remove redundant transcripts. The remaining sequences after TGICL clustering were defined as tentative transcripts. The transcripts can be divided into two classes. One is cluster, which include several transcripts with more than 70% similarity among them and the prefix is CL. The other class is singletons with the prefix unigene. Next, Blastx search (E-value < 1E−5) against the NCBI nr, Swiss-Prot and KEGG databases was performed and the best aligning results were used to decide the direction of transcripts. If results of different databases conflict with each other, a priority order of nr, SwissProt and KEGG were followed to decide the direction of transcripts. The data sets are available at the NCBI Short Read Archive (SRA) with the accession number: SRR835757 (MEAM1 gut) and SRR835756 (MED gut). The assembled sequences have been deposited in the NCBI's TSA database: GAPP00000000 (MEAM1 gut) and GAPQ00000000 (MED gut).

Identification of statistically enriched pathways

KEGG enrichment analysis is an effective way to identify the enriched pathways in a specific tissue using the whole body transcriptome as a background [54]. The significantly enriched pathways in the gut were identified by a hypergeometric test with the calculating formula: . In this formula, N and n represent the total numbers of whole body and gut transcriptome genes with KEGG annotations, while M, m are the numbers of whole body and gut transcriptome genes annotated to a certain KEGG pathway. The pathways with a p-value less than 5.0E−3 were determined as enriched. In our analyses, human diseases associated pathways were excluded.

Identification of gut specific expressed genes

The OrthoMCL program was used to detect MEAM1 and MED gut specific genes compared to the whole body transcriptomes of MEAM1 and MED [48]. Complete lists (in FASTA format) of all predicted proteins in gut and whole body transcriptomes were used as templates. The OrthoMCL program began with all-vs-all BLASTP of the complete protein set [55]. Putative orthologous relationships were identified between gut and body by reciprocal best similarity pairs. The OrthoMCL program classified all putative orthologous into orthologous groups. Thus, gut genes that could not be categorized into any of the orthologous groups were regarded as gut specifically expressed genes.

Orthologous genes between MEAM1 and MED gut transcriptomes

The orthologous genes between MEAM1 gut and MED gut transcriptomes were identified by MegaBLAST [37, 50]. Only pairs of sequences that were best hit (E-value < 1E−5) to each other and longer than 200 bp were retained as putative orthologs. To prevent potential orthologous paralogs, all putative orthologs obtained in the previous step were further filtered and only pairs of sequences unambiguously mapped to the same protein in Swissprot database with an E-value < 1E−5 were selected [34]. Transcripts were firstly aligned by Blastx (E-value < 1.0E−5) to nr, Swiss-Prot and KEGG databases. Proteins with the highest rank in Blast results were taken to identify the CDS of transcripts. CDS with unexpected stop codon in the Blast hit region and shorter than 75 bp were removed. The 5’UTR were defined based on the position of start codon and 3’UTR regions were defined based on the position of stop codon and predicted CDS.

Sequence divergence analysis and estimation of substitution rates

The 5’UTR, CDS and 3’UTR regions were separately extracted from each pair of orthologs. To ensure the sequence comparison was performed only at the homologous regions of each gene pair, CDS and UTRs regions were aligned to each other separately and checked manually for errors. Sequence divergences between the homologous regions of each gene pair were calculated by dividing the number of substitutions with the number of base pairs in the comparison. The divergence was determined for both nd and 4d sites. In addition, Ka and Ks were estimated using the KaKs Calculator (YN method) [56, 57].

Differential expression of orthologous genes

To compare the level of gene expression between MEAM1 and MED guts, the number of reads mapped to each orthologous gene pairs was extracted. Since read mapping is sensitive to the size of the target reference sequence and sequencing amount, the raw read counts were adjusted by the total number of reads mapped and the length of the gene by calculating Reads Per Kilo-base per Million mapped reads (RPKM). The fold change of the gene expression level of ortholog gene pairs in MEAM1 and MED transcriptomes were calculated with log 2 (RPKM of the MEAM1 gut gene/RPKM of the MED gut gene). After that, significance of gene expression differences was tested using the algorithm established by Audic et al.[58]. At last the Benjamini Hochberg procedure was used for multiple testing correction and estimating the False Discovery Rate (FDR) [59].

Real-time quantitative PCR analysis

To validate gut specific expressed genes, total RNA was extracted from gut and the rest of the body (excluding gut) respectively. Then, all samples were used for first-strand cDNA synthesis with a PrimeScript RT reagent kit (Takara, Japan). Amplification of those cDNA samples was carried out in Bio-Rad CFX96TM Real-Time System (Bio-Rad, USA) using SYBR Premix Ex Taq TM II (Takara, Japan). The relative expression levels of each gene in gut and body were normalized using the threshold cycle (Ct) values that were obtained from reactions run on the same plate. TATA Box Binding Protein-Associated Factor (TAF) was chosen as the endogenous reference gene in qPCR analysis with the 2-△△Ct method [60, 61]. To confirm the results of RPKM analyses of orthologous gene expression between MEAM1 and MED gut, the levels of expression of 20 selected genes (10 MEAM1 up-regulated genes and 10 MEAM1 down-regulated genes) were measured. The primers of these genes were designed based on the perfectly matched orthologous region (Additional file 10). Additional file 1: Top BLAST hits of MEAM1 gut transcripts. BLAST results against the NCBI nr database for all the transcripts with a cut-off E-value <1.0E−5 are shown. (XLSX 843 KB) Additional file 2: Top BLAST hits of MED gut transcripts. BLAST results against the NCBI nr database for all the transcripts with a cut-off E-value <1.0E−5 are shown. (XLSX 745 KB) Additional file 3: Histogram presentation of GO classification of genes from the MEAM1 and MED gut transcriptomes. The results are summarized in three main categories “Biological process”, “Cellular component” and “Molecular function”. The right y-axis indicates the number of genes in a category and the left y-axis indicates the percentage of a specific category of genes in that main category. (PDF 2 MB) Additional file 4: MEAM1 guts enriched KEGG pathways. The MEAM1 gut enriched pathways (level 2) was identified by a hypergeometric test with MEAM1 whole body transcriptome as the background. Pathways with E-value <5.0E−3 were regarded as enriched. (XLSX 10 KB) Additional file 5: Identification and analysis of the orthologous genes between the gut transcriptomes of MEAM1 and MED. The orthologous genes were identified by bidirectional best hit method using MegaBLAST. All putative orthologs were further filtered against the Swissprot database. The putative orthologs that hit to different genes in the Swissprot database were removed. The CDS of the orthologous genes were determined by BLASTx against the Swissprot database with a threshold E-value of 1.0E−5. After removing the UTR regions, sequences shorter than 75 bp and with unexpected codons in the CDS were filtered. (PDF 314 KB) Additional file 6: List of the orthologous gene pairs between MEAM1 and MED gut transcripts. The length of orthologous region, identity and Nr annotations are shown. (XLSX 443 KB) Additional file 7: Verification of the sequence divergence between MEAM and MED. Five pairs of orthologous genes that are different in sequence between MEAM1 and MED were cloned and sequenced. The Transcript ID and primer sequences are listed in the table. (XLSX 9 KB) Additional file 8: Ka and Ks of each orthologous gene pairs. S-Substitutions: synonymous substitutions; N-Substitutions: nonsynonymous substitutions; Ka: nonsynonymous substitution rate; Ks: synonymous substitution rate. Nr ID and Nr annotation are also shown. (XLSX 154 KB) Additional file 9: Differentially expressed orthologous genes between MEAM1 and MED guts. The differentially expressed orthologous genes (overlapping region ≥ 200 bp, FDR-value Additional file 10: Real time quantitative PCR (qPCR) analysis. Twenty orthologous genes were selected for validation of expression level between MEAM1 and MED gut using qPCR. In this table, we listed the fold change values revealed by the transcriptome analysis, the fold change obtained by RT-PCR, and the primers for the RT-PCR. (XLSX 12 KB)
  51 in total

1.  Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method.

Authors:  K J Livak; T D Schmittgen
Journal:  Methods       Date:  2001-12       Impact factor: 3.608

2.  The adaptation of insects to plant protease inhibitors.

Authors:  C Bolter; M A. Jongsma
Journal:  J Insect Physiol       Date:  1997-10       Impact factor: 2.354

Review 3.  Insect glutathione transferases and insecticide resistance.

Authors:  A A Enayati; H Ranson; J Hemingway
Journal:  Insect Mol Biol       Date:  2005-01       Impact factor: 3.585

4.  Change in the biotype composition of Bemisia tabaci in Shandong Province of China from 2005 to 2008.

Authors:  Dong Chu; Fang Hao Wan; You Jun Zhang; Judith K Brown
Journal:  Environ Entomol       Date:  2010-06       Impact factor: 2.377

Review 5.  Phloem-sap feeding by animals: problems and solutions.

Authors:  A E Douglas
Journal:  J Exp Bot       Date:  2006-01-31       Impact factor: 6.992

Review 6.  Bemisia tabaci: a statement of species status.

Authors:  Paul J De Barro; Shu-Sheng Liu; Laura M Boykin; Adam B Dinsdale
Journal:  Annu Rev Entomol       Date:  2011       Impact factor: 19.686

7.  Insect resistance to Bacillus thuringiensis: alterations in the indianmeal moth larval gut proteome.

Authors:  Mehmet Candas; Olga Loseva; Brenda Oppert; Pradeepa Kosaraju; Lee A Bulla
Journal:  Mol Cell Proteomics       Date:  2003-01       Impact factor: 5.911

8.  Biotypes B and Q of Bemisia tabaci and their relevance to neonicotinoid and pyriproxyfen resistance.

Authors:  A Rami Horowitz; Svetlana Kontsedalov; Vadim Khasdan; Isaac Ishaaya
Journal:  Arch Insect Biochem Physiol       Date:  2005-04       Impact factor: 1.698

9.  Transcriptomic and phylogenetic analysis of Culex pipiens quinquefasciatus for three detoxification gene families.

Authors:  Liangzhen Yan; Pengcheng Yang; Feng Jiang; Na Cui; Enbo Ma; Chuanling Qiao; Feng Cui
Journal:  BMC Genomics       Date:  2012-11-10       Impact factor: 3.969

10.  Intestinal transcriptomes of nematodes: comparison of the parasites Ascaris suum and Haemonchus contortus with the free-living Caenorhabditis elegans.

Authors:  Yong Yin; John Martin; Sahar Abubucker; Alan L Scott; James P McCarter; Richard K Wilson; Douglas P Jasmer; Makedonka Mitreva
Journal:  PLoS Negl Trop Dis       Date:  2008-08-06
View more
  17 in total

1.  Glucosinolate Desulfation by the Phloem-Feeding Insect Bemisia tabaci.

Authors:  Osnat Malka; Anton Shekhov; Michael Reichelt; Jonathan Gershenzon; Daniel Giddings Vassão; Shai Morin
Journal:  J Chem Ecol       Date:  2016-03-10       Impact factor: 2.626

2.  Variable vision in variable environments: the visual system of an invasive cichlid (Cichla monoculus) in Lake Gatun, Panama.

Authors:  Daniel Escobar-Camacho; Michele E R Pierotti; Viktoria Ferenc; Diana M T Sharpe; Erica Ramos; Cesar Martins; Karen L Carleton
Journal:  J Exp Biol       Date:  2019-03-18       Impact factor: 3.312

3.  De novo assembly and characterization of central nervous system transcriptome reveals neurotransmitter signaling systems in the rice striped stem borer, Chilo suppressalis.

Authors:  Gang Xu; Shun-Fan Wu; Ya-Su Wu; Gui-Xiang Gu; Qi Fang; Gong-Yin Ye
Journal:  BMC Genomics       Date:  2015-07-15       Impact factor: 3.969

4.  Transcriptome-based identification of ABC transporters in the western tarnished plant bug Lygus hesperus.

Authors:  J Joe Hull; Kendrick Chaney; Scott M Geib; Jeffrey A Fabrick; Colin S Brent; Douglas Walsh; Laura Corley Lavine
Journal:  PLoS One       Date:  2014-11-17       Impact factor: 3.240

5.  The draft genome of whitefly Bemisia tabaci MEAM1, a global crop pest, provides novel insights into virus transmission, host adaptation, and insecticide resistance.

Authors:  Wenbo Chen; Daniel K Hasegawa; Navneet Kaur; Adi Kliot; Patricia Valle Pinheiro; Junbo Luan; Marcus C Stensmyr; Yi Zheng; Wenli Liu; Honghe Sun; Yimin Xu; Yuan Luo; Angela Kruse; Xiaowei Yang; Svetlana Kontsedalov; Galina Lebedev; Tonja W Fisher; David R Nelson; Wayne B Hunter; Judith K Brown; Georg Jander; Michelle Cilia; Angela E Douglas; Murad Ghanim; Alvin M Simmons; William M Wintermantel; Kai-Shu Ling; Zhangjun Fei
Journal:  BMC Biol       Date:  2016-12-14       Impact factor: 7.431

6.  A peptidoglycan recognition protein acts in whitefly (Bemisia tabaci) immunity and involves in Begomovirus acquisition.

Authors:  Zhi-Zhi Wang; Min Shi; Yi-Cun Huang; Xiao-Wei Wang; David Stanley; Xue-Xin Chen
Journal:  Sci Rep       Date:  2016-11-28       Impact factor: 4.379

7.  Genome sequencing of the sweetpotato whitefly Bemisia tabaci MED/Q.

Authors:  Wen Xie; Chunhai Chen; Zezhong Yang; Litao Guo; Xin Yang; Dan Wang; Ming Chen; Jinqun Huang; Yanan Wen; Yang Zeng; Yating Liu; Jixing Xia; Lixia Tian; Hongying Cui; Qingjun Wu; Shaoli Wang; Baoyun Xu; Xianchun Li; Xinqiu Tan; Murad Ghanim; Baoli Qiu; Huipeng Pan; Dong Chu; Helene Delatte; M N Maruthi; Feng Ge; Xueping Zhou; Xiaowei Wang; Fanghao Wan; Yuzhou Du; Chen Luo; Fengming Yan; Evan L Preisser; Xiaoguo Jiao; Brad S Coates; Jinyang Zhao; Qiang Gao; Jinquan Xia; Ye Yin; Yong Liu; Judith K Brown; Xuguo Joe Zhou; Youjun Zhang
Journal:  Gigascience       Date:  2017-05-01       Impact factor: 6.524

8.  Glucosylation prevents plant defense activation in phloem-feeding insects.

Authors:  Osnat Malka; Michael L A E Easson; Christian Paetz; Monika Götz; Michael Reichelt; Beate Stein; Katrin Luck; Aleksa Stanišić; Ksenia Juravel; Diego Santos-Garcia; Lilach L Mondaca; Simon Springate; John Colvin; Stephan Winter; Jonathan Gershenzon; Shai Morin; Daniel G Vassão
Journal:  Nat Chem Biol       Date:  2020-09-28       Impact factor: 15.040

9.  Identification of a Sulfatase that Detoxifies Glucosinolates in the Phloem-Feeding Insect Bemisia tabaci and Prefers Indolic Glucosinolates.

Authors:  Abinaya Manivannan; Bhawana Israni; Katrin Luck; Monika Götz; Elena Seibel; Michael L A E Easson; Roy Kirsch; Michael Reichelt; Beate Stein; Stephan Winter; Jonathan Gershenzon; Daniel Giddings Vassão
Journal:  Front Plant Sci       Date:  2021-06-04       Impact factor: 5.753

10.  Transcription analysis of neonicotinoid resistance in Mediterranean (MED) populations of B. tabaci reveal novel cytochrome P450s, but no nAChR mutations associated with the phenotype.

Authors:  Aris Ilias; Jacques Lagnel; Despoina E Kapantaidaki; Emmanouil Roditakis; Costas S Tsigenopoulos; John Vontas; Anastasia Tsagkarakou
Journal:  BMC Genomics       Date:  2015-11-14       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.