Literature DB >> 23658687

Transcriptome of the Lymantria dispar (gypsy moth) larval midgut in response to infection by Bacillus thuringiensis.

Michael E Sparks1, Michael B Blackburn, Daniel Kuhar, Dawn E Gundersen-Rindal.   

Abstract

Transcriptomic profiles of the serious lepidopteran insect pest Lymantria dispar (gypsy moth) were characterized in the larval midgut in response to infection by Bacillus thuringiensis kurstaki, a biopesticide commonly used for its control. RNA-Seq approaches were used to define a set of 49,613 assembled transcript sequences, of which 838, 1,248 and 3,305 were respectively partitioned into high-, mid- and low-quality tiers on the basis of homology information. Digital gene expression profiles suggested genes differentially expressed at 24 hours post infection, and qRT-PCR analyses were performed for verification. The differentially expressed genes primarily associated with digestive function, including α-amylase, lipase and carboxypeptidase; immune response, including C-type lectin 4; developmental genes such as arylphorin; as well as a variety of binding proteins: cellular retinoic acid binding protein (lipid-binding), insulin-related peptide binding protein (protein-binding) and ovary C/EBPg transcription factor (nucleic acid-binding). This is the first study conducted to specifically investigate gypsy moth response to a bacterial infection challenge using large-scale sequencing technologies, and the results highlight important genes that could be involved in biopesticide resistance development or could serve as targets for biologically-based control mechanisms of this insect pest.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23658687      PMCID: PMC3641027          DOI: 10.1371/journal.pone.0061190

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Lymantria dispar (Lepidoptera: Erebidae), the gypsy moth, is the most serious insect pest of forest and shade trees in the Northeastern United States. The gypsy moth is a polyphagous insect; North American larval populations feed on over 300 different shrub and tree species [1], including forest, shade, ornamental and fruit trees and shrubs. Gypsy moth larvae have been responsible for defoliating an average of 3.0 million forested acres per year over the past 25 years, as well as trees and shrubs in residential areas, causing significant economic impacts. Great effort has been made to slow the spread, yet attempts to fully contain this insect pest have had variable success and its range has continued to expand into the Midwest and South. Several control measures have been implemented in the U.S. to reduce gypsy moth spread, including stringent quarantine practices, augmentative release of natural enemies (mainly parasitoids), application of chemical pesticides (diflubenzuron =  Dimilin®), use of a chemical pheromone for mating disruption (Disparlure), or ground or aerial application of formulations containing either a specific nucleopolyhedrovirus (Gypchek®) or the gram-positive soil bacterium Bacillus thuringiensis (Bt). These microbe-based bioinsecticides have been used with variable success and bioinsecticide resistance has evolved rapidly in gypsy moth larval populations; the reasons for this are currently unknown. Limited genetic information exists for L. dispar. The only large publicly available data set for gypsy moth-associated genes was recently generated by characterizing the transcriptome from the L. dispar-derived cell line IPLB-Ld652Y [2]; however, this L. dispar cell line does not completely reflect the gene and gene systems expressed by whole insect larvae. Comprehensive genetic evaluations are needed to reveal genetic sensitivities of the pest, improve bioinsecticide selection, recognize genes that are determinants of disease and resistance development, and facilitate targeted pest management. Strains of Bt produce crystalline (Cry) proteins that possess insecticidal activity and act as a gut poison but are harmless to vertebrates and plants [3]. B. thuringiensis subsp. kurstaki, which produces three lepidopteran-insecticidal Cry1A proteins, is the strain most commonly employed as a bioinsecticide against lepidopteran pest larvae. Once ingested, Cry proteins are proteolytically activated by enzymatic cleavage in the alkaline larval midgut, where they bind to cadherin receptors and undergo insertion into the membrane of epithelial cells [4]. The bound toxin is changed in conformation and then able to bind to additional receptors [5]–[7]. The activated toxins ultimately form pores in epithelial cell membranes causing osmotic lysis and sloughing of damaged cells from the basement membrane of the midgut epithelium [8], [9]. The midgut becomes paralyzed, resulting in cessation of feeding activity and leading to death [10]. A different mechanism of Cry activity was demonstrated in Trichoplusia ni where Cry protein binding caused induction of the adenyl cyclase/PKA signaling pathway, leading to significant midgut cytological changes and lysis [11]. Lepidopteran midgut tissues are the site for digestion, secretion, absorption, enzymatic activities, toxin binding and activation, and the center of activity for biopesticide detoxification and resistance development. Several distinct midgut genes have been identified as potentially associated with lepidopteran resistance development to Bt. Candas et al. (2003; [12]) identified changes in numerous midgut epithelial cell proteins in a Plodia interpunctella colony exhibiting resistance to Bt, including decreased chymotrypsin, decreased cell adhesion protein peroxinectin, and increased prophenoloxidase. Soberón et al. (2007; [13]) demonstrated that susceptibility to Bt toxin Cry1Ab was reduced by cadherin gene silencing with RNA interference in Manduca sexta, and suggested cadherin increases Bt toxicity by facilitating toxin oligomerization. A global transcriptome-based analysis of Bt-resistance development in Heliothis virescens by Zhu et al. (2011; [14]) revealed genes potentially related to Bt activation and resistance (proteinases, cadherins, aminopeptidases, and alkaline phosphatases) as well as detoxification (cytochrome P450 oxidases, glutathione S-transferases, esterases, sodium channels, and cytochrome oxidases). In the current study, a global genetic and transcriptome-based analysis of the gypsy moth midgut in response to infection by B. thuringiensis kurstaki was examined using bioinformatics techniques coupled with high throughput next-generation DNA sequencing. An RNA-Seq transcriptome dataset was obtained from synchronized late third instar L. dispar larvae midgut tissues, a set of high-quality gypsy moth gene structures was delineated and functionally annotated, and the transcriptome-level response of gypsy moth midgut tissue to infection by Bt was analyzed. Highly differentially expressed transcripts were further validated by quantitative real-time PCR analyses to gain insight into the global larval gypsy moth response to Bt infection and the larval genes that may be involved in resistance mechanisms or that could serve as targets for improved biopesticides.

Results and Discussion

This study establishes a large collection of high-quality larval L. dispar transcript sequences that should be of considerable value to the lepidopteran community in general and to gypsy moth researchers in particular. The sequence analysis protocol described below in Materials and Methods resulted in 838 distinct L. dispar gold-tier PUTs associated with 732 distinct NR proteins. Similarly, 1,248 silver-tier PUTs associating with 989 distinct NR proteins, and 3,305 bronze-tier PUTs associating with 2,770 distinct NR proteins, were identified. (One or more PUTs may hit to a single NR protein because, for instance, certain subsets of the PUTs may represent transcript isoforms of the same gene.) Eight hundred (≈92.4%) of the 838 gold-tier gene protein translations exhibited a hit in Pfam-A, yielding 1,059 Pfam terms (565 of which were unique) and 1,470 GO terms (408 of which were unique). 11,094 sequences from the complete PUT collection (≈22.4%) exhibited a longest ORF significantly similar to a Pfam protein family per the aforementioned criteria, of which 3,069 were unique. The 25 most abundant Pfam families encountered in the complete PUT dataset is shown in Table 1; those for the high-quality/gold-tier genes are presented in Table S1. 294 GO terms were recovered from the gold-tier gene set, and 1,675 from the comprehensive set of PUTs. (Pfam families can have zero, one, or more associated GO terms). Figure 1 presents the ten most abundant GO terms identified for the high-quality and complete PUT datasets, partitioned into each of the three main GO sub-groupings: Biological Process, Cellular Component and Molecular Function. Figure 2 presents similar information, but using the more abstract GO-Slim terminology. Only GO terms classified under Molecular Function could be mapped to a KEGG entry – for the gold-tier gene set, 7/148 (≈4.7%) of its unique GO terms had an associated KEGG entry, while for the complete PUT collection, this value was 43/492 (≈8.7%). The KEGG entities recovered are exhaustively listed in Table S2, for both gene sets.
Table 1

The 25 most abundant Pfam families encountered in the complete gypsy moth PUT collection.

Pfam Hit Pfam Description Counts
PF00400.25WD domain, G-beta repeat174
PF00069.18Protein kinase domain170
PF00096.19Zinc finger, C2H2 type149
PF00135.21Carboxylesterase138
PF07690.9Major Facilitator Superfamily135
PF00089.19Trypsin119
PF00076.15RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain)106
PF00067.15Cytochrome P450100
PF07679.9Immunoglobulin I-set domain91
PF00083.17Sugar (and other) transporter84
PF00560.26Leucine Rich Repeat83
PF01607.17Chitin binding Peritrophin-A domain75
PF00106.18short chain dehydrogenase74
PF00078.20Reverse transcriptase (RNA-dependent DNA polymerase)65
PF00023.23Ankyrin repeat63
PF00501.21AMP-binding enzyme58
PF00270.22DEAD/DEAH box helicase56
PF00041.14Fibronectin type III domain52
PF00071.15Ras family51
PF00435.14Spectrin repeat50
PF00171.15Aldehyde dehydrogenase family46
PF00271.24Helicase conserved C-terminal domain43
PF07714.10Protein tyrosine kinase42
PF00201.11UDP-glucoronosyl and UDP-glucosyl transferase42
PF00005.20ABC transporter40
Figure 1

GO term abundances (full ontology).

Shown are pie charts presenting the relative abundance of the ten most frequently encountered gene ontology (GO) terms for both the 838 gold-tier PUT subset, as well as the complete 49,613 PUT set. GO terms contribute additional annotation information for inferred gypsy moth genes along three dimensions: biological process, cellular component and molecular function. The GO terms recovered were stratified according to these ontology domains.

Figure 2

GO-Slim term abundances.

Shown are pie charts presenting the relative abundance of the five most frequently encountered GO-Slim terms for both the 838 gold-tier PUT subset, as well as the complete 49,613 PUT set. GO-Slim terms provide a less granular representation of the gene annotation information conveyed by terms from the full GO ontology, thereby offering a higher-level perspective on the functional capacity contained in the Lymantria dispar gene set relative to fully-qualified terms. GO-Slim terms are stratified with respect to the ontology's three primary domains: biological process, cellular component and molecular function.

GO term abundances (full ontology).

Shown are pie charts presenting the relative abundance of the ten most frequently encountered gene ontology (GO) terms for both the 838 gold-tier PUT subset, as well as the complete 49,613 PUT set. GO terms contribute additional annotation information for inferred gypsy moth genes along three dimensions: biological process, cellular component and molecular function. The GO terms recovered were stratified according to these ontology domains.

GO-Slim term abundances.

Shown are pie charts presenting the relative abundance of the five most frequently encountered GO-Slim terms for both the 838 gold-tier PUT subset, as well as the complete 49,613 PUT set. GO-Slim terms provide a less granular representation of the gene annotation information conveyed by terms from the full GO ontology, thereby offering a higher-level perspective on the functional capacity contained in the Lymantria dispar gene set relative to fully-qualified terms. GO-Slim terms are stratified with respect to the ontology's three primary domains: biological process, cellular component and molecular function. Our results are reasonably similar to those of related studies focusing on lepidopteran midgut gene content – including studies on Manduca sexta [15] and Chilo suppressalis [16] – implying that a core set of functional midgut genes operates across the Lepidoptera. For example, on the basis of NCBI NR comparisons, 4 serpin, 14 lectin, 16 lysozyme, 29 cytochrome P450, 50 glutathione S-transferase, 26 heat shock protein, 19 aminopeptidase, and 14 chymotrypsin-like protease PUTs were encountered in the comprehensive PUT collection, which corresponds well with observations made in these earlier studies. Further, cytochrome P450s, gluthathione S-transferases, and several heat shock proteins have been linked to detoxification and stress adaptation and response in other Lepidopteran systems [17]. Non-tissue-specific surveys of gene content in the Lepidoptera have also been performed for Plutella xylostella [18] and Spodoptera exigua [19], which also support the notion of a common gene set across this order. Interestingly, two transcript fragments were identified in the bronze-tier PUT subset which appeared to encode a gypsy moth β-fructofuranosidase gene having high similarity to a gene reported in M. sexta [15], thereby expanding the incidence of this horizontally-acquired bacterial gene among the Lepidoptera to the Lymantria genus. In addition to qualitatively defining a set of reliable gypsy moth gene structures, the RNA-Seq data were quantitatively evaluated to suggest which gold-tier genes were expressed differentially. Table 2 displays the 15 gypsy moth genes exhibiting the greatest differences in RNA-Seq-inferred expression levels upon Bt infection, and an exhaustive listing is available in Table S3. Seventy-eight genes exhibited a greater than five-fold change in expression level. A subset of 20 L. dispar genes was selected for qRT-PCR validation on the basis of these results, as well as on intuitions concerning the possible biological relevancy of other genes not evaluated by digital expression. The resulting qRT-PCR expression profiles for these selected genes are presented in Figure 3. In general, at 24 hours post infection, transcription and binding-related genes exhibited elevated expression levels, while enzymatic-/digestion-related gene expression was strongly inhibited, as was observed with α-amylase, (pancreatic) lipase, carboxypeptidase and chymotrypsin-like protease. The observation of sharply decreased chymotrypsin-like protease expression corresponds well with proteomic-based data reported for Plodia interpunctella [12].
Table 2

The 15 gypsy moth genes having the largest digital gene expression level perturbations following Bacillus thuringiensis infection.

FoldDiff Direction Control Bt midgut NRgene
166.94down4.00E-042.40E-06gb|ABU98614.1| alpha-amylase [Helicoverpa armigera]
103.98down8.31E-047.99E-06gb|ABR88238.1| chymotrypsin-like protease C8 [Heliothis virescens]
98.38down3.93E-053.99E-07ref|NP_001165397.1| C-type lectin 4 [Bombyx mori]
91.88down2.20E-042.40E-06gb|ACB54943.1| lipase [Helicoverpa armigera]
69.57up3.39E-072.36E-05ref|XP_396381.2| PREDICTED: similar to tubulin-specific chaperone a [Apis mellifera]
65.62down4.98E-047.59E-06gb|ACU00133.1| chymotrypsin-like protein precursor [Spodoptera litura]
52.80down8.44E-051.60E-06ref|XP_001662277.1| hypothetical protein AaeL_AAEL012148 [Aedes aegypti]
49.52up6.78E-073.36E-05ref|NP_001037102.1| ovary C/EBPg transcription factor [Bombyx mori]
44.85down1.07E-032.40E-05gb|ACD37364.1| pancreatic lipase 2 [Mamestra configurata]
42.66down6.30E-041.48E-05gb|ACH86113.1| molt-regulating transcription factor HaHR3 [Helicoverpa armigera]
39.63down1.74E-044.39E-06gb|ACX53715.1| aldo-keto reductase [Heliothis virescens]
38.12up1.02E-063.87E-05ref|XP_001604022.1| PREDICTED: hypothetical protein [Nasonia vitripennis]
25.74up2.03E-065.23E-05gb|AAF61949.1|AF236641_1 insulin-related peptide binding protein [Spodoptera
24.91down7.96E-053.20E-06gb|AAS82583.1| midgut carboxypeptidase A1 [Trichoplusia ni]
21.22up1.02E-062.16E-05ref|NP_001040248.1| glycolipid transfer protein [Bombyx mori]

The FoldDiff column indicates the fold change in expression level; Direction whether the gene was up- or down-regulated; Control and Bt midgut provide sample-specific digital expression counts normalized to the total number of reads per sample; and NRgene describes the best homolog identified in the NR protein sequence database for the associated transcript.

Figure 3

qRT-PCR Results.

Quantitative real-time PCR analysis data for 20 select Lymantria dispar midgut genes are presented. Three technical replicates were performed for each of three biological replicates. The height of each box represents the mean average of sample-specific 2-ΔCt values, while associated error bars denote the standard error of the mean. Fold changes are shown in parentheses.

qRT-PCR Results.

Quantitative real-time PCR analysis data for 20 select Lymantria dispar midgut genes are presented. Three technical replicates were performed for each of three biological replicates. The height of each box represents the mean average of sample-specific 2-ΔCt values, while associated error bars denote the standard error of the mean. Fold changes are shown in parentheses. The FoldDiff column indicates the fold change in expression level; Direction whether the gene was up- or down-regulated; Control and Bt midgut provide sample-specific digital expression counts normalized to the total number of reads per sample; and NRgene describes the best homolog identified in the NR protein sequence database for the associated transcript. Interestingly, the L. dispar C-type lectin 4 gene was also highly down-regulated in midgut tissue (-13.2 fold). This gene belongs to a family of calcium-dependent glycoprotein receptors associated with lepidopteran innate immunity [20]; specifically, they bind various cell wall components of invading microorganisms, facilitating nodule formation by hemocytes and subsequent elimination of alien microbial agents via the hemocoel. It would seem that a bacterial infection should promote enhanced, not repressed, levels of immune-related gene products like the gypsy moth C-type lectin 4 gene. The observed down-regulation may indicate that Bt, either directly or indirectly, enhances its entomopathogenicity by means of repressing such immune-related genes as the lectins. Another important lepidopteran midgut-expressed gene is arylphorin, a hexameric storage protein which has a demonstrated role in lepidopteran midgut stem cell proliferation and differentiation [21], was highly down-regulated in L. dispar midgut tissue after Bt infection (-11.1 fold). This was somewhat unexpected given the roles of arylphorin in the midgut. Forcada et al. (1999; [22]) and Martínez-Ramírez et al. (1999; [23]) proposed a Bt resistance mechanism for Heliothis virescens based on an increased capacity to repair damage to the midgut. Loeb et al. (2001; [24]) demonstrated in primary midgut cell cultures that transient exposure of midgut cells to Cry1Ac killed differentiated cells, but subsequently stimulated proliferation and differentiation of stem cells. High levels of arylphorin expression have also been implicated in the development of Bt resistance in Spodoptera exigua [25]. Monomeric subunits of arylphorin were shown to stimulate proliferation of M. sexta midgut stem cells in vitro [26]. The reduced expression of arylphorin in the L. dispar midgut at 24 hours post infection may suggest a generalized response of midgut cells to slough without (yet) stimulating new cells. Genes related to anti-bacterial or anti-pathogen response in L. dispar and other lepidopterans are of high interest and several were up-regulated, among them hemolin-interacting protein and a gloverin precursor, the latter of which was the most abundantly up-regulated gene surveyed in our qRT-PCR panel (+427 fold). The gloverin peptide was originally described from the giant silk moth Hyalophora gloveri [27] as anti-microbial and highly induced in midgut tissue following exposure to Bt. As with many antimicrobial peptides, the transcriptional regulation of gloverin is regulated by interaction with the spätzle-Toll receptor [28]. In Spodoptera exigua, RNAi knockdown of gloverin expression caused developmental retardation and increased susceptibility to Bt [29]. Gloverin expressed in Trichoplusia ni was shown to increase in response to infection with Autographa californica M nucleopolyhedrovirus, though it remains unclear whether this gene may also have anti-viral properties or its increased expression is coincidental [30]. Another interesting gene highly up-regulated in L. dispar midgut after Bt infection is an insulin-related peptide-binding protein (+37.8 fold). This is a putative endocrine factor that may be important in the lepidopteran insulin-signal pathway and immune response, and a similar protein from Bombyx mori (BmIBP2) was recently shown to be up-regulated on infection with cytoplasmic polyhedrosis virus (BmCPV) in midgut but not other tissues [31]. Additional endocrine-associated genes are of interest, including the ovary C/EBP transcription factor up-regulated in Bt-infected gypsy moth midgut (+4.7 fold). In mammalian skin cells these factors are known to regulate wound repair and EGF receptor signaling [32], which suggests a potential role in host defense. The FAR-like gene was also up-regulated in gypsy moth midgut (+3.8). Fatty acyl reductase (FAR) genes are important in the biosynthesis of insect pheromone as shown for the pheromone gland-specific fatty-acyl reductase of the lepidopteran adzuki bean borer, Ostrinia scapulalis [33]. FAR and FAR-like genes may heavily influence the pheromone composition in Lepitoptera and contribute to their pheromone specificity [34]. This gene has been shown to be up-regulated in B. mori at a later developmental stage and prior to eclosion, and could perhaps have functional flexibility, as described by Liénard and Löfstedt (2010; [35]), in promoting phenotypic transitions in the pheromone signaling pathway that is not yet active in the larval gypsy moth; perhaps this gene has functional plasticity which needs to be explored. Several differentially-regulated genes of unknown function may have significant roles in the lepidopteran response to Bt infection. For instance, a number of hypothetical proteins were up-regulated in the present study; one of these, presented in Fig. 3, exhibited limited similarity (53% identity) to a fungal protein involved in the transport of fructose-1,6-bisphosphatase from vesicles to vacuoles for degradation [36]. The observation of a concomitant increase in ubiquitin conjugating enzyme, a gene known to play a key role in protein degradation [37], suggests that Bt infection may elicit an increase in proteolytic activities. The gene sets identified in this study will be of utility to entomologists studying the functional capacity inherent in the L. dispar gene repertoire, and in teasing apart its molecular evolutionary relationships with other lepidopteran species. The quantitative results highlight a variety of larval-stage L. dispar genes that respond within 24 hours to infection by Bacillus thuringiensis. Bt-responsive midgut genes have been discussed in other studies of lepidopteran pests, such as the beet armyworm [25] and tobacco budworm [14], and our similar findings support the shared core set of responsive midgut genes operating across this order (e.g., lectin, cytochrome p450, amylase, chymotrypsin). Interestingly, the general pattern of down-regulation in metabolism-associated genes, and a mixed response among immune-related genes, corresponds very well to observations recently made of a Coleopteran species, Tenebrio molitor, following ingestion of Cry3Aa protoxin [38]. The responses observed for genes such as arylphorin and the FAR-like transcript provide opportunities for further exploring the mechanisms underlying Bt entomopathogenicity and its associated host response. Likewise, genes induced in the infected midgut by multiple classes of pathogens (e.g., insulin-related peptide binding protein and gloverin) represent key lepidopteran genes for further assessment. If they prove to be sufficiently species-specific, these genes could suggest possible targets for RNAi-mediated gene disruption to be used in ongoing biocontrol efforts slowing the spread of this destructive insect pest.

Materials and Methods

Insect Rearing and Dissection

L. dispar egg masses were obtained from the USDA APHIS rearing facility, Otis AFB, Massachusetts. Larvae were hatched and reared on high wheat germ artificial diet [39] in 180-ml plastic cups with paper lids under conditions of 24±1°C, RH 55–60%, L16:D8. All L. dispar larvae used for this study were reared from a single egg mass to reduce variability. Larvae were staged by rearing in groups of approximately 50 individuals in 180-ml plastic cups until head capsule formation was observed signifying entry into the larval moult to 3rd instar. Larvae which entered the third instar at the same time were then reared with 10-15 larvae per plastic cup for 48 h. The synchronous larvae were starved for 24 hrs then orally inoculated by feeding a 3×3 mm diet block containing 108 B. thuringiensis kurstaki spores, obtained from Thuricide®, per ml diet [40], or diet alone for control larvae.

RNA Extraction

Midgut tissues (containing anterior, middle, and posterior regions as well as the peritrophic matrix) from a total of 31 control uninfected and 28 Bt-infected larvae, respectively, were dissected from the larvae at a time point 24 hours post infection, rinsed thoroughly in 1xPBS to remove debris, and pooled. Midgut tissue RNAs were extracted immediately upon dissection and used to generate RNA-seq libraries. RNAs were stored frozen at −80°C. Additional control and Bt-infected midguts were stored in RNAlater™ (Ambion/Life Technologies, Carlsbad, CA) until RNAs were extracted to obtain biological replicates for qRT-PCR. Total RNA for RNA-seq libraries was extracted using the mirVana miRNA Isolation Kit (Ambion/Life Technologies) according to the manufacturer's protocol for total RNA isolation, with homogenization of the midgut tissue performed on a FastPrep 24 homogenizer (MP Biomedicals, Santa Ana, CA).

RNA-seq libraries

RNA quantity and quality was determined using an Agilent RNA 6000 Nano kit on a 2100 Bioanalyzer system (Agilent Technologies, Santa Clara, CA) according to the manufacturer's instructions. Samples for sequencing of mRNA were prepared using the mRNA-Seq Sample Prep Kit (Illumina, San Diego CA) as per the manufacturer's protocol. Library quantity and quality was assessed using an Agilent DNA 1000 kit on a 2100 Bioanalyzer, and these materials were then sequenced using an Illumina GAII instrument (Illumina).

RNA-Seq Data

94,477,033 72 bp reads were generated, available in the NCBI sequence read archive under accession no. SRA058966. Illumina quality scores were transformed to the Phred scoring scale [41], and read sets were cleaned using the FASTX-toolkit (http://hannonlab.cshl.edu/fastx_toolkit): Artifact reads were purged, and terminal spans of bases having Phred scores not more than 20 (corresponding to a 1/100 error rate) were clipped – resultant reads were required to be at least 36 bases long. At least 90% of bases in a read were required to exhibit a Phred score of 21 or higher. Of the remaining reads, all bases having quality scores of 20 or less were masked with the symbol ‘N’. 3,061,431,248 bases were retained from 6,802,346,376 originally sequenced (45% retention). These were aligned against a fruit fly rRNA gene locus – accession no. M21017, comprising 2S, 5.8S, 18S and 28S rRNA sequences[42] – using the Blat program with default parameters [43]. 7,122,506 reads containing 498,351,320 usable bases were retained for transcript assembly and digital gene expression characterization. Table 3 presents some descriptive statistics of these data.
Table 3

Impacts of quality filtering and rRNA depletion on resultant RNA-Seq dataset sizes.

Untreated Midgut Tissue Bt-infected Midgut TissueIndependent Study, Midgut Tissue
123Total45Total67Total
Raw DataBases905,299,9201,156,410,792777,226,0322,838,936,744953,698,2481,059,143,0402,012,841,288857,218,0321,093,350,3121,950,568,344
Reads12,573,61016,061,26110,794,80639,429,67713,245,80914,710,32027,956,12911,905,80615,185,42127,091,227
Quality ControlBases550,084,462368,559,364366,374,0291,285,017,855518,769,000458,765,055977,534,055433,011,464365,867,874798,879,338
Reads7,803,2035,455,3525,300,52418,559,0797,424,3246,625,47914,049,8036,263,6015,412,99211,676,593
rRNA DepletionBases106,726,80638,345,09361,164,946206,236,845106,625,97469,809,845176,435,81975,353,13540,325,521115,678,656
Reads1,507,602563,277880,9222,951,8011,507,789995,7102,503,4991,076,817590,3891,667,206

Sequence Analysis

To create a highly-reliable set of gypsy moth genes (“gold tier” – see below), all RNA-Seq samples were pooled and globally assembled using the Velvet/Oases short read assembler suite [44] with its hash length parameter set to 23. This generated a set of 49,613 putatively unique transcripts (PUTs). The assembly exhibited an N50 of 353 bp, with 41,352; 11,798 and 6,362 PUTs being not less than 100; 300 and 450 bp in length, respectively. Known Dipteran repetitive elements were purged using RepeatMasker [45], and the residual PUTs were aligned to the NCBI NR database using Blastx with default parameter settings [46]. Blastx results were parsed under varying levels of stringency to establish three distinct tiers of homology-inferred gene accuracy quality: gold, silver and bronze. The gold-tier gene set was defined per the following criteria. A Blastx hit had to consist of a single high-scoring segment pair (HSP), so as to minimize consideration of PUTs having potentially significant internal mis-assemblies; a PUT had to be at least 300 bases in length; the alignment's subject sequence (i.e., an NR protein) had to be at least 100 amino acid residues in length; at least 75% of aligned residues had to be positively similar; and the ratio of hit length to subject sequence length had to be at least 90%. Only the top-scoring hit per PUT was considered. End sequences from PUTs that were not incorporated into Blastx-derived alignments were trimmed and resultant CDSs were translated into high-quality gypsy moth protein sequences using the EMBOSS package's Transeq utility [47]. PUTs of the silver-tier gene class had to be at least 100 nucleotides in length and exhibit a hit length covering at least 75% of the NR subject sequence's length; PUTs of the bronze-tier class were required to be at least 100 nucleotides long and have a hit covering at least 30% of the NR protein's length. These represent mutually disjoint sets in that a gold-tier PUT is not multiply listed among those in the silver- or bronze-tier, and silver-tier PUTs are likewise not listed at the bronze level.

Functional annotation of genes

Gold-tier protein sequences were analyzed against Pfam-A – the manually curated subset of Pfam [48] – using hmmscan [49] with an E-value cutoff of 1E-2. The top hit for the PUT (provided any satisfied the hmmscan inclusion threshold criteria) was selected and used to annotate the gypsy moth protein. In addition, the complete set of PUTs was also assessed for its protein family composition profile: Each assembled transcript was translated in six frames using Transeq, and the longest ORF that resulted in this set was selected for querying Pfam-A. (Where two or more longest ORFs for a given PUT occurred, one was selected arbitrarily.) To provide additional functional information, Pfam families encountered in these datasets were then mapped to Gene Ontology (GO) terms using the January 10, 2011 version of the pfam2go table [50]. To provide a yet more abstract representation of the functional capacity implicit in the gypsy moth transcriptome, each (fine-grained) term's penultimate ancestor was retrieved by traversing the GO directed acyclic graph using the “is_a” relationships embedded in the ontology definition – this was repeated for each of the three main categories of GO terms. Penultimate ancestors, herein referred to as “GO-Slim terms,” only were considered because ultimate ancestors would resolve to one of the three main categorizations, and would be uninformative. Version 1.9 of the kegg2go table was obtained from the Gene Ontology web site (http://www.geneontology.org) and used to map fine-grained GO terms onto Reaction records from the KEGG Ligand database [51].

Sample-specific gene expression quantification

To identify any putatively up- or down-regulated genes among the gold-tier gene set described above, the unassembled read data were pooled into two groups: uninfected L. dispar control (2,951,801 reads) and B. thuringiensis- infected L. dispar (2,503,499; see Table 3). These were aligned to trimmed gold-tier PUT sequences using Blat. A read was considered as emanating from a particular PUT if and only if at least 95% of its length aligned with perfect sequence identity. A read was associated with at most one PUT, being that having the highest-quality Blat alignment – where multiple such best hits occurred, one among the competing PUTs was arbitrarily selected. Counts emanating from multiple PUTs associated with the same NR protein were accumulated to that protein. Digital expression counts were normalized relative to the total number of reads pooled within the sample.

Quantitative real-time PCR (qRT-PCR) primer design and validation of L. dispar larval midgut gene expression

qRT-PCR analysis was conducted to validate key midgut genes of interest, which were identified either by their exhibition of relatively large expression fold-changes from digital expression analyses, or by intuition about potential gene functions. PrimerPlex 2.61 (PREMIER Biosoft, Palo Alto, CA) was used to design primers for SYBR-Green experiments using PUT template sequences (see Table S4 for primers used). qRT-PCR was conducted using three biological samples: the first replicate utilized the RNA from which the RNA-Seq library was produced, as well as two additional biological replicates that were obtained in the same infection experiment and stored in RNALater. An ABI 7500 Real Time PCR System (Applied Biosystems, Carlsbad CA) was used. For each replicate, first strand cDNA was synthesized from 1–5 ug RNA using Superscript Reverse Transcriptase II (Invitrogen/Life Technologies, Carlsbad, CA). Each qPCR reaction consisted of 6.25 ul of Power SYBR Green PCR Master Mix (Applied Biosystems/Life Technologies, Carlsbad, CA), 50 ng of diluted cDNA and 1 uM of each primer in a total volume of 12.5 ul. Reactions were performed in triplicate to ensure consistent technical replication and run in 96-well plates under the following conditions: 50°C for 2 min, 95°C for 10 min, and 40 cycles of 95°C for 15 sec and 60°C for 1 min. Melting curves (60°C to 95°C) were derived for each reaction to ensure a single product. Relative gene expression was evaluated with DataAssist Software version 3.0 (Applied Biosystems/Life Technologies), using L. dispar 18 s rRNA and the elongation factor-Tu gene as endogenous controls for RNA load and gene expression in analyses. (DOCX) Click here for additional data file. (DOCX) Click here for additional data file. (XLSX) Click here for additional data file. (XLSX) Click here for additional data file.
  45 in total

1.  LIGAND: database of chemical compounds and reactions in biological pathways.

Authors:  Susumu Goto; Yasushi Okuno; Masahiro Hattori; Takaaki Nishioka; Minoru Kanehisa
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

2.  BLAT--the BLAST-like alignment tool.

Authors:  W James Kent
Journal:  Genome Res       Date:  2002-04       Impact factor: 9.043

Review 3.  The ubiquitin-proteasome system.

Authors:  Dipankar Nandi; Pankaj Tahiliani; Anujith Kumar; Dilip Chandu
Journal:  J Biosci       Date:  2006-03       Impact factor: 1.826

4.  Resistance to bacillus thuringiensis Cry1Ac toxin in three strains of heliothis virescens: proteolytic and SEM study of the larval midgut

Authors: 
Journal:  Arch Insect Biochem Physiol       Date:  1999-09       Impact factor: 1.698

5.  Insect resistance to Bacillus thuringiensis: alterations in the indianmeal moth larval gut proteome.

Authors:  Mehmet Candas; Olga Loseva; Brenda Oppert; Pradeepa Kosaraju; Lee A Bulla
Journal:  Mol Cell Proteomics       Date:  2003-01       Impact factor: 5.911

Review 6.  Bacillus thuringiensis and its pesticidal crystal proteins.

Authors:  E Schnepf; N Crickmore; J Van Rie; D Lereclus; J Baum; J Feitelson; D R Zeigler; D H Dean
Journal:  Microbiol Mol Biol Rev       Date:  1998-09       Impact factor: 11.056

7.  Constitutive activation of the midgut response to Bacillus thuringiensis in Bt-resistant Spodoptera exigua.

Authors:  Patricia Hernández-Martínez; Gloria Navarro-Cerrillo; Silvia Caccia; Ruud A de Maagd; William J Moar; Juan Ferré; Baltasar Escriche; Salvador Herrero
Journal:  PLoS One       Date:  2010-09-17       Impact factor: 3.240

8.  Transcriptome profiling of the intoxication response of Tenebrio molitor larvae to Bacillus thuringiensis Cry3Aa protoxin.

Authors:  Brenda Oppert; Scot E Dowd; Pascal Bouffard; Lewyn Li; Ana Conesa; Marcé D Lorenzen; Michelle Toutges; Jeremy Marshall; Diana L Huestis; Jeff Fabrick; Cris Oppert; Juan Luis Jurat-Fuentes
Journal:  PLoS One       Date:  2012-04-25       Impact factor: 3.240

9.  The Pfam protein families database.

Authors:  Marco Punta; Penny C Coggill; Ruth Y Eberhardt; Jaina Mistry; John Tate; Chris Boursnell; Ningze Pang; Kristoffer Forslund; Goran Ceric; Jody Clements; Andreas Heger; Liisa Holm; Erik L L Sonnhammer; Sean R Eddy; Alex Bateman; Robert D Finn
Journal:  Nucleic Acids Res       Date:  2011-11-29       Impact factor: 16.971

10.  Vid24p, a novel protein localized to the fructose-1, 6-bisphosphatase-containing vesicles, regulates targeting of fructose-1,6-bisphosphatase from the vesicles to the vacuole for degradation.

Authors:  M C Chiang; H L Chiang
Journal:  J Cell Biol       Date:  1998-03-23       Impact factor: 10.539

View more
  23 in total

Review 1.  Intestinal regeneration as an insect resistance mechanism to entomopathogenic bacteria.

Authors:  Anaïs Castagnola; Juan Luis Jurat-Fuentes
Journal:  Curr Opin Insect Sci       Date:  2016-04-20       Impact factor: 5.186

2.  Transcriptome Sequencing of the Striped Cucumber Beetle, Acalymma vittatum (F.), Reveals Numerous Sex-Specific Transcripts and Xenobiotic Detoxification Genes.

Authors:  Michael E Sparks; David R Nelson; Ariela I Haber; Donald C Weber; Robert L Harrison
Journal:  BioTech (Basel)       Date:  2020-10-27

3.  Arbovirus Infections As Screening Tools for the Identification of Viral Immunomodulators and Host Antiviral Factors.

Authors:  Emily A Rex; Dahee Seo; Don B Gammon
Journal:  J Vis Exp       Date:  2018-09-13       Impact factor: 1.355

Review 4.  Response Mechanisms of Invertebrates to Bacillus thuringiensis and Its Pesticidal Proteins.

Authors:  Daniel Pinos; Ascensión Andrés-Garrido; Juan Ferré; Patricia Hernández-Martínez
Journal:  Microbiol Mol Biol Rev       Date:  2021-01-27       Impact factor: 11.056

5.  Dissimilar Regulation of Antimicrobial Proteins in the Midgut of Spodoptera exigua Larvae Challenged with Bacillus thuringiensis Toxins or Baculovirus.

Authors:  Cristina M Crava; Agata K Jakubowska; Baltasar Escriche; Salvador Herrero; Yolanda Bel
Journal:  PLoS One       Date:  2015-05-18       Impact factor: 3.240

6.  Wolbachia infection modifies the profile, shuttling and structure of microRNAs in a mosquito cell line.

Authors:  Jaime G Mayoral; Kayvan Etebari; Mazhar Hussain; Alexander A Khromykh; Sassan Asgari
Journal:  PLoS One       Date:  2014-04-23       Impact factor: 3.240

7.  Transcriptome of the invasive brown marmorated stink bug, Halyomorpha halys (Stål) (Heteroptera: Pentatomidae).

Authors:  Michael E Sparks; Kent S Shelby; Daniel Kuhar; Dawn E Gundersen-Rindal
Journal:  PLoS One       Date:  2014-11-11       Impact factor: 3.240

8.  Effect of midgut proteolytic activity on susceptibility of lepidopteran larvae to Bacillus thuringiensis subsp. Kurstaki.

Authors:  Reza Talaei-Hassanloui; Raziyeh Bakhshaei; Vahid Hosseininaveh; Ayda Khorramnezhad
Journal:  Front Physiol       Date:  2014-01-16       Impact factor: 4.566

9.  Multi-method approach for characterizing the interaction between Fusarium verticillioides and Bacillus thuringiensis subsp. Kurstaki.

Authors:  Liliana O Rocha; Sabina Moser Tralamazza; Gabriela M Reis; Leon Rabinovitch; Cynara B Barbosa; Benedito Corrêa
Journal:  PLoS One       Date:  2014-04-16       Impact factor: 3.240

10.  A single vertebrate DNA virus protein disarms invertebrate immunity to RNA virus infection.

Authors:  Don B Gammon; Sophie Duraffour; Daniel K Rozelle; Heidi Hehnly; Rita Sharma; Michael E Sparks; Cara C West; Ying Chen; James J Moresco; Graciela Andrei; John H Connor; Darryl Conte; Dawn E Gundersen-Rindal; William L Marshall; John R Yates; Neal Silverman; Craig C Mello
Journal:  Elife       Date:  2014-06-25       Impact factor: 8.140

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.