Literature DB >> 25280771

Transcriptome analysis of Gossypium hirsutum flower buds infested by cotton boll weevil (Anthonomus grandis) larvae.

Sinara Artico, Marcelo Ribeiro-Alves, Osmundo Brilhante Oliveira-Neto, Leonardo Lima Pepino de Macedo, Sylvia Silveira, Maria Fátima Grossi-de-Sa, Adriana Pinheiro Martinelli, Marcio Alves-Ferreira1.   

Abstract

BACKGROUND: Cotton is a major fibre crop grown worldwide that suffers extensive damage from chewing insects, including the cotton boll weevil larvae (Anthonomus grandis). Transcriptome analysis was performed to understand the molecular interactions between Gossypium hirsutum L. and cotton boll weevil larvae. The Illumina HiSeq 2000 platform was used to sequence the transcriptome of cotton flower buds infested with boll weevil larvae.
RESULTS: The analysis generated a total of 327,489,418 sequence reads that were aligned to the G. hirsutum reference transcriptome. The total number of expressed genes was over 21,697 per sample with an average length of 1,063 bp. The DEGseq analysis identified 443 differentially expressed genes (DEG) in cotton flower buds infected with boll weevil larvae. Among them, 402 (90.7%) were up-regulated, 41 (9.3%) were down-regulated and 432 (97.5%) were identified as orthologues of A. thaliana genes using Blastx. Mapman analysis of DEG indicated that many genes were involved in the biotic stress response spanning a range of functions, from a gene encoding a receptor-like kinase to genes involved in triggering defensive responses such as MAPK, transcription factors (WRKY and ERF) and signalling by ethylene (ET) and jasmonic acid (JA) hormones. Furthermore, the spatial expression pattern of 32 of the genes responsive to boll weevil larvae feeding was determined by "in situ" qPCR analysis from RNA isolated from two flower structures, the stamen and the carpel, by laser microdissection (LMD).
CONCLUSION: A large number of cotton transcripts were significantly altered upon infestation by larvae. Among the changes in gene expression, we highlighted the transcription of receptors/sensors that recognise chitin or insect oral secretions; the altered regulation of transcripts encoding enzymes related to kinase cascades, transcription factors, Ca2+ influxes, and reactive oxygen species; and the modulation of transcripts encoding enzymes from phytohormone signalling pathways. These data will aid in the selection of target genes to genetically engineer cotton to control the cotton boll weevil.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 25280771      PMCID: PMC4234063          DOI: 10.1186/1471-2164-15-854

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

Cotton is a fibre and oil-yielding crop grown worldwide. Four species of cotton are generally cultivated [1]; however, Gossypium hirsutum L. contributes the most to the total lint cotton production worldwide [2]. Cotton productivity is severely affected by both biotic and abiotic stresses [3]. Approximately 1326 species of insects have been reported to attack cotton plants. Among these species, the aphid (Aphis gossypii G.), the fall armyworm (Spodoptera frugiperda), the budworm (Heliothis virescens), the cotton bollworm (Helicoverpa armigera) and the cotton boll weevil (Anthonomus grandis) are the major pests affecting cotton culture [4]. The cotton boll weevil is undoubtedly the most devastating pest [5, 6]. The adult female feeds, ovoposits, and develops primarily in cotton flower buds and fruits. After hatching, the larvae remain within the reproductive structures and use them as a food sources and a protected habitat to complete their life cycle. The endophytic behaviour of the larvae has made these insects difficult to control with conventional insecticides and other cultural practices [7]. Plants have evolved elaborate defence systems to respond in a rapid and effective way to herbivorous insects. Numerous studies have revealed that, in addition to the constitutive defences comprised of trichomes, thick secondary cell walls, and toxic compounds, plants are equipped with inducible defences that can be grouped into indirect responses, such as the production of volatile odour blends to attract natural enemies of the attacking insect, and direct responses, such as the production of anti-digestive proteins, toxic secondary compounds, and enzymes that affect insect growth and development [8]. These two powerful defence systems evolved by plants during the long arms race with herbivores have enabled plants to survive [9]. An appropriate defence response to a biotic threat requires initial recognition. Herbivores or pathogens are recognised when conserved patterns of molecules, called herbivore- or pathogen-associated molecular patterns, HAMP or PAMP (such as chitin or flagellin) are detected by pattern recognition receptors (PRR) on the surface of the host plant cell, leading to HAMP-triggered immunity (HTI). Damage-associated molecular patterns (DAMP), which are endogenous molecules produced by the plant after infection, are also recognised by PRR to trigger defensive reactions [10, 11]. Following the recognition of an attacker, plants use different signalling cascades to reprogram their phenotype. These include an interconnected network of signal transduction pathways depending mainly on the small regulators jasmonic acid (JA), salicylic acid (SA), and ethylene (ET), concomitantly with Ca+2 ion fluxes, mitogen-activated protein kinases (MAPK), transcription factors and reactive oxygen species (ROS) [12]. Crop losses due to insects constitute one of the most significant constraints to the increase of global productivity and food production, estimated at 10-20% for major crops [13]. A better understanding of the diversity of plant responses to insect attacks and, in particular, the induced defences and their regulation, has generated interest in the scientific community to examine alternative strategies to protect plants and crops from insects pests by exploiting the endogenous resistance mechanisms exhibited by plants to most herbivorous insects. Recent transcriptomics studies of plants exposed to herbivory identified a central role for transcripts that can lead to the development of insect-resistant crops [4, 14–18]. High-throughput sequencing of RNA using next-generation sequencing platforms (RNA-Seq) offers a variety of new possibilities such as the transcriptional profiling of organisms lacking sequence information [19], as well as the identification of novel loci, alternative splicing events [20], and sequence variation [21]. Thus, we decided to study the molecular responses of G. hirsutum flower buds to infestation by cotton boll weevil (A. grandis) larvae using the Illumina HiSeq™ 2000 platform. Furthermore, we combined this approach with laser microdissection (LMD) to isolate RNA from two different regions of tissue damaged by feeding larvae to evaluate the spatial expression pattern of the differentially expressed genes in response to feeding by cotton boll weevil larvae.

Results

Analyses of RNA-seq data

To explore the response of G. hirsutum to tissue-chewing pests such as cotton boll weevil larvae, the flower bud transcriptome of infested plants was compared with control plants (non-inoculated flower buds). Two biological replicates for each condition were selected for transcriptome analyses with high-throughput parallel sequencing using HiSeq™ 2000, Illumina. We generated 327,489,418 sequence reads, and each sample was represented by at least 74 million reads, a tag density sufficient for quantitative analysis of gene expression (Table 1) [22, 23]. The correlation between the two biological replicates was high (0.99 for infested flower buds and 0.98 for control samples; data not shown).
Table 1

Summary of sequencing data output, statistical analysis of the reads obtained and mapping of the reads onto the cotton ( ) transcriptome

Total reads1 Mapped reads2 % of totalOMM3 Expressed genesYield_Gbases
Sample 321,7268,56
Exon85,575,32836,222,89142.3318,153,201
Not mapped85,575,32847,769,76255.82
Sample 521,7197,43
Exon74,271,97832,110,31343.2315,672,772
Not mapped74,271,97840,654,74254.74
Sample 721,6977,69
Exon76,936,85430,018,66039.0214,888,504
Not mapped76,936,85445,549,86959.2
Sample 821,7329,07
Exon90,705,25838,184,75942.118,824,230
Not mapped90,705,25850,726,08355.92

1Total number of reads mapped onto the G. hirsutum transcriptome.

2Percentage of reads mapped onto the G. hirsutum transcriptome.

3Number of perfect matches to the reference sequence.

Summary of sequencing data output, statistical analysis of the reads obtained and mapping of the reads onto the cotton ( ) transcriptome 1Total number of reads mapped onto the G. hirsutum transcriptome. 2Percentage of reads mapped onto the G. hirsutum transcriptome. 3Number of perfect matches to the reference sequence. The sequence reads were aligned to the G. hirsutum reference transcriptome (cotton EST database) using the BWA package, an efficient engine when searching for perfect matches [24]. Among the total number of reads, 39–43.2% were confined to exons, and 67,538,707 were perfect matches (OMM) to the reference sequence (Table 1). The total number of expressed genes (contigs) was higher than 21,697 per sample (Table 1), and 21,561 expressed genes were common to all samples analysed (data not shown). The average length of contigs generated was 1,063 bp (Additional file 1).

Overview of the changes in gene expression in response to feeding by cotton boll weevil larvae

The quantitative profiling of the transcriptome using DEGseq (R-bioconductor) analysis identified 443 differentially expressed genes (DEG) in cotton flower buds infested with cotton boll weevil larvae: 402 of them (90.7%) were up-regulated, and 41 (9.3%) were down-regulated compared to the control (adjusted p-value ≤ 0.05, |log FC| ≥ 2.0). Among these DEG, 432 (97.5%) were identified as orthologues of A. thaliana genes by Blastx with an e-value of 10−5 (Additional file 2). To examine the range of genes involved in the response of cotton flower buds to inoculation with A. grandis larvae, the Blast2GO program was also used to confirm the annotation of differentially expressed transcripts (Additional file 2). Blast2GO software returned functions for 87.3% of the differentially expressed genes from species with greater Blast hit distributions that included Vitis vinifera, Glycine max, Populus trichocarpa, and Threobroma cacao. Of the DEG, 76% had the same functional annotation between Blastx with Arabidopsis and the Blast2GO analysis (Additional file 2). To determine which genes and pathways were relevant responses to cotton boll weevil larvae feeding, a gene set enrichment analysis (GSEA) approach was used. Based on GSEA, a hypergeometric test (p-values ≤ 0.005) was applied to identify which cellular components (CC), molecular functions (MF) and biological processes (BP) were overrepresented in our list of DEG. GSEA revealed many DEG putatively associated with the plasma membrane and cell wall in the cellular component category (Additional file 3). Most of the genes encoding plasma membrane proteins are receptor-like kinases (RLK), which are known to be involved in the perception of pathogen-derived elicitors. All RLK genes are up-regulated in infested plants in comparison to control plants (Additional file 4). Among the genes associated with the cell wall, there are up-regulated transcripts encoding a disease resistance protein (LRR) and cell wall-modifying enzymes, such as pectin methylesterase (PME41) and endotransglycosylase/hydrolase proteins (XTH31, XTH32, XTH23 and TCH4) (Additional file 5). Many of the down-regulated genes encode cytosolic heat shock proteins such as HSP90 and HSP70 (Additional file 5). These molecular chaperones assist in folding newly synthesised proteins and also in several other biological and cellular processes, such as cell growth, development and signal transduction during abiotic and biotic stress [25]. With regard to molecular functions, the DEGs were mainly associated with sequence-specific DNA binding transcription factor activity (Table 2), calmodulin binding and calcium ion binding (Additional file 3).
Table 2

The expression of 88 Gossypium transcription factor (TF) genes in response to cotton boll weevil larvae feeding in flower buds

Gene familyContig G. hirsutumGene symbolTAIR codeE-valueLog fold changeAdjusted p-value
Transcription factors
WRKY domain transcription factor
contig8076 ATWRKY40AT1g8084004.980
contig1646ATWRKY40AT1g8084004.290
contig4254ATWRKY40AT1g8084052.762.456.94715156272822e-13
contig313ATWRKY40AT1g8084045.812.271.59891623026382e-11
contig5359 ATWRKY53AT4g2381005.370
contig9783 ATWRKY70AT3g5640004.751.14115253194648e-13
contig3807ATWRKY70AT3g5640037.272.265.24201769462223e-11
contig3808ATWRKY70AT3g5640034.522.243.54998511396895e-06
contig4599ATWRKY70AT3g5640040.272.036.30528430834368e-05
contig4087 ATWRKY46AT2g4640004.371.76819207423999e-12
contig4086ATWRKY46AT2g4640037.033.50
contig9296ATWRKY46AT2g4640030.992.888.66053260852238e-14
contig1954 ATWRKY26AT5g0710048.262.614.46309655899313e-14
contig4726 ATWRKY41AT4g1107004.070
contig2606 ATWRKY33AT2g3847051.13.360
contig2607ATWRKY33AT2g3847053.333.141.00441679956828e-13
contig1952ATWRKY33AT2g3847056.582.563.0123591681817e-14
contig1953ATWRKY33AT2g3847044.672.432.45606573427175e-11
contig16334 ATWRKY72AT5g1513044.752.751.43400565479266e-05
contig4409 AtWRKY22AT4g0125065.222.561.56843040084616e-11
AP2/ERE transcription factor
contig4498DREB1DAT5g5199004.880
contig8299RRTF1AT4g3441004.851.39364892550935e-13
contig5510 ATERF-5AT5g4723004.70
contig5512ATERF-5AT5g4723004.470
contig5102AT1g6304003.920
contig5511ATERF-5AT5g4723003.920
contig18592CBF4AT5g5199003.918.25514597578304e-13
contig16446ATERF-5AT5g4723003.720
contig18443DREB26AT1g2191003.70
contig22401ATERF-5AT5g4723003.531.03395493612283e-10
contig23741 ATERF98AT3g2323061.43.451.35043960387988e-09
contig5014ATERF-5AT5g4723044.143.170
contig21166AT1g33760643.115.29476229612506e-07
contig5559 RAP2.5AT3g1521072.293.050
contig13018ATERF-5AT5g4723043.892.854.47408363363908e-11
contig23743ATERF-9AT5g4421090.742.781.53477793062422e-14
contig11060ABR1AT5g6475055.862.697.42624465574457e-07
contig3404ATERF-9AT5g4421047.162.630
contig3403ATERF-9AT5g4421049.512.262.16279246764181e-11
contig11909AT5g6189044.072.260
contig2076ATEBPAT3g1677048.212.235.79122872488872e-11
contig6750RAP2.5AT3g1521052.262.211.09859031453532e-10
contig21935TINY2AT5g1159067.982.190
C2H2 zinc-finger
contig21780AT3g4993003.962.21211001647301e-09
contig3187ZAT10AT1g2773003.640
contig567ZAT10AT1g2773003.635.96999381147476e-12
contig22821AT3g4993042.783.390
contig562STZAT1g2773048.183.380
contig35953.277.24878747340025e-05
contig563STZAT1g2773046.853.260
contig566STZAT1g2773064.133.25.89909045523327e-05
contig564STZAT1g2773048.122.954.46339498962764e-08
contig569STZAT1g2773046.672.419.59875697444564e-12
contig14282AT2g2871053.082.342.17861060813777e-07
contig7249AT3g4607049.382.290
C3H zinc-finger
contig722ATSZF2AT2g4014003.540
contig729CZF1AT2g4014053.882.950
contig719CZF1AT2g4014062.382.790
contig717CZF1AT2g4014052.392.592.07852782604537e-13
contig720ATSZF1AT3g5598056.623.191.77734818473086e-08
contig728AT3g55980502.380.01
MYB domain transcription factor
contig11515ATMYB2AT2g4719004.311.53477793062422e-14
contig22866ATMYB73AT4g3726003.770
contig19903ATMYB73AT4g3726076.923.241.10170914685033e-09
contig11954ATMYB78AT5g4962047.72.980
contig5383ATMYB73AT4g3726077.882.780
contig11235ATMYB73AT4g3726046.842.731.09623765025238e-08
GRAS transcription factor
contig3642SCL5AT1g5060062.533.240
contig18482ATGRAS2AT1g0753058.333.243.57762043883002e-10
contig3507AT3g4660049.912.730
contig3557PAT1AT5g4815057.062.720
contig4834PAT1AT5g4815063.442.042.21211001647301e-09
NAC domain transcription factor
contig2875ANAC002AT1g0172059.863.040
contig718ANAC002AT1g0172066.982.820
contig725ANAC002AT1g0172075.682.051.37196620183569e-09
contig4025NTL9AT4g3558072.482.453.87976777100843e-08
contig4026CBNACAT4g3558059.242.374.87321507933394e-12
contig14400ANAC083AT5g1318040.792.261.92653534804078e-07
bHLH, Basic helix-Loop-helix
contig8902AT4g2097004.790
contig12411AT2g2275004.151.53477793062422e-14
contig12410AT2g2276003.538.73481683691145e-06
contig21436AT3g0734041.013.154.49986558503901e-05
contig24140AT5g57150482.833.38680046143333e-13
contig23410AT3g0734052.342.698.59443189061066e-06
contig24822BANQUO 1AT5g3986067.952.652.26903319960744e-10
contig10541AT1g1012070.412.229.34996779453853e-11
contig5829AT4g2097038.852.220
HSF, Heat-shock transcription factor
contig1779ATHSFA2AT2g2615050.39−3.290

Genes in bold were tested by qPCR.

The expression of 88 Gossypium transcription factor (TF) genes in response to cotton boll weevil larvae feeding in flower buds Genes in bold were tested by qPCR. Furthermore, GSEA revealed significant biological processes altered in plants upon infection with cotton boll weevil larvae. We found that biological processes related to hormone biosynthesis and signalling, the response to organic substances, the regulation of biological processes, the detection of biotic stimuli, systemic acquired resistance, the respiratory burst involved in the defence response and innate immune responses were overrepresented by GSEA (Figure 1). Moreover, biological processes associated with defence against insects, such as the response to chitin, the regulation of the plant-type hypersensitive response (HR), the regulation of programmed cell death and death were also represented (Figure 1). One hundred thirty-four transcripts were found in the “response to chitin” category (GO:0010200), and 36 of them were also annotated with the gene ontology (GO) term “death” (GO:0016265) (Table 3). Of these, many transcripts encoded proteins involved in signal transduction such as receptor-like kinases, mitogen-activated protein kinases, calcium ion binding proteins and calmodulin-like proteins. In addition to genes associated with signal transduction, genes associated with hormone biosynthesis and the 26S proteasome pathway were annotated with the GO term “response to chitin and death” (Additional file 6).
Figure 1

Distribution of differentially expressed genes (DEGs) (x-axis) into Gene Ontology (GO) categories (biological process) (y-axis) according to Gene Set Enrichment Analysis (GSEA). Only biological processes (BPs) discussed in the results are presented here. A complete list of BPs can be found in Additional file 6.

Table 3

Subset of differentially expressed genes (DEG) in cotton flower buds in response to feeding by cotton boll weevil larvae

Contigs cottonGene symbolTAIR annotationSubjectIDlogFold changeAdjusted P-value
GO:0010200 - Response to chitin
contig7179*Protein kinase family protein with leucine-rich repeat domainAT5g259302.025.33E-09
contig14965Kin3Encodes a putative serine/threonine-specific protein kinaseAT2g172202.031.20E-08
contig21694Protein kinase superfamilyAT1g183902.655.45E-10
contig2968CPK28Member of Calcium Dependent Protein KinaseAT5g662102.470.01
contig17371 CCR4Serine/threonine-protein kinase-like domainAT5g478503.510
contig2100MAPK3Encodes a mitogen-activated kinaseAT5g575104.278.61E-07
contig2102 MAPK3Encodes a mitogen-activated kinaseAT3g456402.172.57E-10
contig14909 MKK9Member of MAP Kinase Kinase familyAT1g735002.221.58E-10
contig7892MAPKKK14Member of MEKK subfamilyAT2g300402.964.79E-11
contig4523ATPase E1-E2 type family protein/haloacid dehalogenase-like hydrolase family proteinAT3g633803.30
contig11388ATPase E1-E2 type family protein/haloacid dehalogenase-like hydrolase family proteinAT3g633802.27.24E-11
contig18754ATPase E1-E2 type family protein/haloacid dehalogenase-like hydrolase family proteinAT3g633802.153.53E-09
contig7822CML38Calmodulin-like 38 (CML38)AT1g766504.871.56E-06
contig7823 CML38Calmodulin-like 38 (CML38)AT1g766503.320
contig535 CML24Encodes a protein with 40% similarity to calmodulinAT5g377703.480
contig2687C2 calcium-dependent membrane targetingAT2g254604.020
contig12364Calcium-binding EF-hand family proteinAT4g272804.10
contig2688C2 calcium-dependent membrane targetingAT2g254603.891.72E-11
contig14266*ACA2Encodes a calmodulin-regulated Ca(2+)-pump located in the endoplasmic reticulumAT4g376402.172.26E-10
contig2595*ACA2Encodes a calmodulin-regulated Ca(2+)-pump located in the endoplasmic reticulumAT4g376402.157.16E-10
contig8294* EDA39Encodes a calmodulin-binding protein involved in stomatal movementAT4g330502.613.93E-12
contig1798PUMP5Encodes one of the mitochondrial dicarboxylate carriers (DIC)AT2g225006.130
contig9093*Encodes a cell wall bound peroxidaseAT5g641203.764.46E-14
contig9094*Encodes a cell wall bound peroxidaseAT5g641203.420
contig8534*DMR6Encodes a putative 2OG-Fe(II) oxygenaseAT5g245302.251.53E-06
contig1723*Protein phosphatase 2C family proteinAT4g339203.077.56E-13
contig1722*Protein phosphatase 2C family proteinAT4g339202.71.53E-14
contig1720*Protein phosphatase 2C family proteinAT4g339202.324.51E-07
contig8378Acyl-CoA N-acyltransferases (NAT) superfamily proteinAT2g320304.480
contig24379*PLA/PLA2AEncodes a lipid acyl hydrolaseAT2g265603.150
contig11992Alpha/beta-Hydrolases superfamily proteinAT3g199703.131.89E-10
contig1674TCH4Encodes a cell wall-modifying enzymeAT5g575605.849.62E-12
contig2539TCH4Encodes a cell wall-modifying enzymeAT5g575605.570
contig1676TCH4Encodes a cell wall-modifying enzymeAT5g575604.660
contig2101TCH4Encodes a cell wall-modifying enzymeAT5g575604.630
contig2538TCH4Encodes a cell wall-modifying enzymeAT5g575603.728.85E-13
contig2536TCH4Encodes a cell wall-modifying enzymeAT5g575603.221.27E-13
contig4524TCH4Encodes a cell wall-modifying enzymeAT5g575605.530
contig9042RRTF1Member of the ERF (ethylene response factor) of ERF/AP2 transcription factor familyAT4g3441037.970
contig8299RRTF1Member of the ERF (ethylene response factor) of ERF/AP2 transcription factor familyAT4g344104.851.39E-13
contig8860Member of the DREB subfamily A-5 of ERF/AP2 transcription factor familyAT1g192109.530
contig23571Member of the DREB subfamily A-5 of ERF/AP2 transcription factor familyAT1g192107.290
contig24042Member of the DREB subfamily A-5 of ERF/AP2 transcription factor familyAT1g192106.350
contig13134Member of the DREB subfamily A-5 of ERF/AP2 transcription factor familyAT5g511905.250
contig19639Member of the DREB subfamily A-5 of ERF/AP2 transcription factor familyAT1g192105.010
contig23741 ERF98Member of the ERF (ethylene response factor)AT3g232303.451.35E-09
contig5510 ERF-5Member of the ERF (ethylene response factor)AT5g472304.70
contig5512ERF-5Member of the ERF (ethylene response factor)AT5g472304.470
contig5511ERF-5Member of the ERF (ethylene response factor)AT5g472303.920
contig16446ERF-5Member of the ERF (ethylene response factor)AT5g472303.720
contig22401ERF-5Member of the ERF (ethylene response factor)AT5g472303.531.03E-10
contig5014ERF-5Member of the ERF (ethylene response factor)AT5g472303.170
contig13018ERF-5Member of the ERF (ethylene response factor)AT5g472302.854.47E-11
contig5559 ERF-4Member of the ERF (ethylene response factor)AT3g152103.050
contig6750ERF-4Member of the ERF (ethylene response factor)AT3g152102.211.10E-10
contig8076*GhWRKY40-like4Pathogen-induced transcription factorAT1g808404.980
contig1646*GhWRKY40-like3Pathogen-induced transcription factorAT1g808404.290
contig4254* GhWRKY40-like1Pathogen-induced transcription factorAT1g808402.456.95E-13
contig313*GhWRKY40-like2Pathogen-induced transcription factorAT1g808402.271.60E-11
contig5359* GhWRKY64-like1Member of WRKY Transcription Factor; Group IIIAT4g238105.370
contig9783* GhWRKY70-like1Member of WRKY Transcription Factor; Group IIIAT3g564004.751.14E-13
contig3807*GhWRKY70-like3Member of WRKY Transcription Factor; Group IIIAT3g564002.265.24E-11
contig3808*GhWRKY70-like4Member of WRKY Transcription Factor; Group IIIAT3g564002.243.55E-06
contig4599*GhWRKY70-like2Member of WRKY Transcription Factor; Group IIIAT3g564002.036.31E-05
contig4087 GhWRKY46-like1Member of WRKY Transcription Factor; Group IIIAT2g464004.371.77E-12
contig9296GhWRKY46-like3Member of WRKY Transcription Factor; Group IIIAT2g464002.888.66E-14
contig4086GhWRKY46-like2Member of WRKY Transcription Factor; Group IIIAT2g464003.50
contig2606* GhWRKY33-like1Member of the plant WRKY transcription factor familyAT2g384703.610
contig2605*GhWRKY33-like2Member of the plant WRKY transcription factor familyAT2g384703.360
contig1952GhWRKY33-like3Member of the plant WRKY transcription factor familyAT2g384702.563.01E-14
contig1953*GhWRKY33-like4Member of the plant WRKY transcription factor familyAT2g384702.432.46E-11
contig4409 GhWRKY22-like1member of WRKY Transcription Factor; Group II-eAT4g012502.561.57E-11
contig565STZRelated to Cys2/His2-type zinc-finger proteinsAT3g633805.40
contig6454STZRelated to Cys2/His2-type zinc-finger proteinsAT1g277305.220
contig561STZRelated to Cys2/His2-type zinc-finger proteinsAT1g277305.070
contig3187STZRING-H2 protein induced after exposure to chitinAT1g277303.640
contig1541*ATL2RING-H2 protein induced after exposure to chitinAT3g167203.770
contig1540*ATL2RING-H2 protein induced after exposure to chitinAT3g167203.710
contig3187STZRelated to Cys2/His2-type zinc-finger proteins found in higher plantsAT1g277303.640
contig567STZRelated to Cys2/His2-type zinc-finger proteins found in higher plantsAT1g277303.635.97E-12
contig563STZRelated to Cys2/His2-type zinc-finger proteins found in higher plantsAT1g277303.260
contig566STZRelated to Cys2/His2-type zinc-finger proteins found in higher plantsAT5g575603.25.90E-05
contig569STZRelated to Cys2/His2-type zinc-finger proteins found in higher plantsAT1g277302.419.60E-12
contig562STZRelated to Cys2/His2-type zinc-finger proteins found in higher plantsAT1g277303.380
contig5663Related to Cys2/His2-type zinc-finger proteins found in higher plantsAT1g277303.630
contig564ATSZF2Domain Zinc finger, C3H-typeAT2g401402.954.46E-08
contig722*ATSZF2Domain Zinc finger, C3H-typeAT2g401403.540
contig729*ATSZF2Zinc finger, C3H-typeAT2g401402.950
contig719*ATSZF2Zinc finger, C3H-typeAT2g401402.790
contig717*ATSZF2Zinc finger, C3H-typeAT2g401402.592.08E-13
contig720ATSZF1Salt-inducible zinc finger 1 (SZF1)AT3g559803.191.78E-08
contig728ATSZF1Salt-inducible zinc finger 1 (SZF1)AT3g559802.380.01
contig5654BRH1Encodes a novel ring finger proteinAT3g614603.610
contig22866ATMYB73Member of the R2R3 factor gene familyAT3g167203.770
contig19903ATMYB73Member of the R2R3 factor gene familyAT4g372603.241.10E-09
contig5383ATMYB73Member of the R2R3 factor gene familyAT4g372602.780
contig11235ATMYB73Member of the R2R3 factor gene familyAT4g372602.731.10E-08
contig3507GRASGRAS family transcription factorAT3g466002.730
contig2875ANAC002Belongs to a large family of putative transcriptional activators with NAC domainAT1g017203.040
contig718ANAC002Belongs to a large family of putative transcriptional activators with NAC domainAT1g017202.820
contig725ANAC002Belongs to a large family of putative transcriptional activators with NAC domainAT1g017202.051.37E-09
contig10852ATL6Encodes a putative RING-H2 zinc finger protein ATL6 (ATL6)AT3g052002.271.63E-10
contig7248RHL41Encodes a zinc finger proteinAT5g598202.665.29E-06
contig1779ATHSFA2Member of Heat Stress Transcription Factor (Hsf) familyAT2g26150−3.290
contig18462*BETAEncodes a beta carbonic anhydrase likely to be localized in the cytoplasmAT5g147402.690
contig4843Encodes an ABA- and drought-induced RING-DUF1117 geneAT5g595502.091.56E-07
contig17118 AOSMember of the cytochrome p450 CYP74 gene family that functions as an allene oxide synthaseAT5g426504.52.74E-09
contig3758 ACSEncodes a a member of the 1-aminocyclopropane-1-carboxylate (ACC) synthaseAT4g112803.480
contig12553* ACOEncodes 1-aminocyclopropane-1-carboxylate oxidaseAT1g050103.10
contig1208*ACOEncodes 1-aminocyclopropane-1-carboxylate oxidaseAT1g050102.161.48E-10
contig1206*ACOEncodes 1-aminocyclopropane-1-carboxylate oxidaseAT1g050102.097.20E-10
contig1960LOXPLAT/LH2 domain-containing lipoxygenase family proteinAT1g725203.730
contig9217 CYP707A3Encodes a protein with ABA 8′-hydroxylase activityAT5g453403.190
contig4960*JAZ1JAZ1 is a nuclear-localized protein involved in jasmonate signalingAT1g191802.265.01E-11
contig4959*JAZ1JAZ1 is a nuclear-localized protein involved in jasmonate signalingAT1g191802.192.02E-09
contig15546JAZ7Jasmonate-zim-domain protein 7 (JAZ7)AT2g346002.030
contig5978ATCHITIVEncodes an EP3 chitinaseAT3g544203.081.53E-14
contig21823ATCHITIVEncodes an EP3 chitinaseAT3g544202.572.26E-10
contig6127AtCAF1aEncodes one of the homologs of the yeast CCR4-associated factor 1AT3g442602.940
contig1389AT-SYR1Encodes a syntaxin localized at the plasma membraneAT3g118202.825.38E-12
contig1388AT-SYR1Encodes a syntaxin localized at the plasma membraneAT3g118202.412.02E-12
contig5585ATHSPRO2Ortholog of sugar beet HS1 PRO-1 2 (HSPRO2)AT2g400002.690
contig3792ATHSPRO2Ortholog of sugar beet HS1 PRO-1 2 (HSPRO2)AT2g400002.166.93E-05
contig2278Encodes an ABA- and drought-induced RING-DUF1117 geneAT5g595502.611.53E-14
contig4842Encodes an ABA- and drought-induced RING-DUF1117 geneAT5g595502.418.61E-06
contig4022ATNHL10Encodes a protein whose sequence is similar to tobacco hairpin-induced gene (HIN1)AT2g384702.568.82E-06
contig4024ATNHL10Encodes a protein whose sequence is similar to tobacco hairpin-induced gene (HIN1)AT2g359802.261.06E-09
contig22620Hs1pro-1 proteinAT3g558402.531.76E-09
contig3055(AT)SRC2Unknown proteinAT1g090702.386.49E-12
contig6505ARM repeat superfamily proteinAT4g272804.040
contig17616* PUB23Encodes a cytoplasmically localized U-box domain containing E3 ubiquitin ligaseAT2g359304.214.10E-12
contig2098ATPUB29E3 ubiquitin ligase activityAT3g187103.950
contig6504 ATCMPG1Ubiquitin-protein ligase activityAT1g661603.90
contig8776Ubiquitin-protein ligase activityAT5g374902.312.24E-07
GO:0016265 - Death
contig10876RLK1Encodes a receptor-like protein kinaseAT5g609002.565.07E-05
contig15035RLK1Encodes a receptor-like protein kinaseAT5g609002.288.92E-06
contig6958 RIPKEncodes a receptor-like cytoplasmic kinaseAT2g059402.131.14E-09
contig5615 EVREncodes a putative leucine rich repeat transmembrane protein kinaseAT2g318802.217.45E-09
contig12017CCR3CRINKLY4 related 3 (CCR3); kinase activityAT3g559502.243.33E-07
contig7179NAProtein kinase family protein with leucine-rich repeat domainAT5g259302.025.33E-09
contig10122ATPP2-A1Encodes a phloem lectinAT4g198402.790
contig7573 GRX480Encodes GRX480, a member of the glutaredoxin family that regulates protein redox stateAT1g284802.352.97E-12
contig8179 ATRBOHDNADPH/respiratory burst oxidase protein D (RbohD)AT5g479102.641.53E-14
contig9014ATRBOHDNADPH/respiratory burst oxidase protein D (RbohD)AT5g479102.487.49E-11
contig9015ATRBOHDNADPH/respiratory burst oxidase protein D (RbohD)AT5g479102.651.57E-09
contig4745ATMC1Metacaspase AtMCP1bAT1g021702.241.02E-09
contig5579ATGATL1The PARVUS/GLZ1 gene encodes a putative family 8 glycosyl transferaseAT1g193002.660
contig2607ATWRKY33Member of the plant WRKY transcription factor familyAT2g384703.141.00E-13
contig2076ATEBPEncodes a member of the ERF (ethylene response factor)AT3g167702.235.79E-11
contig14400ANAC083Encodes a NAC domain transcription factorAT5g131802.261.93E-07
contig12668JMTEncodes a S-adenosyl-L-methionine:jasmonic acid carboxyl methyltransferaseAT1g196403.40
contig13244AtLEA5Encodes AtLEA5, also known as Senescence-associated gene 21 (SAG21). Has a role on oxidative stress toleranceAT4g023802.347.06E-12
contig4041AtLEA5Encodes AtLEA5, also known as Senescence-associated gene 21 (SAG21). Has a role on oxidative stress toleranceAT4g023802.760
contig4042AtLEA5Encodes AtLEA5, also known as Senescence-associated gene 21 (SAG21). Has a role on oxidative stress toleranceAT4g023802.372.58E-09
contig4022ATNHL10Encodes a protein non-race specific disease resistance gene (NDR1)AT2g359802.568.82E-06
contig4024ATNHL10Encodes a protein non-race specific disease resistance gene (NDR1)AT2g359802.261.06E-09
contig3055(AT)SRC2SRC2 specifically binds the peptide PIEPPPHHAT1g090702.386.49E-12
contig1388AT-SYR1Encodes a syntaxinAT3g118202.412.02E-12
contig1389AT-SYR1Encodes a syntaxinAT3g118202.825.38E-12
contig21823ATCHITIVEncodes an EP3 chitinaseAT3g544202.572.26E-10
contig5978ATCHITIVEncodes an EP3 chitinaseAT3g544203.081.53E-14

Only DEG with the GO term for the biological process “response to chitin” (GO:0010200) and/or “death” (GO:0016265) are shown. The complete list of DEG is shown in Additional file 2.

*Genes present in both biological processes: response to chitin and death.

Genes in bold were tested by qPCR.

The DEGs were mapped using MapMan to generate a representative overview of the pathways affected (Figure 2). This analysis indicated the involvement of several genes in the biotic stress response including receptors recognising microbe-, pathogen-, herbivore- and damage-associated molecular patterns (MAMP, PAMP, HAMP and DAMP) as well as genes involved in triggering defensive responses such as transcription factors and the ET and JA signalling pathways (Figure 2).
Figure 2

Representative overview of DEG involved in biotic stress response in 48 h after infection with cotton boll weevil larvae. Log2-fold change of gene expression (cotton boll weevil compared to mock-inoculated control) was analysed by MapMan software. Yellow squares represent up-regulated genes and blue squares represent down-regulated genes. The colour saturation indicates log fold change > 4 and < 4. The figure shows that the molecular recognition of pathogens and herbivores by plants to trigger a defence response requires initial recognition including the following: 1. Microbe-, pathogen- and damage-associated molecular patterns (MAMP, PAMP and DAMP) are recognised by pattern recognition receptors (PRR) and lead to PAMP-triggered immunity (PTI). 2. Oviposition-associated compounds are recognised by unknown receptors and trigger defensive responses. 3. Putative herbivore-associated molecular patterns (HAMP) are recognised by receptors and lead to herbivore-triggered immunity (HTI). 4. Wounding by herbivores leads to the release of DAMP and to wound-induced resistance (WIR).

Distribution of differentially expressed genes (DEGs) (x-axis) into Gene Ontology (GO) categories (biological process) (y-axis) according to Gene Set Enrichment Analysis (GSEA). Only biological processes (BPs) discussed in the results are presented here. A complete list of BPs can be found in Additional file 6. Subset of differentially expressed genes (DEG) in cotton flower buds in response to feeding by cotton boll weevil larvae Only DEG with the GO term for the biological process “response to chitin” (GO:0010200) and/or “death” (GO:0016265) are shown. The complete list of DEG is shown in Additional file 2. *Genes present in both biological processes: response to chitin and death. Genes in bold were tested by qPCR. Representative overview of DEG involved in biotic stress response in 48 h after infection with cotton boll weevil larvae. Log2-fold change of gene expression (cotton boll weevil compared to mock-inoculated control) was analysed by MapMan software. Yellow squares represent up-regulated genes and blue squares represent down-regulated genes. The colour saturation indicates log fold change > 4 and < 4. The figure shows that the molecular recognition of pathogens and herbivores by plants to trigger a defence response requires initial recognition including the following: 1. Microbe-, pathogen- and damage-associated molecular patterns (MAMP, PAMP and DAMP) are recognised by pattern recognition receptors (PRR) and lead to PAMP-triggered immunity (PTI). 2. Oviposition-associated compounds are recognised by unknown receptors and trigger defensive responses. 3. Putative herbivore-associated molecular patterns (HAMP) are recognised by receptors and lead to herbivore-triggered immunity (HTI). 4. Wounding by herbivores leads to the release of DAMP and to wound-induced resistance (WIR).

Main processes or pathways affected by the response to infection with cotton boll weevil larvae

Several subsets of genes were identified, including protein kinases, Ca2+-binding sensor proteins (CaM), genes responsive to oxidative stress, transcription factors, cell-wall modification genes and phytohormone-responsive genes.

Genes associated with signal transduction: kinases, Ca2+-binding proteins and protein degradation

An appropriate defence response to a biotic threat initially requires recognition of the threat. Herbivore-associated molecular patterns such as chitin are detected by pattern recognition receptors (PRR) on the surface of the host plant cell, leading to HAMP-triggered immunity. Well-characterised plant PRR comprises leucine-rich repeat (LRR) receptor-like kinases (RLK). Members of this family of proteins have a common structure formed by an extracellular ligand-binding domain and a cytoplasmic kinase domain [26]. Eleven RLK/LRR are differentially expressed (up-regulated) and annotated with the GO term “response to chitin” and/or “death” (Table 3), including receptor-like cytoplasmic kinase (RIPK), receptor-like kinase localised on the plasma membrane (RLK1), serine/threonine-protein kinase-like domain (CCR4 and CCR3) and putative leucine rich repeat transmembrane protein kinase (EVR). Recognition of conserved PAMP by PRR triggers intracellular signalling via mitogen-activated protein kinase (MAPK, MAPKK, MAPKKK) cascades followed by the activation of WRKY transcription factors [27]. In our analysis, cotton boll weevil larvae affected the expression pattern of several mitogen-activated protein kinase genes. For instance, two mitogen activated protein kinases (MAPK) that encode MAPK3 in A. thaliana were transcriptionally up-regulated (contig2100 and contig2102). Similarly, a member of the MAP Kinase Kinase family (MKK9, contig14909) and a protein kinase kinase kinase (MAPKKK14, contig7892) that play a role in leaf senescence during pathogenesis by Alternaria blight in A. thaliana were also transcriptionally up-regulated [28]. The activation of MAPK through wounding is dependent on the direct binding of calmodulins (CaM) in a Ca2+ dependent manner. Mechanical wounding and insect attack trigger a transient increase of cytosolic Ca2+ levels within minutes. This Ca2+ oscillation acts as a mediator for stimulus responses to several Ca2+-binding proteins, such as calmodulins (CaM), calcineurin B-like proteins, and calcium-dependent protein kinases (CDPK) [29]. In our analysis, nine transcripts coding for calmodulin-like proteins, including EDA39, CML38, CML24 and ACA2, were present in the DEG related to chitin response. In Arabidopsis, these genes are involved in plant innate immunity signalling and function as sensors of the signal generated by Ca2+ influx into the cytosol (Table 3) [30]. Targeted protein degradation via the ubiquitin/26S-proteasome pathway is another important regulatory process. Among the thirteen G. hirsutum genes annotated to be involved in this process, four were also related to the chitin and/or death GO (Table 3).

Cotton boll weevil-induced transcription factor (TF) genes

Transcription factors serve as important regulators of biotic and abiotic stress responses by turning on or off the immune system during plant defence. Our results suggest that the gene expression reprogramming in response to the boll weevil larvae feeding depends on many transcription factors (Table 2). As many as 88 TFs, which are 20% of the DEGs, were differentially expressed after cotton boll weevil infection, with 87 up-regulated and only one down-regulated (HSF, Heat-Shock transcription factor). Among the TFs up-regulated by cotton boll weevil feeding, there are 23 ethylene response factors (AP2/ERE), 21 WRKY, 12 C2H2 zinc finger, nine helix–loop–helix (bHLH), six C3H zinc finger, six MYB, six NAC and five GRAS (Table 2). Members of the AP2/ERE and WRKY families were also overrepresented in the study reported by Libault et al. [31], in which the expression patterns of Arabidopsis transcription factors were analysed in response to purified chitooctaose. Similarly, Wei et al. [14] observed that among the differentially expressed genes in Arabidopsis leaves in response to diamond back moth feeding, there were 18 ethylene response factors (ERFs) and 10 WRKY. The ERF subfamily belongs to the APETALA2 (AP2)/ethylene-responsive-element-binding protein (EREBP) family, an FT family exclusive to plants. WRKY transcription factors in plants are one of the largest families of zinc finger transcription factors and modulate development as well as responses to abiotic stresses, wounding, pathogens, and herbivore attack [32]. WRKY TFs can act as both positive and negative regulators of plant defence pathways. The mechanisms activating WRKY TF might involve the MAP kinase cascades and/or calcium signalling [32-34]. In the present study, 21 WRKYs were up-regulated (Table 2). Considering the large number of genes in this family that were up-regulated in our experiment and their importance in the response to pathogen and herbivore attack, we investigated this TF family in more detail. A phylogenetic analysis including WRKY genes from cotton and Arabidopsis was performed (Figure 3 and Additional file 7). The phylogenetic analysis classified the cotton genes in the WRKY clades and identified several putative cotton homologues to Arabidopsis WRKYs that have been previously characterised. Among the genes with an altered response to cotton boll weevil larvae feeding, we identified nine WRKY genes in cotton with putative homologues in Arabidopsis that belong to group III: WRKY46, WRKY30, WRKY64 and WRKY70 (Figure 3 and Additional file 7) [33, 34]. In addition, GhWRKY40-like1 (AT1G80840), GhWRKY33-like1 (AT2G38470) and GhWRKY22-like1 (AT4G01250) genes belong to group IIa, I, and IIe, respectively (Figure 3 and Additional file 7). Other zinc finger-containing transcription factors from the C2H2 and C3H families were also up-regulated during boll weevil larvae feeding. Members of these families are also up-regulated by aphid and Pseudomonas syringae attack in Arabidopsis [35] (Table 2). MYB proteins are central regulators of development, metabolism, and the response to abiotic and biotic stresses [36]. In our analysis, we found six up-regulated genes coding for R2R3-MYB TFs (Table 2). Defence responses regulated by MYB transcription factors promote HR-related cell death and resistance against bacterial and necrotrophic pathogens. MYB transcription factors also play roles in the defence response against insects. Small Cabbage White caterpillars (Pieris rapae) induce local expression of AtMYB102 [37]. NAC transcription factors have been shown to regulate plant development, phytohormone signalling and abiotic stress responses [38]. Six NAC genes were differentially expressed in response to cotton boll weevil larvae feeding (Table 2). Some NAC genes are up-regulated in response to feeding by diamond back moths, wounding, and bacterial infection [39, 40]. bHLH transcription factors regulate plant cell and tissue development as well as phytohormone signalling. A limited number of bHLH transcription factors characterised to date have been found to be involved in the defence against pathogens or pests. Nine bHLH TFs were consistently up-regulated in this study. Other differentially expressed (up-regulated) transcription factors belonging to the GRAS family play diverse roles in root and shoot development, gibberellic acid (GA) signalling, phytochrome A signal transduction and the chitin induced defence response [41, 42]. We focused on three members of the AP2/ERE family and nine members of the WRKY family to further characterise the response to cotton boll weevil feeding by qPCR analysis (Table 2).
Figure 3

Phylogenetic tree of WRKY domains between cotton and . The amino acid sequences of the WRKY domain of cotton and Arabidopsis were aligned with MUSCLE, and the phylogenetic tree was constructed using the JTT model with an estimated γ-distribution parameter (G). The maximum-likelihood analyses were performed with the program PhyML version 3.0, and assessment of node confidence was performed using 1,000 bootstrap replicates. The members of group I, II (a-e) and III are labelled according to the classifications of AtWRKY domains by Eulgem et al. [33]. The triangles indicate cotton WRKY genes, and filled triangles represent the genes analysed by qPCR. Contig9787 was named GhWRKY70-like1; was named contig5359 GhWRKY64-like1; and contig16334 was named GhWRKY72-like1 after calculating the p-distance to determine the closest relationship with Arabidopsis members.

Phylogenetic tree of WRKY domains between cotton and . The amino acid sequences of the WRKY domain of cotton and Arabidopsis were aligned with MUSCLE, and the phylogenetic tree was constructed using the JTT model with an estimated γ-distribution parameter (G). The maximum-likelihood analyses were performed with the program PhyML version 3.0, and assessment of node confidence was performed using 1,000 bootstrap replicates. The members of group I, II (a-e) and III are labelled according to the classifications of AtWRKY domains by Eulgem et al. [33]. The triangles indicate cotton WRKY genes, and filled triangles represent the genes analysed by qPCR. Contig9787 was named GhWRKY70-like1; was named contig5359 GhWRKY64-like1; and contig16334 was named GhWRKY72-like1 after calculating the p-distance to determine the closest relationship with Arabidopsis members.

Oxidative stress-related genes affected by cotton boll weevil feeding

Oxidative radicals play an important role in plants during various stresses, including biotic stress such as insect infestation [43]. Among the genes up-regulated by oxidative stress, a transcript encoding the respiratory burst oxidative homolog D (RbohD) protein, a key enzyme involved in the reduction of reactive oxygen species (ROS) during the defence response, was identified. In addition, GRX480, a member of the glutaredoxin family that catalyses thiol disulphide reduction and might be a candidate controlling the redox state of regulatory proteins, was also up-regulated. GRX480 might represent a potential regulatory component of SA/JA antagonism [44].

Phytohormone-related genes induced by the boll weevil larvae feeding

Phytohormones are crucial in the stimulation of defence to insect herbivory. Several signalling pathways, including JA, SA, and ET, are believed to orchestrate the induction of insect defences [10]. Genes associated with the biosynthesis, signalling and response to stimuli by JA, SA, ABA (abscisic acid) and ET phytohormones were significantly overrepresented in the flower bud transcriptome affected by the boll weevil based on hypergeometric distribution analysis (p-values ≤ 0.005) (Figure 1). For instance, the genes associated with jasmonate biosynthesis are well represented among the genes up-regulated by cotton boll weevil feeding, including lipoxygenase (LOX), allene oxide synthesis (AOS), and jasmonic acid carboxyl methyltransferase (JMT). We identified genes associated with salicylic acid signalling, a widely known phytohormone involved in the induction of pathogen resistance (PR) genes as well as in the establishment of long-term immunity via the SAR pathways [45] (Figure 1). In addition, other genes up-regulated by boll weevil feeding are involved in ABA catabolism and ET biosynthesis. These include the gene encoding the ABA 8′-hydroxylase (CYP707A3), which catalyses the first step in the oxidative inactivation of ABA [46]. The genes involved in ethylene biosynthesis ACS (1-aminocyclopropane-1-carboxylate synthase) and ACO (1-aminocyclopropane-1-carboxylate oxidase) were also up-regulated after herbivory (Table 3).

Down-regulated genes involved in the response to cotton boll weevil feeding

Several putative genes involved in cell wall formation and degradation were identified as down-regulated. Contig12858 was one of the most highly down-regulated genes in our study (log2-fold change = −34.07, Additional file 2). Contig12858 is similar to genes belonging to the invertase/pectin methylesterase inhibitor family, from which some genes are known to be involved in cell wall modification but have also been reported to play an important role in basal disease resistance [47, 48]. Other genes with potential roles in cell wall reorganisation were down-regulated, such as an endoxyloglucan transferase (contig9845), which has a role in primary cell wall restructuring, as well as an alpha/beta hydrolase (contig12015) and a beta-expansin (contig9652) (Additional file 2). Interestingly, many down-regulated genes encoding heat shock proteins (Hsp) and one heat shock transcription factor (HSF) were observed. Among them, five belong to the small Hsp family, four to the Hsp70 family and six to the Hsp90 family (Additional file 2).

Validation of transcriptome data and evaluation of the spatial expression pattern of defence-related genes in response to infestation by cotton boll weevil larvae using “in situ” qPCR

To better characterise the potential role of some genes in plant defence against insect herbivory, we selected 32 DEGs that responded to feeding by the cotton boll weevil larvae. These genes function in different levels of the biotic stress defence response signalling pathway (Figure 2) and were annotated in biological processes such as the response to chitin and/or cell death (Table 3). Initially, these were analysed with real-time qPCR on 6 mm cotton flower buds infected by larvae or the control samples for further validation of the transcriptome data. The expression patterns observed between the RNA sequencing (RNAseq) and qPCR methods were highly similar (Additional file 8), indicating the accuracy of our transcriptome profile. To verify this observation, we calculated the Pearson correlation coefficient (r) between the different methods for all transcripts, and a correlation coefficient of 0.9406 (P ≤ 0.05) was observed. To evaluate the spatial expression pattern of these genes by “in situ” qPCR, two different areas of the 6 mm cotton flower buds were isolated by laser microdissection (LMD): the stamens (near the feeding site) and the carpels (away from the injured area) (Figure 4). Among the 28 up-regulated genes selected for analysis, 27 were up-regulated in both organs. Only CML38 showed a contrasting expression pattern of up-regulation in the stamens (near the feeding site) but down-regulation in the carpels (away from the feeding site) (Figure 5). The spatial expression analysis of four down-regulated genes showed that all four were repressed in stamens and carpels (Figure 5). We also evaluated if the expression near the feeding site (stamens) was significantly different from the expression away from the damaged tissue (carpels). Among the 32 genes tested by qPCR, transcripts from 24 genes were more highly expressed near the feeding site (Figure 5). A few genes, such as the transcription factors GhWRKY19-like1, GhWRKY22-like1, GhWRKY40-like1 and ERF-98 and the RbohH and HSP90.1 genes were more highly expressed in carpel tissues compared to stamen tissues (Figure 5 and Additional file 9).
Figure 4

Isolation of cotton tissues from paraffin-embedded sections by laser microdissection (LMD). Isolation of cotton tissues from paraffin-embedded sections by laser microdissection (LMD). Sections before LMD (a, c, e), and sections after LM (b, d, f). The area selected for laser microdissection is outlined in green (a region near the damage caused by larvae feeding, which comprised the stamen tissue, viewed at a, c and d) or blue (a region farther from the injured area, which comprised the carpel tissue, viewed at a, e, and f). The assessment of extracted RNA integrity from the stamen and carpel are shown in g and h, respectively. Electropherograms were obtained with an Agilent 2100 Bioanalyser. Open and closed arrowheads indicate the 18S and 28S ribosomal RNA peaks, respectively. RNA quality is expressed as the RNA integrity number (RIN). Scale bars = 100 μm.

Figure 5

Comparison of expression levels by “ ” qPCR of a subset of 32 DEG. These genes were examined from two different areas (stamen and carpel) isolated from 6 mm cotton flower buds infested by cotton boll weevil larvae by laser microdissection (LMD) in relation to the control. The reference genes GhACT4 and GhFBX6 were used to normalise the qPCR data. The relative expression level was calculated using the relative expression software tool (REST©), and a subsequent statistical test of the analysed CP values by a Pair-Wise Fixed Reallocation Randomization Test was performed.

Isolation of cotton tissues from paraffin-embedded sections by laser microdissection (LMD). Isolation of cotton tissues from paraffin-embedded sections by laser microdissection (LMD). Sections before LMD (a, c, e), and sections after LM (b, d, f). The area selected for laser microdissection is outlined in green (a region near the damage caused by larvae feeding, which comprised the stamen tissue, viewed at a, c and d) or blue (a region farther from the injured area, which comprised the carpel tissue, viewed at a, e, and f). The assessment of extracted RNA integrity from the stamen and carpel are shown in g and h, respectively. Electropherograms were obtained with an Agilent 2100 Bioanalyser. Open and closed arrowheads indicate the 18S and 28S ribosomal RNA peaks, respectively. RNA quality is expressed as the RNA integrity number (RIN). Scale bars = 100 μm. Comparison of expression levels by “ ” qPCR of a subset of 32 DEG. These genes were examined from two different areas (stamen and carpel) isolated from 6 mm cotton flower buds infested by cotton boll weevil larvae by laser microdissection (LMD) in relation to the control. The reference genes GhACT4 and GhFBX6 were used to normalise the qPCR data. The relative expression level was calculated using the relative expression software tool (REST©), and a subsequent statistical test of the analysed CP values by a Pair-Wise Fixed Reallocation Randomization Test was performed.

The differentially expressed transcriptome of cotton in response to boll weevil larvae feeding

To further evaluate whether cotton and Arabidopsis share a common gene set in response to tissue-chewing pests, we compared the differentially expressed transcriptome with the publicly available microarray data set GEO: GSE10681 (Arabidopsis leaf transcriptome in response to diamond back moth (DBM, Plutella xylostella larvae feeding). We also used a GSEA approach to determine the genes and pathways that were relevant in response to 24 hours of continuous DBM feeding and identified the biological processes that were overrepresented in the ordered list of DEG in this experiment (data not shown). We found several similarities between the cotton transcriptome and the public domain microarray data in terms of enriched biological processes. Biological processes related to hormone biosynthesis and signalling, such as JA, SA, ET and ABA, the responses to organic substances, the negative regulation of programmed cell death, the detection of biotic stimuli, systemic acquired resistance and the regulation of the immune response were overrepresented in both GSEA analyses. However, there were obvious differences in the enriched biological processes, including the response to auxin, gibberellin stimulus and secondary metabolite biosynthetic processes. These biological processes are overrepresented only in the leaf transcriptome analysis of Arabidopsis infested by DMB. On the other hand, the response to chitin and signalling are biological processes overrepresented in cotton flower buds damaged by cotton boll weevil larvae. There were only nine up-regulated genes that overlapped between the cotton larvae in floral tissue and the leaf injured by diamond back moth (Figure 6). Among them, there were transcripts encoding proteins that repress JA signalling, such as JAZ1 (At1g19180), JAZ7 (At3g34600), and JAS1 (At5g13220) as well as a disease resistance protein LRR (At2g34930) that specifically recognises pathogen effectors, resulting in effector-triggered immunity (ETI). Another gene represented in both data sets is a lipoxygenase (LoX4, At1g72520) that dioxygenates unsaturated fatty acids, leading to the lipoperoxidation of biological membranes. LoXs are involved in the apoptosis (programmed cell death) pathway and in biotic and abiotic stress responses in plants [49]. Finally, the gene 2-oxoglutarate and Fe(II)-dependent oxygenase (At5g05600), which is involved in the phenylpropanoid pathway and implicated in the synthesis of defence compounds, was found in both data sets and was also induced by Pieris brassicae eggs and Pieris rapae herbivory [17]. We found only one gene that was down-regulated in both experiments: a germin-like protein (GER3) with oxalate oxidase activity leading to hydrogen peroxide (H2O2) production, which is a ROS that is implicated in the herbivory-induced response in plants [17].
Figure 6

Venn diagram comparing the cotton boll weevil ( ) induced transcriptome with the response to another tissue-chewing pest, . The number of induced (up-regulated, left column) and repressed (down-regulated, right column) genes after 48 h of cotton boll weevil feeding (A. grandis) was compared to the response of another 24 h herbivory-treatment previously published [39]. Selected genes have a p-value ≤ 0.05 and |logFC| ≥ 2.0.

Venn diagram comparing the cotton boll weevil ( ) induced transcriptome with the response to another tissue-chewing pest, . The number of induced (up-regulated, left column) and repressed (down-regulated, right column) genes after 48 h of cotton boll weevil feeding (A. grandis) was compared to the response of another 24 h herbivory-treatment previously published [39]. Selected genes have a p-value ≤ 0.05 and |logFC| ≥ 2.0.

Discussion

G. hirsutum L. is an allotetraploid species with a large and complex genome, and comprehensive sequence information describing the genome is still incomplete. Moreover, previous studies have provided limited information on the transcriptional dynamics during the cotton defence response [50]. The recent availability of high-throughput sequencing technologies provides an unprecedented opportunity to thoroughly explore the defence response using large-scale expression profile analysis despite uncharacterised genomic sequences. In the present study, using massive parallel mRNAseq, we report the transcriptional changes in G. hirsutum L. flower buds in response to cotton boll weevil (A. grandis) larvae feeding. To this end, eggs from A. grandis that were ready to hatch were placed on stamens inside 6 mm flower buds, the stage at which the round unicellular microspores are found in the locules. Cotton boll weevil larvae chew for ~48 h on flower tissues, representing the early steps of insect damage. It has been suggested that pollen grains are the initial target of the larvae after egg hatching [51]. Subsequently, the larvae migrate to the ovary where they feed in the ovules and conclude their life cycle, leading to the abscission of floral structures [51]. In this study, we generated over 327 million sequence reads (100 bp in length) of raw sequence data using the Illumina sequencing of two biological replicates from 6 mm cotton flower buds damaged by cotton boll weevil larvae and undamaged flower buds. At the current level of sampling (at least 74 million tags per sample), the sensitivity of the RNASeq approach allowed the accurate identification of differentially expressed genes. A previous study in cotton showed by RNA-Seq that the transcriptome changes after pathogen (Verticillium dahliae) inoculation, although they generated only 27 million tags of 21 bp in length [50]. Dubey et al. [4] analysed the leaf transcriptome of cotton plants infested by aphids and whiteflies, generating a total of 3.8 million high quality reads through RNA-seq. In our study, the number of up-regulated genes (402) was much greater than the number of down-regulated genes (41) after damage by cotton boll weevil larvae. These results are supported by previous reports showing that larval feeding stimulates induction rather than suppression of genes [18, 39]. Several studies have examined the transcriptional profiles of different modalities of attack, such as pathogen or insect [15, 18, 31]. However, our results contrast with results observed in cotton plants damaged by whiteflies and aphids, in which the number of down-regulated genes was greater than the number of up-regulated genes in both cases after damage [4]. The MapMan analysis of the DEGs generated a representative overview of the plant response after larvae damage (Figure 2). This analysis suggested that the cotton plant could perceive boll weevil larvae-derived physical and chemical cues such as compounds in the oviposition fluid, chitin, and insect oral secretions. These elicitors dramatically alter the expression of several genes in the plant immune response pathways that are involved in biotic stress response. The herbivory-induced changes are mediated by elaborate signalling networks, including receptors/sensors, Ca2+ influx, kinase cascades, transcription factors, reactive oxygen species and phytohormone signalling pathways. We were able to identify genes belonging to all major networks previously described and thus relate these to the plant response to herbivory. The plant immune system is activated by various herbivore (or microbial)-associated molecular patterns (HAMP or MAMP) that are detected as non-self molecules. Such patterns are recognised by immune receptors that are either cytoplasmic or localised on the plasma membrane. Cell surface receptors include receptor-like kinases (RLK) that frequently contain extracellular leucine-rich repeats and an intracellular kinase domain for the activation of downstream signalling, as well as receptor-like proteins (RLP) that lack this signalling domain. It is therefore hypothesised that RLKs are required for RLPs to activate downstream signalling [52]. Several putative receptor-like kinases, such as the homologues of CCR4, RIPK and EVR, were up-regulated after cotton boll weevil larvae damage (Table 3). CCR4 encodes a transmembrane protein kinase (RLK) previously characterised as a key gene in the plant-insect interaction. Little et al. [17] showed that 41 RLKs, including CCR4, were induced by oviposition by P. brassicae, suggesting that RLKs play an important role in the detection of herbivore-associated molecular patterns. Interestingly, CCR4 was significantly more highly expressed in the stamen close to the larval damage than in the carpel, as assessed by our in situ qPCR analysis (Figure 5). Furthermore, the RIPK gene encodes a cytoplasmic NB-LRR immune receptor that recognises the AvrB and AvrRpm1 effectors from Pseudomonas syringae, leading to the activation of ETI [53]. This gene was also more highly expressed in the stamen than in the carpel tissue in our in situ qPCR analysis. Another leucine rich repeat transmembrane protein kinases analysed in our study, EVR, was equally induced by boll weevil larvae in the stamen and the carpel, is also expressed in response to P. syringae, and is involved in the regulation of cell death and innate immunity (Figure 5) [52]. The recognition of conserved HAMP, such as chitin or fatty acid-amino acid conjugates (FAC, the major component of insect oral secretions) by PRR triggers intracellular signalling via a mitogen-activated protein kinase (MAPK, MAPKK, MAPKKK) cascade [27]. Our study identified homologues to MAPK3 (At3g45640) and MKK9 (At1g73500) genes with the GO term “response to chitin”. The in situ qPCR analysis showed that these genes were highly expressed in the stamen and carpel from cotton flower buds infested by boll weevil larvae (Figure 5). MAPK, MAPKK, and MAPKKK activate several transcription factors such as ERFs and WRKYs during the response to pest attacks, as has been shown for ERF-5 [54]. A previous study in Arabidopsis showed that ERF-5 is phosphorylated by MAPK3 (At3g45640), suggesting that ERF-5 might play an important role in plant defence by positively regulating ethylene (ET) signalling [54]. The GhERF-4 and GhERF-5 genes were more highly expressed in the stamen than in the carpel, and the GhERF-98 gene was slightly more highly expressed in the carpel (Figure 5). ERF-4, ERF-5 and ERF-98 belong to the AP2/EREB family and were previously identified in the response to nematode or fungal inoculation (Alternaria brassicicola) as well as in response to chitin, a main elicitor of the plant defence response against insects [31, 55]. RNAseq analysis identified transcripts for 88 TFs that were differentially expressed in 6 mm cotton flower buds damaged by boll weevil larvae. In the present study, we selected the 21 cotton genes identified as WRKY for a phylogenetic analysis in combination with the full set of Arabidopsis WRKY genes. The expression patterns of nine WRKY genes were explored in detail with a combination of LMD and qPCR. The transcript level of GhWRKY30-like1 gene was the same in stamen and carpel tissue and was induced relative to the control. GhWRKY33-like, GhWRKY46-like1, GhWRKY64-like1, GhWRKY70-like1 and GhWRKY72-like1 were more highly expressed near the damage in stamen tissues. In Arabidopsis, it was also shown that WRKY46 is specifically induced by salicylic acid (SA) and infection by the biotrophic pathogen P. syringae, working redundantly with the structurally related genes WRKY70 and WRKY53 to positively regulate basal resistance to P. syringae [56]. Previous results have shown that the Arabidopsis WRKY genes are rapidly and strongly induced by chitin [31]. Some WRKYs are target genes of the upstream MAPK3 (At3g45640), such as WRKY33. Studies in Arabidopsis have shown that in response to pathogen invasion, MAPK3 (At3g45640) phosphorylates WRKY33, which directly binds to the W-boxes in the promoter of the ACS6 gene in vivo, suggesting that WRKY33 is directly involved in the activation of ACS6 expression downstream of the MAPK3 cascade [57]. This signalling event produces high levels of ethylene, which plays an important role in plant immunity. Moreover, AtWRKY33 is required to activate the synthesis of antimicrobial substances such as camalexin [34]. Interestingly, the other three WRKY genes (GhWRKY19-like1, GhWRKY22-like1 and GhWRKY40-like1) showed a higher expression in the carpel than in the stamen. In an elegant series of experiments, Asai et al. [58] showed that AtWRKY22 functions downstream of the flagellin receptor, a leucine-rich repeat (LRR) receptor kinase, and that a mitogen-activated protein kinase (MAPK) cascade conferred resistance to bacterial (P. syringae) and fungal (Botrytis cinerea) pathogens. These results strongly indicate that signalling events initiated by boll weevil larvae feeding might converge on a conserved system of pathogen-associated molecular pattern recognition, a MAPK cascade and transcription factors. Relatively little is known about the signal transduction pathways triggered by insect damage. Calcium ions (Ca2+) have been implicated as a second messenger in many plant signalling pathways including responses to herbivory [12]. It has been shown that mechanical wounding and insect damage trigger a transient increase in cytosolic Ca2+ levels [59]. For instance, the Egyptian cotton worm (Spodoptera littoralis) caused a transient increase in cytosolic Ca2+ in Phaseolus lunatus cells adjacent to the insect damage [59]. Cotton genes encoding calmodulin-like proteins (EDA39, CML38 and CML24) showed a higher expression close to the damage (stamen tissues) than in carpels, suggesting that these proteins might be a component of Ca2+ signalling that modulate plant defence responses against herbivory attack. Several studies have revealed that ROS are implicated in herbivory-induced responses in plants [12, 43, 60]. Superoxide anion (O2−), hydrogen peroxide (H2O2), singlet oxygen (1O2), and hydroxyl radical (·OH) are collectively called ROS; they are produced in mitochondria, chloroplasts, and peroxisomes, as well as on the external surfaces of plasma membranes. Feeding of Helicoverpa zea on soybean, as well as S. littoralis and Tetranychus urticae feeding on Medicago truncatula plants, increase ROS levels [12, 61]. Our analysis identified several transcripts encoding enzymes directly involved in oxidative stress that are up-regulated in response to feeding by boll weevil larvae. In situ qPCR analysis showed that transcripts encoding an NADPH oxidase (RbohD) were expressed at a higher level in carpel tissues than in stamen tissues compared to the control samples (Figure 5). AtrbohD was largely responsible for the accumulation of ROS during the defence response in Arabidopsis against P. syringae and Peronospora parasitica [60]. Our results suggest that the oxidative burst is also part of the cotton plant’s response to boll weevil larvae feeding. The boll weevil larval damage in flower buds also induced the expression of genes associated with phytohormone biosynthetic processes and signalling. Among the gene transcripts associated with JA, ET and ABA biosynthesis selected for analysis by qPCR were AOS, ACS, ACO and CYP701A3. All of these genes showed a significantly higher expression in the stamen and carpel tissue compared to the control samples. Among the JA metabolism genes that modulate plant responses to biotic stress by boll weevil larvae are AOS, which encodes the second enzyme involved in JA biosynthesis, and JMT, which is involved in the synthesis of the volatile compound methyl-JA (MeJA) used for plant defence. Jasmonate plays a central role in regulating the defence responses to herbivores that inflict various types of tissue damage. Jasmonate mutants are affected in their resistance to a wide range of arthropod herbivores, including caterpillars (Lepidoptera), beetles (Coleoptera), thrips (Thysanoptera) and leafhoppers (Homoptera) [9, 59, 62]. Jasmonate has been associated with the synthesis of proteinase inhibitors (PI), defence-related volatile compounds and secondary metabolites, such as nicotine, active phenolics and phytoalexins [12]. The genes involved in the ET biosynthesis pathway, ACS and ACO, were also up-regulated by boll weevil larvae damage. The primary function of ET in plant resistance to herbivores is the fine-tuning of JA-induced responses [12]. In tomato plants, ET potentiates the JA-induced transcript accumulation of secondary metabolites such as PIs [63]. The treatment of Arabidopsis plants with ethephon, a synthetic ET donor, transiently elevates the levels of JA and AOS transcripts [64]. Blocking the perception of ET with 1-MCP diminishes herbivory-induced volatile emission, which is mainly regulated by JA [65]. By using plants that ectopically express a loss-of-function ETR gene, von Dahl et al. [66] showed that ET signalling is crucial for increasing basal levels of nicotine, an effective defence against herbivory. These results indicated that JA and ET may be involved in elevating the basal resistance of cotton plants to herbivory. The ubiquitin-ligase genes are also known to play a role in the plant defence response [67]. Yang et al. [68] showed that the E3 ligase activity of Arabidopsis PUB17 is required for the initiation of the hypersensitive response (HR). Therefore, two predicted members of this family, PUB23 and CMPG1, which are among the DEGs in our experiment and were annotated in response to chitin and/or death, were also analysed by in situ qPCR. Spatial analysis of gene expression revealed that the transcripts of both genes were strongly up-regulated at the site close to the damage (stamen) compared to carpel tissues. These genes are among the 30 ubiquitin-ligase genes that up-regulated either 15 or 30 min after chitooctaose treatment in Arabidopsis [31]. Previous studies have shown that the expression of the PUB23 gene in Arabidopsis was induced by PAMPs and infection by pathogens. Moreover, the pub22/pub23/pub24 triple mutant displayed negative regulation of PAMP-triggered immunity [69]. Among the repressed gene transcripts related to the herbivory response, we identified transcripts for one germin-like protein (GER3) and many genes encoding heat shock proteins (Hsp). qPCR analysis revealed that the GER3 transcript was strongly down-regulated in the stamen and in the carpel (Figure 5). Interestingly, GER3 is included in the overlap between the down-regulated genes of the leaf transcriptome analysis of Arabidopsis damaged by DMB larvae and our study (Figure 6). GER3 leads to hydrogen peroxide (H2O2) production, a type of ROS that is implicated in the herbivory-induced response in plants, and was also repressed by P. brassicae eggs and P. rapae herbivory in a study in A. thaliana [17]. The transcriptional repression of genes encoding the heat shock proteins HSP20, HSP90.1 and HSP90.2 was observed in both stamen and carpel tissues (Figure 5). In many plant species, HSP90 isoforms are required for disease resistance against invading pathogens. HSP90 interacts with the disease resistance signalling components SGT1b and RAR1 and is required for RPS2-mediated resistance, a disease resistance (R) intracellular protein that specifically recognises pathogen effectors. For example, the AtHSP90.1 and AtHSP90.2 genes in Arabidopsis are required for RPS2-mediated resistance against P. syringae expressing AvrRpt2 and for RPM1-mediated resistance to P. syringae expressing AvrRPM1, respectively [70-72]. HSP90 is also essential for Rx-mediated resistance to Potato virus X (PVX), N-mediated resistance to Tobacco mosaic virus, and Pto-mediated resistance to P. syringae expressing AvrPto [73, 74]. In contrast, the hsp90.2-3 mutant with a point mutation in the ATP-binding domain of AtHSP90.2 is known to be more sensitive to biotrophic pathogens. In our results, several hsp genes were down-regulated, suggesting that larvae attempt to overcome the plant immune defence triggered by the effector.

Conclusion

Alternative strategies for protecting crops from insect pests are exploring the endogenous resistance mechanisms exhibited by plants to most herbivore insects through a greater understanding of induced defences in plants. This study aimed to unravel the changes in the transcriptome of G. hirsutum flower buds in response to feeding by cotton boll weevil (A. grandis) larvae. A large number of cotton transcripts and biological processes were significantly altered upon larvae infestation. Among the changes observed, we highlighted the induction of transcripts for the receptors/sensors that recognise chitin, insect oral secretions and signals from injured plant cells; differential modulation of transcripts encoding enzymes related to kinase cascades; transcription factors (such as WRKY and ERF); Ca2+ influx; reactive oxygen species; and modulation of transcripts encoding enzymes from phytohormone signalling pathways, mainly the JA and ET pathways. Cotton boll weevil larvae feeding affected many genes that have also been shown to be regulated in response to microbial or fungal infection, indicating the existence of complex crosstalk in the response to these different pathogens. The qPCR analysis associated with microdissection showed that almost all of the genes were more highly expressed in the stamen, the region close to the damage caused by larvae feeding, than in the carpel, farther away from the injured area. However, herbivory-induced defence responses in cotton flower buds were observed not only in the wounded regions but also in undamaged regions of the damaged flower buds. It is possible that a signal travels to other parts of the flower bud that transmit an herbivory alert. Although the identity of the mobile signal responsible for this systemic response remains unknown in many plants, studies performed in solanaceous plants demonstrated that herbivory rapidly induces a short-distance mobile signal that travels and activates MAPKs and triggers JA accumulation. Deciphering the signals that regulate herbivore-responsive gene expression in G. hirsutum flower buds might provide new strategies to manipulate this response for the production of insect-resistant transgenic plants.

Methods

Plant material and A. grandisinfestation assay

Cotton (Gossypium hirsutum L. variety “BRS Cedro”) plants were grown under a controlled temperature (27 ± 2°C) and natural photoperiod at Embrapa Genetic Resources and Biotechnology in Brasilia (DF, Brazil). Three-month old plants containing 6 mm flower buds were selected for the experiments. A population of A. grandis (Coleoptera: Curculionidae) was maintained at 27 ± 2°C, 70 ± 10% relative humidity, and a photoperiod of 14 h. Insects were routinely maintained on a standard rearing diet [75]. A 6 mm cotton flower bud previously drilled with a needle was inoculated with an A. grandis egg containing an active embryo. The orifice resulting from drilling was sealed with Vaseline to prevent dehydration of the egg. The hatching of larvae occurred approximately 2 h after egg inoculation. The larvae were removed with tweezers from the flower buds 48 h after inoculation, and four flower buds from different plants were immediately frozen in liquid nitrogen for the isolation of total RNA to perform RNAseq experiments. Similarly, a set of twelve flower buds infected with larvae were collected and fixed in cold ethanol:acetic acid (3:1) to subsequently isolate the selected tissue through laser microdissection. The insect infection experiments were performed in two biological replicates. The control samples were drilled cotton flower buds into which no eggs were introduced.

RNA isolation and sample preparation

Total RNA extraction was performed from 100 mg of macerated plant tissue in liquid nitrogen using the Invisorb Spin Plant RNA Mini kit (Invitek) according to the manufacturer’s protocol. RNA quality and quantity were determined using a Nanodrop 2000 (Thermo Scientific) and a Bioanalyser Chip RNA 7500 series II (Agilent), respectively. The analysis of RNA integrity was judged by the RNA Integrity Number (RIN), which was calculated with 2100 Expert Software. The software algorithm, which was developed by Schroeder et al. [76], categorises total RNA quality on a scale from 1 to 10, in which 10 corresponds to intact RNA and 1 corresponds to highly degraded RNA. Generally, plant RNA with a RIN value of 6–7 was of acceptable quality for qPCR and RNAseq analyses. RNA Integrity Number (RIN) values were greater than 9.0 for all samples. Two biological replicates of inoculated flower buds and controls were used for transcriptome sequencing. Library preparation and massive parallel sequencing were performed by Eurofins MWG Operon (Huntsville, AL). Sequencing libraries were prepared using NEBNext Ultra Directional RNA Library Prep Kit (New England Biolabs, MA), which included Poly-A-containing mRNA isolation from 5 μg total RNA using two rounds of purification with poly-T oligo-attached magnetic beads and fragmentation by sonication. First strand cDNA was generated using reverse transcriptase and random primers. Following the second strand cDNA synthesis and adaptor ligation, 400 bp cDNA fragments were isolated using gel electrophoresis and amplified by PCR. The products were loaded onto an Illumina HiSeq™ 2000 instrument and subjected to 200 cycles of paired-end (2 × 100 bp) sequencing. The processing of fluorescent images into sequences, base-calling, and quality value calculations were performed using the Illumina data processing pipeline (version 1.8).

Mapping of short reads and assessment of differential gene expression

Raw reads were filtered to obtain high-quality reads by removing low-quality reads containing more than 30% bases with Q < 20. After trimming low-quality bases from the 5′ and 3′ ends of the remaining reads, the resulting high-quality reads were aligned against a G. hirsutum expressed sequence tag (EST) assembly (with 28,432 unique genes/contigs) as the transcriptome reference (cotton EST database; http://www.leonxie.com) [77] using the BWA package [78]. Differential expression was estimated and tested with the DEGseq software package in R-bioconductor 2.15 for each library with reference to the control [79]. The count data were normalised to the total number of counts, accounting for the variance and the mean of the biological replicates. Contigs with adjusted p-value ≤ 0.05 and estimated absolute log2 (FC) ≥ 2 were determined to be significantly differentially expressed genes (DEG) and were selected.

Functional annotation

The DEG were subjected to blast using the program Blastx with the TAIR 9 protein database and further classified into categories according to the GO classification system (Additional file 2). The Blast2GO program (http://www.blast2go.com/b2ghome) [80] was also used to confirm the annotation of transcripts (Additional file 2). RNA-seq data can be accessed at the NCBI bioprojects under the accession number PRJNA245406. To identify relevant molecular mechanisms potentially associated with the response to cotton boll weevil larvae development within cotton flower buds, GSEA (Gene Set Enrichment Analysis) was performed [81]. A gene set was defined as all DEG, with annotations according to A. thaliana, that share the same ontology based on the GO database. The GSEA method identified biological processes (BP), molecular functions (MF) and cellular components (CC) that were overrepresented among the list of DEG (Figure 1 and Additional file 3). The overrepresentation was assessed with a statistical score based on a hypergeometric test with p-values ≤ 0.005. In addition, the differentially expressed genes were also functionally analysed using MapMan software, which is a user-driven tool that displays large genomic datasets onto diagrams of metabolic pathways or other processes such as biotic stress [82] (Figure 2, adapted by [10]). Finally, the differentially expressed genes were compared with the public databases generated from microarray analysis of Arabidopsis thaliana leaves that were infested by diamond back moth (Plutella xylostella) larvae 24 h after the onset of herbivory (GEO:GSE10681) (Figure 6) [39].

Defence-related genes in response to infestation by cotton boll weevil larvae used for qPCR analysis

Real time PCR (qPCR) assays were performed on 32 differentially expressed genes identified by RNAseq analysis (Additional file 9). Twenty-eight of them were reported as up-regulated in cotton boll weevil-infested flower buds: three receptor-like kinase proteins (CCR4, RIPK and EVR); two proteins from the mitogen-activated kinase signalling cascade (MAPK3, MKK9); three calmodulin-like proteins (CML28, CML24, EDA39); two genes from the oxidative burst process (GRX480, RbohD); twelve TFs, including nine WRKY transcription factors (WRKY19, 22, 30, 33, 40, 46, 64, 70 and 72) and three AP2/ERE transcription factors (ERF-4, ERF-5, ERF-98); four genes involved in phytohormone biosynthesis (CPY707A3, AOS, ACS, ACO); and two ubiquitin ligase proteins involved in targeted protein degradation (PUB23 and CMPG1). Among the down-regulated DEGs, we evaluated the expression pattern of a germin-like protein (GER3) and three members of the heat shock protein family (HSP90.1, HSP90.2 and HSP20). Primers were designed using Primer 3 software tools (http://frodo.wi.mit.edu/primer3/) with a melting temperature of 60°C, an amplicon length of 150 to 200 bp, and a GC content of 50 to 60% [83].

Tissue fixation, embedding, sectioning, and laser microdissection

To assess the spatial expression pattern of the genes described above, tissues from two different areas were isolated from 6 mm cotton flower buds infested by cotton boll weevil larvae using laser microdissection (LMD). These areas (Figure 4) included the following: (a) a region near the damage caused by larvae feeding, consisting of stamen tissues and (b) another region farther away from the injured area, consisting of carpel tissue. The 6 mm cotton flower buds infested by larvae and control samples were fixed overnight at 4°C in Farmer’s Fixative (3:1 ethanol:acetic acid) and dehydrated. After fixation, samples were prepared as previously described [84]. Briefly, the samples were subjected to an ethanol/xylene series followed by a xylene/paraplast series before being embedded in paraplast X-tra (Sigma-Aldrich). Embedded samples were sectioned using a rotatory microtome (HM 325, Microm). The 16-μm-thick sections from the infected flower buds and control samples were mounted on Leica FrameSlides PET-Membrane 1.4 μm slides with methanol (No. 11505151). The slides were dried at 40°C for 5 min and then stored at 4°C until laser microdissection was performed. Laser microdissection was performed using a Leica LDM CTR 6500.

RNA preparation, RNA amplification, and “in situ” qPCR

Isolated areas of carpel and stamen with biological replicates of infested and control samples (total of 180–200 microdissected cuts for each sample) were collected in microfuge tubes containing 100 μl RNALater solution (Ambion) and were used for RNA isolation (Figure 4). RNA was extracted from the captured tissues using the RNeasy Plant Mini kit (Qiagen) according to the manufacturer’s protocol, including a 15 min DNase incubation step (RNase-free DNase Set, Qiagen). The eluted RNA was concentrated under a vacuum until the remaining volume was approximately 10 μl. The quality of total RNA extracted from LMD-collected tissues was assessed using an RNA 6000 Pico kit on the Agilent 2100 Bioanalyser (Agilent Technologies). RNA Integrity Number (RIN) values were greater than 6.0 for all samples. The isolated total RNA was amplified using a MessageAmp aRNA Kit (Ambion) following the manufacturer’s instructions. One round of amplifications was performed, and the resulting anti-sense RNA (aRNA) was quantified by OD260. Equal amounts of aRNA (5.0 μg) of the two samples in biological replicates were reverse transcribed using 0.5 μl of Random Primers (C1181, Promega) and Superscript III reverse transcriptase (Invitrogen) according to the manufacturer’s instructions. Polymerase chain reactions were performed in the 7500 Fast Real-Time PCR detection system (Applied Biosystem) and SYBR® Green was used to monitor dsDNA synthesis. PCR efficiencies and the optimal quantification cycle threshold or Cq values were estimated using the online Real time PCR Miner [85]. Two independent biological samples for each experimental condition were evaluated using triplet technical replicates. The reference genes GhACT4 and GhFBX6 were used to normalise the qPCR data and have been discussed previously [86] (Additional file 9). The relative expression level was calculated using the relative expression software tool (REST©), which compares two treatment groups with multiple data points in the sample compared to the control groups and calculates the relative expression ratio between them. The mathematical model used is based on the PCR efficiencies and the mean CP deviation between the sample and control group of target genes and is normalised to the mean CP deviation of the reference genes [83]. The advantage of REST is the use of a subsequent statistical test of the analysed CP values by a Pair-Wise Fixed Reallocation Randomization Test [87].

Phylogenetic analysis of WRKY genes involved in the response to cotton boll weevil feeding

To identify the twenty-one cotton WRKY genes that were differentially expressed in our RNAseq experiment, a phylogenetic tree based on the WRKY domain of Arabidopsis was conducted (Additional files 2 and 7). The WRKY domain boundary was defined as described in Eulgem et al. [33] (excluding the N-terminal domains of Group I). Motif detection was performed with MEME 4.0 software [88] and the multiple amino acid sequence alignment of WRKY domains were conducted using MUSCLE (http://www.ebi.ac.uk/Tools/msa/muscle/) from 72 AtWRKY protein sequences from Arabidopsis (downloaded from AtTFDB (http://arabidopsis.med.ohio-state.edu/AtTFDB/) and 21 cotton WRKY genes. Phylogenetic trees were constructed after using a model test to pick the most appropriate evolutionary model for our analysis in the Mega 5.2 program [89]. The best-fitting amino acid substitution model for these gene families was the JTT model [90] with an estimated γ-distribution parameter (G). The maximum-likelihood analyses were performed with the program PhyML version 3.0 (http://www.atgc-montpellier.fr/phyml/) [91] and assessment of node confidence was performed using 1,000 bootstrap replicates. The trees were visualised and optimised in the Mega 5.2 program (Figure 3). Additional file 1: JPEG file showing contig size distribution from the aligned reads of transcriptome sequencing of cotton flower buds. (TIFF 11 MB) Additional file 2: Full list of the up-regulated and down-regulated cotton genes found in the RNA-seq analysis. Only those differentially expressed genes with significant expression changes (adjusted p-value ≤ 0.05, |logFC| ≥ 2.0) are shown. Columns A-G are the results from Blastx with Arabidopsis, and columns H-P are the results from the Blast2GO program. (XLS 780 KB) Additional file 3: Molecular function (A) and Cellular component (B) ontologies overrepresented by the Gene Set Enrichment Analysis (GSEA). (TIFF 521 KB) Additional file 4: Subset of differentially expressed genes (DEG) in cotton flower buds in response to cotton boll weevil larvae feeding that were annotated with the GO term “plasma membrane”. (XLS 44 KB) Additional file 5: Subset of differentially expressed genes (DEG) in cotton flower buds in response to cotton boll weevil larvae feeding that were annotated with the GO term “cell wall”. (XLS 38 KB) Additional file 6: Biological processes (BP) overrepresented by the Gene Set Enrichment Analysis (GSEA). (XLS 58 KB) Additional file 7: The twenty-one cotton WRKY genes that were differentially expressed in our RNAseq experiment, the domain sequence used in the phylogenetic tree, the protein sequence and the nucleotide sequence. (XLS 77 KB) Additional file 8: Comparison of qPCR and RNA sequencing expression data. Thirty-two differentially expressed genes responding to cotton boll weevil larvae feeding were analysed for RNA abundance using qPCR. The log fold-change in RNA sequencing was estimated and tested by DEGseq software, in which the count data were normalised to the total number of counts, taking the variance and the mean of the biological replicates into account for each library with reference to the control (blue bars). qPCR results (red bars) were calculated with the relative REST software using a mathematical model based on the PCR efficiencies and the mean CP deviation between the sample and control group of target genes and were normalised to the mean CP deviation of reference genes. We calculated the Pearson correlation coefficient (r) between the different methods for all transcripts. The correlation coefficient was 0.9406. (TIFF 8 MB) Additional file 9: Primers used for the qPCR expression analysis of DEG in tissues isolated from cotton flower buds infested by cotton boll weevil larvae. (XLS 52 KB)
  81 in total

1.  A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach.

Authors:  S Whelan; N Goldman
Journal:  Mol Biol Evol       Date:  2001-05       Impact factor: 16.240

2.  Ethylene-responsive element-binding factor 5, ERF5, is involved in chitin-induced innate immunity response.

Authors:  Geon Hui Son; Jinrong Wan; Hye Jin Kim; Xuan Canh Nguyen; Woo Sik Chung; Jong Chan Hong; Gary Stacey
Journal:  Mol Plant Microbe Interact       Date:  2012-01       Impact factor: 4.171

Review 3.  Plant-insect interactions: molecular approaches to insect resistance.

Authors:  Natalie Ferry; Martin G Edwards; John A Gatehouse; Angharad M R Gatehouse
Journal:  Curr Opin Biotechnol       Date:  2004-04       Impact factor: 9.740

Review 4.  Signal crosstalk and induced resistance: straddling the line between cost and benefit.

Authors:  Richard M Bostock
Journal:  Annu Rev Phytopathol       Date:  2005       Impact factor: 13.078

Review 5.  Plant immunity to insect herbivores.

Authors:  Gregg A Howe; Georg Jander
Journal:  Annu Rev Plant Biol       Date:  2008       Impact factor: 26.379

Review 6.  Computational methods for transcriptome annotation and quantification using RNA-seq.

Authors:  Manuel Garber; Manfred G Grabherr; Mitchell Guttman; Cole Trapnell
Journal:  Nat Methods       Date:  2011-05-27       Impact factor: 28.547

7.  Signal signature and transcriptome changes of Arabidopsis during pathogen and insect attack.

Authors:  Martin De Vos; Vivian R Van Oosten; Remco M P Van Poecke; Johan A Van Pelt; Maria J Pozo; Martin J Mueller; Antony J Buchala; Jean-Pierre Métraux; L C Van Loon; Marcel Dicke; Corné M J Pieterse
Journal:  Mol Plant Microbe Interact       Date:  2005-09       Impact factor: 4.171

8.  MAPMAN: a user-driven tool to display genomics data sets onto diagrams of metabolic pathways and other biological processes.

Authors:  Oliver Thimm; Oliver Bläsing; Yves Gibon; Axel Nagel; Svenja Meyer; Peter Krüger; Joachim Selbig; Lukas A Müller; Seung Y Rhee; Mark Stitt
Journal:  Plant J       Date:  2004-03       Impact factor: 6.417

9.  Quantitative relationships between induced jasmonic acid levels and volatile emission in Zea mays during Spodoptera exigua herbivory.

Authors:  Eric A Schmelz; Hans T Alborn; Erika Banchio; James H Tumlinson
Journal:  Planta       Date:  2002-09-11       Impact factor: 4.116

10.  Systems analysis of chaperone networks in the malarial parasite Plasmodium falciparum.

Authors:  Soundara Raghavan Pavithra; Ranjit Kumar; Utpal Tatu
Journal:  PLoS Comput Biol       Date:  2007-09       Impact factor: 4.475

View more
  19 in total

1.  Differential transcriptome analysis of leaves of tea plant (Camellia sinensis) provides comprehensive insights into the defense responses to Ectropis oblique attack using RNA-Seq.

Authors:  Ya-Nan Wang; Lei Tang; Yan Hou; Ping Wang; Hua Yang; Chao-Ling Wei
Journal:  Funct Integr Genomics       Date:  2016-04-20       Impact factor: 3.410

2.  Utilization of microRNAs and their regulatory functions for improving biotic stress tolerance in tea plant [Camellia sinensis (L.) O. Kuntze].

Authors:  Anburaj Jeyaraj; Tamilselvi Elango; Xinghui Li; Guiyi Guo
Journal:  RNA Biol       Date:  2020-06-16       Impact factor: 4.652

3.  Dynamic transcriptome analysis and volatile profiling of Gossypium hirsutum in response to the cotton bollworm Helicoverpa armigera.

Authors:  Xin-Zheng Huang; Jie-Yin Chen; Hai-Jun Xiao; Yu-Tao Xiao; Juan Wu; Jun-Xiang Wu; Jing-Jiang Zhou; Yong-Jun Zhang; Yu-Yuan Guo
Journal:  Sci Rep       Date:  2015-07-07       Impact factor: 4.379

4.  Transgenic Cotton Plants Expressing Cry1Ia12 Toxin Confer Resistance to Fall Armyworm (Spodoptera frugiperda) and Cotton Boll Weevil (Anthonomus grandis).

Authors:  Raquel S de Oliveira; Osmundo B Oliveira-Neto; Hudson F N Moura; Leonardo L P de Macedo; Fabrício B M Arraes; Wagner A Lucena; Isabela T Lourenço-Tessutti; Aulus A de Deus Barbosa; Maria C M da Silva; Maria F Grossi-de-Sa
Journal:  Front Plant Sci       Date:  2016-02-19       Impact factor: 5.753

5.  Combined transcriptome and metabolome analyses to understand the dynamic responses of rice plants to attack by the rice stem borer Chilo suppressalis (Lepidoptera: Crambidae).

Authors:  Qingsong Liu; Xingyun Wang; Vered Tzin; Jörg Romeis; Yufa Peng; Yunhe Li
Journal:  BMC Plant Biol       Date:  2016-12-07       Impact factor: 4.215

6.  Transcriptome analysis reveals a comprehensive insect resistance response mechanism in cotton to infestation by the phloem feeding insect Bemisia tabaci (whitefly).

Authors:  Jianying Li; Lizhen Zhu; J Joe Hull; Sijia Liang; Henry Daniell; Shuangxia Jin; Xianlong Zhang
Journal:  Plant Biotechnol J       Date:  2016-03-31       Impact factor: 9.803

7.  Comparative transcriptome analysis of soybean response to bean pyralid larvae.

Authors:  Weiying Zeng; Zudong Sun; Zhaoyan Cai; Huaizhu Chen; Zhenguang Lai; Shouzhen Yang; Xiangmin Tang
Journal:  BMC Genomics       Date:  2017-11-13       Impact factor: 3.969

8.  Genome-wide transcriptomic and proteomic analyses of bollworm-infested developing cotton bolls revealed the genes and pathways involved in the insect pest defence mechanism.

Authors:  Saravanan Kumar; Mogilicherla Kanakachari; Dhandapani Gurusamy; Krishan Kumar; Prabhakaran Narayanasamy; Padmalatha Kethireddy Venkata; Amolkumar Solanke; Savita Gamanagatti; Vamadevaiah Hiremath; Ishwarappa S Katageri; Sadhu Leelavathi; Polumetla Ananda Kumar; Vanga Siva Reddy
Journal:  Plant Biotechnol J       Date:  2016-01-22       Impact factor: 9.803

9.  Transcriptome Analysis of Ramie (Boehmeria nivea L. Gaud.) in Response to Ramie Moth (Cocytodes coerulea Guenée) Infestation.

Authors:  Liangbin Zeng; Airong Shen; Jia Chen; Zhun Yan; Touming Liu; Zhaodong Xue; Yongting Yu
Journal:  Biomed Res Int       Date:  2016-02-29       Impact factor: 3.411

10.  Genome-wide identification of microRNAs responsive to Ectropis oblique feeding in tea plant (Camellia sinensis L.).

Authors:  Anburaj Jeyaraj; Shengrui Liu; Xiao Zhang; Ran Zhang; Mingzhu Shangguan; Chaoling Wei
Journal:  Sci Rep       Date:  2017-10-19       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.