Literature DB >> 26849139

Gene Structures, Evolution and Transcriptional Profiling of the WRKY Gene Family in Castor Bean (Ricinus communis L.).

Zhi Zou1, Lifu Yang1, Danhua Wang1, Qixing Huang2, Yeyong Mo1, Guishui Xie1.   

Abstract

WRKY proteins comprise one of the largest transcription factor families in plants and form key regulators of many plant processes. This study presents the characterization of 58 WRKY genes from the castor bean (Ricinus communis L., Euphorbiaceae) genome. Compared with the automatic genome annotation, one more WRKY-encoding locus was identified and 20 out of the 57 predicted gene models were manually corrected. All RcWRKY genes were shown to contain at least one intron in their coding sequences. According to the structural features of the present WRKY domains, the identified RcWRKY genes were assigned to three previously defined groups (I-III). Although castor bean underwent no recent whole-genome duplication event like physic nut (Jatropha curcas L., Euphorbiaceae), comparative genomics analysis indicated that one gene loss, one intron loss and one recent proximal duplication occurred in the RcWRKY gene family. The expression of all 58 RcWRKY genes was supported by ESTs and/or RNA sequencing reads derived from roots, leaves, flowers, seeds and endosperms. Further global expression profiles with RNA sequencing data revealed diverse expression patterns among various tissues. Results obtained from this study not only provide valuable information for future functional analysis and utilization of the castor bean WRKY genes, but also provide a useful reference to investigate the gene family expansion and evolution in Euphorbiaceus plants.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 26849139      PMCID: PMC4743969          DOI: 10.1371/journal.pone.0148243

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

WRKY transcription factors, defined by the presence of the conserved WRKY domain of approximate 60 amino acids, play an essential regulatory role in plant growth, development, metabolism, and biotic and abiotic stress responses [1-3]. Since the first WRKY-encoding gene was isolated from sweet potato (Ipomoea batatas) [4], its homologs have been found in a wide range of plants and several non-plant species including Giardia lamblia, Dictyostelium discoideum, diplomonads, social amoebae, fungi incertae sedis and amoebozoa [5,6]. Compared with low and non-plants, the WRKY genes in high plants were shown to be highly expanded. For example, there are 57 members in cucumber (Cucumis sativus), 58 in physic nut (Jatropha curcas), 59 in grapevine (Vitis vinifera), 72 in Arabidopsis thaliana, 103 in white pear (Pyrus bretschneideri), 105 in poplar (Populus trichocarpa), 105 in foxtail millet (Setaria italica) and more than 100 in rice (Oryza sativa) [7-14]. WRKY proteins contain one or two WRKY domains, comprising the highly conserved WRKYGQK heptapeptide at the N-termini and a novel zinc finger motif (Cx4–7Cx22–23HxH/C) at the C-termini [10]. Both of these two motifs are vital for the high binding affinity of the WRKY proteins to the consensus cis-acting element termed the W box (TTGACT/C) [15,16]. According to the number of WRKY domains and the features of their zinc finger motifs, WRKY proteins can be categorized into three main groups. The group I members have two WRKY domains and feature the zinc finger motif of C2H2. Both groups II and III members contain a single WRKY domain, and the group III members possess the C2HC zinc finger motif which is different from C2H2 as observed in groups I and II members. Base on the evolutionary relationship and certain amino acid motifs present outside the WRKY domain, the group II can be further divided into 5 subgroups (a–e) [10]. In contrast to the presence of a conserved PR intron located after the codon encoding arginine (N terminal to the zinc finger motif) of subgroups c-e as seen in the group III and the C-terminal WRKY domain of the group I, members of subgroups a and b harbor a VQR intron in the zinc finger motif instead [6,10,17]. Castor bean (Ricinus communis L.), a tropical perennial shrub that belongs to the Euphorbiaceae family, is one of the most important non-food oilseed crops cultivated for industrial, medicinal and cosmetic purposes. Although native to Africa, the economic importance of castor bean oil and its well-adaptation to unfavorable conditions has prompted its wide-domestication to many tropical, subtropical and warm temperate regions around the world [18,19]. Given the crucial role of WRKY transcription factors in plant adaptation, two independent groups performed the homology search against the recently available castor bean draft genome [20] for the RcWRKY genes [17,21]. The study performed by Li et al. [21] focused on the expression analysis of the 47 identified RcWRKY genes in roots, stems, leaves, male flowers, female flowers and fruits at different developmental stages (i.e. 7, 15, 30 and 45 days post-anthesis) by using quantitative real-time PCR (qRT-PCR). Another study carried out by Zou [17] described the identification of nine more family members (i.e. 56 RcWRKYs) based on the automatic annotation of the castor bean genome, mainly focusing on the analysis of the evolutionary relationships between RcWRKY members by using the conserved WRKY domains. However, when compared with physic nut, another Euphorbiaceae plant species without the occurrence of any recent whole-genome duplication as castor bean [20,22], the family number of castor bean [17] seems to be relatively small and several physic nut WRKY genes [7] have no counterparts in castor bean. These results suggest that the RcWRKY genes have not been fully identified or the loss of specific genes has occurred in the castor bean genome. Thereby, rechecking the RcWRKY gene family is still needed. Along with the 4.6 × draft genome of castor bean, as of Apr 2015, 88212 nucleotides and 62629 expressed sequence tags (ESTs) have been deposited in NCBI GenBank. In addition, RNA sequencing data from several tissues such as root, leaf, flower, seed and endosperm is also available in NCBI SRA, which includes 1,138,884 Roche 454 reads and 386,847,526 Illumina reads [23-26]. These datasets provide a good chance to analyze the castor bean WRKY gene family from a global view. In the present study, we take advantage of the genome sequences and available transcriptome data to identify the complete set of the RcWRKY genes and conduct the expert revision of their gene structures via mapping the ESTs and RNA sequencing reads against the scaffolds. Further, the sequence characteristics, evolutionary relationships and transcriptional profiling of the identified RcWRKY genes were also investigated.

Methods

Datasets and sequence retrieval

Sequences of 72 Arabidopsis and 58 physic nut WRKY proteins described before [7,10] were obtained from TAIR (release 10, http://www.arabidopsis.org/) and NCBI (http://www.ncbi.nlm.nih.gov/), respectively (the accession number are available in S1 Table). The genome sequences and annotation information of castor bean [20] were downloaded from phytozome v10.2 (http://phytozome.jgi.doe.gov/pz/portal.html), whereas the nucleotides, Sanger ESTs and raw RNA sequencing reads were downloaded from NCBI.

Identification and manual curation of the castor bean WRKY genes

To obtain the complete set of castor bean WRKY genes, the tBlastn search [27] was performed using a representative WRKY domain from each WRKY subgroups (I, IIa, IIb, IIc, IId, IIe and III) and the e-value was set to 10. Positive genomic sequences were also analyzed using the HMMER program [28] and Hidden Markov Model (HMM) trained with RcWRKYs. The presence of WRKY domains in candidate RcWRKY proteins was confirmed using the SMART program (http://smart.embl-heidelberg.de/) [29]. The predicted gene models were further checked with ESTs and raw RNA sequencing reads. Gene structures were displayed using GSDS [30]. Homology search for nucleotides or ESTs was performed using Blastn [27] and sequences with a similarity of more than 98% were taken into account, whereas RNA sequencing clean reads (see below) were mapped using Bowtie 2 [31] with default parameters and mapped read number of more than one was counted as expressed. The alternative splicing isoforms were identified using Cufflinks (v2.2.1) [32]. In addition, the ortholog of each RcWRKY in Arabidopsis and physic nut was identified using Blastp [27] (e-value, 1e−20) against AtWRKYs and JcWRKYs, and the reciprocal Blastp was performed to confirm true orthologs. Tandem or proximal duplications were considered when two duplicated genes were consecutive in the genome or separated by 20 or fewer gene loci, respectively.

Sequence alignments, phylogenetic analysis and classification of RcWRKY genes

Multiple alignments were performed using MUSCLE [33]. The alignment of all RcWRKY domains were displayed using Boxshade (http://www.ch.embnet.org/software/BOX_form.html), whereas the alignment including Dictyostelium discoideum WRKY1 [5] (UniProtKB accession number Q554C5; the N and C-terminal WRKY domain was denoted as DdWRKY1N or DdWRKY1C, respectively; the same as for other group I members), RcWRKYs, AtWRKYs and JcWRKYs were used for phylogenetic tree construction. By using DdWRKY1C as an outgroup, the tree was constructed using MEGA 6.0 [34] with the maximum likelihood method and with the bootstrap test replicated 1000 times. Classification of RcWRKYs into groups and subgroups was done based on the structural features and evolutionary relationships of the WRKY domains.

Protein properties and conserved motif analysis

Protein properties of RcWRKYs, e.g., the molecular weight (MW), isoelectric point (pI), and grand average of hydropathicity (GRAVY) were calculated using ProtParam (http://web.expasy.org/protparam/). Analysis for conserved motifs in RcWRKY proteins was carried out using MEME (http://meme.sdsc.edu/meme/cgi-bin/meme.cgi) [35]. The optimized parameters were: any number of repetitions; maximum number of motifs, 15; and the optimum width of each motif, between 6 and 50 residues. Subsequently, the MAST program was used to search detected motifs in protein databases. The online software 2ZIP (http://2zip.molgen.mpg.de/index.html) was used to predict the conserved Leu zipper motif, whereas HARF, LxxLL (x, any amino acid) and LxLxLx motifs were identified manually.

Gene expression analyses

To analyze the global expression profiles of RcWRKY genes among different tissues or certain tissue of developmental stages, RNA sequencing data of leaf (NCBI SRA accession number ERX021378), flower (ERX021379), endosperm (ERX021375 and ERX021376) and seed (ERX021377) described before [24] were examined. The clean reads were obtained by removing adaptor sequences, adaptor-only reads, reads with “N” rate larger than 10% (“N” representing ambiguous bases) and low quality reads containing more than 50% bases with Q-value≤5. Then, the clean reads were mapped to the 58 identified RcWRKY genes (coding sequence, CDS) and released transcripts using Bowtie 2 [31], and the RPKM (reads per kilo bases per million reads) method [36] was used for the expression annotation. Unless specific statements, the tools used in this study were performed with default parameters.

Results and Discussion

Characterization of 58 WRKY-encoding sequences in castor bean

The homology search resulted in 58 loci putatively encoding WRKY genes from 41 scaffolds of the castor bean genome. Among them, 57 loci were predicted by the genome annotation [20] and further annotated by the PlantTFDB which used the released gene models for the annotation of RcWRKY genes [37], whereas one more loci encoding 117 residues was identified from the scaffold28842 (Table 1) and its ortholog was also found in physic nut [7]. Since the gene models of RcWRKY genes were the result of an automatic annotation due to the lack of transcriptome data at that time, an expert revision of their gene structures was conducted via mapping the ESTs and reads against the scaffolds. Interestingly enough, the results showed that 20 out of the 57 predicted gene models seem not to be properly annotated (Table 1). The locus 29929.t000090 was predicted to encode 609 residues which is relatively shorter than its ortholog in physic nut (JcWRKY10, 740 residues) [7], however, hundreds of RNA sequencing reads indicated that the “TTNNNTTGAC” sequence was misassembled into its first exon. Thereby, this locus is promised to harbor four introns putatively encoding 711 residues (see S1 File). The locus 29820.t000050 was predicted to encode 558 residues, however, read mapping indicated that partial sequences of its second and third exons were annotated as the second intron, thus this locus is promised to encode 598 residues (see S2 File) which is similar to that of its physic nut ortholog (JcWRKY08, 576 residues) [7]. As for the locus 29635.t000028, though both the predicted and identified CDSs encode 510 residues, read mapping indicated that the “GCAA” sequence of the second intron was annotated as the second exon and the “GCAG” sequence of the third exon was annotated as the second intron (see S3 File). The locus 30174.t000563 was predicted to encode 468 residues, however, read mapping and ORF (open reading frame) analysis suggested that it represents only the 3’ sequence of the gene which is promised to encode 524 residues (see S4 File). The locus 29687.t000003 was predicted to contain five introns encoding 503 residues, however, sequence analysis indicated that the N-terminal WRKY domain of the deduced protein is incomplete. EST and read mapping suggested that this locus is promised to harbor four introns and putatively encode 511 residues (see S5 File). The locus 29848.t000095 was predicted to have two introns encoding 372 residues, however, read mapping indicated that it represents only the 3’ sequence of this gene which is promised to harbor four introns putatively encoding 451 residues. In addition, its third exon was also misannotated as an intron (see S6 File). The locus 30174.t000066 was predicted to encode 192 residues, however, read mapping indicated that this locus is promised to encode 196 residues (see S7 File). The locus 28040.t000001 was predicted to contain a single intron encoding 103 residues, however, read mapping and ORF analysis suggested that it represents only the 3’ sequence of this gene which is promised to have two introns putatively encoding 217 residues (see S8 File). The locus 29709.t000007 was predicted to encode 185 residues, however, it didn’t contain the complete WRKY domain. Instead, read mapping indicated that this locus is promised to encode 205 residues (see S9 File). The locus 29889.t000087 was predicted to encode 351 residues, however, read mapping and ORF analysis suggested that it represents only the 3’ sequence of the gene which is promised to encode 360 residues (see S10 File). The locus 30174.t000532 was predicted to encode 313 residues, however, read mapping indicated that this locus is promised to encode 308 residues (see S11 File). The locus 43951.t000001 was predicted to encode 195 residues, however, EST and read mapping indicated that another locus 30131.t000001 from scaffold43951 (1019 bp) also belongs to this gene, and the gene is promised to harbor three introns putatively encoding 318 residues (see S12 File). The locus 29848.t000101 was predicted to encode 211 residues, however, EST and read mapping indicated that this locus is promised to encode 330 residues (see S13 File). The locus 29848.t000100 was predicted to contain three introns encoding 139 residues, however, sequence analysis revealed that its WRKY domain is incomplete. Further read mapping indicated that this locus is promised to harbor three introns putatively encoding 242 residues. The first exon and the first intron of this gene were not annotated previously, whereas partial sequences of its fourth exon were not annotated or misannotated as the third intron (see S14 File). The locus 29736.t000019 was predicted to contain three introns encoding 562 residues, however, read mapping indicated that this locus is promised to harbor four introns putatively encoding 634 residues (see S15 File). The locus 29598.t000004 was predicted to contain three introns encoding 263 residues, however, read mapping indicated that this locus is promised to harbor two introns putatively encoding 353 residues and partial sequence of its first exon was misannotated as an intron (see S16 File). The locus 29644.t000015 was predicted to contain one intron encoding 105 residues, however, EST and read mapping indicated that another locus 29644.t000016 on the same scaffold also belongs to this gene, and the misannotation was resulted from the “TCTTGCTCCAGAAGAG” sequence that was misassembled into its first exon. Thereby, this locus is promised to harbor two introns putatively encoding 356 residues (see S17 File). The locus 28455.t000009 was predicted to contain four introns encoding 367 residues, however, read mapping indicated that this locus is promised to harbor two introns putatively encoding 317 residues (see S18 File). The locus 27996.t000002 was predicted to contain four introns encoding 466 residues, however, read mapping indicated that this locus is promised to harbor two introns putatively encoding 480 residues, and partial sequences of its first exon and intron were misannotated as an intron or an exon, respectively (see S19 File). The locus 28690.t000001 was predicted to contain three introns encoding 287 residues, however, read mapping indicated that this locus is promised to harbor two introns putatively encoding 339 residues (see S20 File).
Table 1

List of the 58 RcWRKY genes identified in this study.

Gene nameScaffold IDPredicted positionLocus IDTranscript IDIdentified positionEST hitsExpressedASaASb(Sub)group and commentsDeduced polypeptideAt_orthologJc_ortholog
Length (aa)MW (kDa)pIGRAVY
RcWRKY01scaffold2994950158–5201329949.t00000729949.m00012349111–52359-Yes-YesI48452.897.04-0.921AtWRKY01JcWRKY01
RcWRKY02scaffold27613207515–21365027613.t00003227613.m000639207484–2140162Yes-YesI56260.786.84-0.789AtWRKY20JcWRKY09
RcWRKY03scaffold29929510214–51286129929.t00009029929.m004587509685–512932-Yes--I, misassembled71177.455.66-0.675AtWRKY20JcWRKY10
RcWRKY04scaffold2896627612–2399828966.t00000328966.m00052429398–23813-Yes-YesI73379.826.13-0.761AtWRKY02,34JcWRKY11
RcWRKY05scaffold2971713600–1057629717.t00000229717.m00022213200–1057243YesYesYesI57563.476.71-1.043AtWRKY33,25,26JcWRKY07
RcWRKY06scaffold29820294372–29166129820.t00005029820.m001029294438–289616-Yes-YesI, misannotated59865.527.59-0.770AtWRKY33,25,26JcWRKY08
RcWRKY07scaffold29635203409–19881529635.t00002829635.m000468203640–1985431Yes-YesI, misannotated51055.697.33-0.812AtWRKY04,03JcWRKY06
RcWRKY08scaffold301743380314–338449630174.t00056330174.m0091663380374–33845089Yes--I, misannotated52457.127.75-0.837AtWRKY04,03JcWRKY05
RcWRKY09scaffold29805189773–19190029805.t00003529805.m001504187614–192328-Yes-YesI47452.008.59-0.946AtWRKY44JcWRKY04
RcWRKY10scaffold2968715684–2100929687.t00000329687.m00056215455–216241Yes-YesI, misannotated51156.015.67-0.811AtWRKY32JcWRKY02
RcWRKY11scaffold29848493883–49220129848.t00009529848.m004539494292–491256-Yes-YesI, misannotated45145.848.65-0.754AtWRKY32JcWRKY03
RcWRKY12scaffold297714111–871929771.t00000129771.m0000724111–8868-Yes--IIc21524.336.83-0.966AtWRKY51,50,59,68JcWRKY12
RcWRKY13scaffold29739137722–13685829739.t00002229739.m003586137722–136462-Yes-YesIIc15918.035.98-0.959AtWRKY50,51,59,68JcWRKY14
RcWRKY14scaffold28644112321–11440128644.t00002228644.m000915112007–114711-Yes--IIc16819.405.49-1.001AtWRKY51,50,59,68JcWRKY13
RcWRKY15scaffold29929718638–72048529929.t00012729929.m004624718359–720760-Yes--IIc20322.799.15-0.817AtWRKY75,45JcWRKY17
RcWRKY16scaffold30190612076–61358530190.t00014430190.m010908611849–613809-Yes--IIc19422.289.30-0.844AtWRKY75,45JcWRKY18
RcWRKY17scaffold301472203761–220439330147.t00074530147.m0144742203665–2204574-Yes--IIc16418.969.49-1.075AtWRKY75,45JcWRKY19
RcWRKY18scaffold301742076040–207775430174.t00006630174.m0086692075781–2078009-Yes-YesIIc, misannotated19622.428.93-0.622AtWRKY43,24,56JcWRKY21
RcWRKY19scaffold301903026476–302721930190.t00051430190.m0112783026363–3027328-Yes--IIc18521.069.01-0.661AtWRKY56,24,43JcWRKY20
RcWRKY20scaffold2804031334–2820628040.t00000128040.m00003532051–27986-Yes--IIc, misannotated21724.879.34-0.724AtWRKY13JcWRKY15
RcWRKY21scaffold2970941452–4307029709.t00000729709.m00117141208–44408-Yes-YesIIc, misannotated20523.607.07-1.020AtWRKY12JcWRKY16
RcWRKY22scaffold29889412384–41413329889.t00008729889.m003321412309–4141331Yes--IIc, misannotated36039.986.32-0.793AtWRKY48JcWRKY23
RcWRKY23scaffold29693677606–67630129693.t00009829693.m002060677801–676041-Yes--IIc31034.446.97-0.902AtWRKY28,08JcWRKY26
RcWRKY24scaffold300761245583–124428430076.t00018730076.m0046231245777–1243970-Yes--IIc31735.196.40-0.619AtWRKY23JcWRKY22
RcWRKY25scaffold301743224608–322143430174.t00053230174.m0091353225599–32186301Yes-YesIIc, misannotated30833.786.19-0.909AtWRKY57JcWRKY24
RcWRKY26scaffold29767166469–16365429767.t00001029767.m000208166564–163614-Yes--IIc29632.895.50-0.700AtWRKY49JcWRKY38
RcWRKY27scaffold28842N/AN/AN/A266259–272883-Yes-YesIIc, not predicted11713.576.30-1.262-JcWRKY58
RcWRKY28scaffold301314672–649730131.t00000130131.m0068504396–67227Yes-YesIIa, misassembled31835.498.64-0.803AtWRKY40,18,60JcWRKY27
scaffold43951916–3143951.t00000143951.m000016
RcWRKY29scaffold29848538602–53721129848.t00010129848.m004545538811–5339801Yes-YesIIa, misannotated33036.538.52-0.629AtWRKY40,18,60JcWRKY28
RcWRKY30scaffold29848523071–52219729848.t00010029848.m004544523248–521822-Yes-YesIIa, misannotated24227.849.63-0.719AtWRKY40,18,60JcWRKY29
RcWRKY31scaffold29842273087–27552729842.t00005229842.m003555272719–2756581YesYesYesIIb58063.105.95-0.686AtWRKY42,06,31JcWRKY31
RcWRKY32scaffold30010386731–38430630010.t00002530010.m000675386795–3840652Yes--IIb65270.285.89-0.735AtWRKY42,06,31JcWRKY32
RcWRKY33scaffold30076678694–68128230076.t00011230076.m004548678304–6818371YesYesYesIIb49853.868.01-0.545AtWRKY47JcWRKY30
RcWRKY34scaffold30064205783–20334030064.t00002830064.m000506205918–203262-Yes--IIb55961.326.42-0.611AtWRKY47JcWRKY33
RcWRKY35scaffold29736182805–18691929736.t00001929736.m002023182186–187534-Yes-YesIIb, misannotated63469.067.58-0.837AtWRKY72,61JcWRKY35
RcWRKY36scaffold29822915514–91928029822.t00015929822.m003484915472–9192806Yes--IIb56060.076.51-0.606AtWRKY72,61JcWRKY37
RcWRKY37scaffold301474428301–443202430147.t00035830147.m0140874428301–4432024-Yes--IIb65170.626.23-0.765AtWRKY72,61JcWRKY36
RcWRKY38scaffold299904397–643329990.t00000129990.m0004974081–6735-Yes--IIb53258.995.25-0.814AtWRKY09JcWRKY34
RcWRKY39scaffold2984896608–9481229848.t00002029848.m00446496843–9477746Yes-YesIId32134.619.54-0.485AtWRKY11,17JcWRKY43
RcWRKY40scaffold2959824172–2275229598.t00000429598.m00044524740–22377-Yes-YesIId, misannotated35339.929.75-0.663AtWRKY74,39JcWRKY40
RcWRKY41scaffold2964480028–7957829644.t00001529644.m00018781125–7886145YesYesYesIId, misassembled35638.959.43-0.630AtWRKY07JcWRKY41
scaffold2964481125–8011729644.t00001629644.m000188
RcWRKY42scaffold29883450119–45207829883.t00006329883.m002008449401–452648-Yes-YesIId35339.759.66-0.867AtWRKY21,39,74JcWRKY39
RcWRKY43scaffold301701224146–122566230170.t00024430170.m0138321224113–12262291Yes--IId37741.439.59-0.773AtWRKY07,15JcWRKY42
RcWRKY44scaffold28455136512–13459028455.t00000928455.m000362137020–134740-Yes--IId, misannotated31735.369.76-0.584AtWRKY21JcWRKY44
RcWRKY45scaffold301742058419–206007930174.t00006030174.m0086632058419–20600939YesYesYesIIe34737.775.92-0.676AtWRKY22JcWRKY47
RcWRKY46scaffold301903016527–301797130190.t00051230190.m0112763016527–3018091-Yes--IIe33437.454.98-0.601AtWRKY29JcWRKY45
RcWRKY47scaffold301692000467–199890730169.t00035830169.m0065812000700–1998672-Yes--IIe45950.165.69-0.814AtWRKY27JcWRKY48
RcWRKY48scaffold300761384793–138361230076.t00021230076.m0046481385658–1383503-Yes-YesIIe26530.195.33-0.988AtWRKY69,69JcWRKY46
RcWRKY49scaffold30026178343–17995230026.t00002530026.m001461178328–1802792Yes--IIe26730.025.90-1.064AtWRKY69,69JcWRKY50
RcWRKY50scaffold279967134–991227996.t00000227996.m0001456705–10469-Yes-YesIIe, misannotated48052.065.28-0.774AtWRKY35,14JcWRKY49
RcWRKY51scaffold301903508463–350705530190.t00005030190.m0108143508665–35051322YesYesYesIII33838.015.41-0.831AtWRKY41,53JcWRKY54
RcWRKY52scaffold301741952662–195434730174.t00004830174.m0086511952428–1954708-Yes--III33337.805.38-0.786AtWRKY30JcWRKY51
RcWRKY53scaffold301691742567–174465630169.t00031030169.m0065331952662–1954347-Yes-YesIII37041.175.54-0.697AtWRKY41,53JcWRKY53
RcWRKY54scaffold2869013680–1222428690.t00000128690.m0000251742258–1744988-Yes--III, misannotated33938.055.56-0.663AtWRKY41,53JcWRKY52
RcWRKY55scaffold29729344149–34216929729.t00006329729.m00233013973–11973-Yes--III31835.555.94-0.704AtWRKY55JcWRKY55
RcWRKY56scaffold29729570667–56889329729.t00010329729.m002370344520–341549-Yes--III33136.915.94-0.693AtWRKY55JcWRKY55
RcWRKY57scaffold29729563671–56493629729.t00010229729.m002369570667–5687141Yes-YesIII31435.455.87-0.603AtWRKY70JcWRKY57
RcWRKY58scaffold29915143043–14130429915.t00001529915.m000479563637–565065-YesYesYesIII33037.075.53-0.667AtWRKY70JcWRKY56

a Based on the EST data.

b Based on the RNA sequencing data.

N/A, not available.

“-”, not detected.

a Based on the EST data. b Based on the RNA sequencing data. N/A, not available. “-”, not detected. Based on the structural features (Fig 1) and evolutionary relationships (Fig 2, see below), a systematic name was assigned to each of the 58 RcWRKY genes (Table 1). Eleven members that contain two WRKY domains and feature the C2H2-type zinc finger motif (N: Cx4Cx22-23HxH; C: Cx4Cx23HxH) were categorized into the group I, whereas the remainings that harbor a single WRKY domain were categorized into the group II (39 members, featuring the C2H2 zinc finger: Cx4-5Cx23HxH) or III (8 members, featuring the C2HC zinc finger: Cx7Cx23HxC) (Table 1 and Fig 1). RcWRKY genes of the group II were further divided into 5 subgroups, i.e., IIa (3), IIb (8), IIc (16), IId (6) and IIe (6) (Fig 3). As shown in Fig 2, RcWRKY26 and RcWRKY27 seem to form two new subgroups: RcWRKY26, JcWRKY38 and AtWRKY49 were clustered together and shown to be closer to the N-terminal WRKY domains, whereas RcWRKY27 and its ortholog JcWRKY58 were closer to the group III members. However, both of them exhibit a zinc finger pattern Cx4Cx23HxH as observed in the subgroup IIc and the C-terminal WRKY domains of group I members (Fig 1). Thereby, they were classed into the subgroup IIc in this study. Compared with Arabidopsis, castor bean and physic nut have fewer family members in any (sub)group. Although the total number of family members is the same between castor bean and physic nut, castor bean contains one more group III member but one fewer subgroup IIc (Fig 3).
Fig 1

Comparison of the WRKY domain sequences from 58 RcWRKY proteins.

WRKY..N/C represents the N or C-terminal WRKY domain of group I members, respectively. “-” has been inserted for the optimal alignment. Conserved amino acid residues are shown in gray and the highly conserved WRKYGQ/KK heptapeptide and C2H2/C and residues are indicated by “*”. The four β-strands are indicated by right arrows. For each (sub)group, the position of a conserved intron is indicated by a down arrow.

Fig 2

Phylogenetic analysis of RcWRKY proteins with Arabidopsis and physic nut homologs.

The WRKY domains (WRKY..N/C representing the N and C-termini of group I members, respectively) extracted from deduced amino acid sequences were performed using MUSCLE and the phylogenetic tree adopting DdWRKY1C as an outgroup was constructed using bootstrap maximum likelihood tree (1000 replicates) method and MEGA6 software. The distance scale denotes the number of amino acid substitutions per site. The name of each (sub)group is indicated next to the corresponding group. Species and accession numbers are listed in Table 1 and S1 Table.

Fig 3

Distribution of the 58 RcWRKY genes and their Arabidopsis and physic nut homologs in subgroups.

Comparison of the WRKY domain sequences from 58 RcWRKY proteins.

WRKY..N/C represents the N or C-terminal WRKY domain of group I members, respectively. “-” has been inserted for the optimal alignment. Conserved amino acid residues are shown in gray and the highly conserved WRKYGQ/KK heptapeptide and C2H2/C and residues are indicated by “*”. The four β-strands are indicated by right arrows. For each (sub)group, the position of a conserved intron is indicated by a down arrow.

Phylogenetic analysis of RcWRKY proteins with Arabidopsis and physic nut homologs.

The WRKY domains (WRKY..N/C representing the N and C-termini of group I members, respectively) extracted from deduced amino acid sequences were performed using MUSCLE and the phylogenetic tree adopting DdWRKY1C as an outgroup was constructed using bootstrap maximum likelihood tree (1000 replicates) method and MEGA6 software. The distance scale denotes the number of amino acid substitutions per site. The name of each (sub)group is indicated next to the corresponding group. Species and accession numbers are listed in Table 1 and S1 Table. Although most RcWRKYs harbor the conserved heptapeptide WRKYGQK, the WRKYGKK variety was also observed in three members (i.e. RcWRKY12, RcWRKY13 and RcWRKY14) (Fig 1) as seen in physic nut, Arabidopsis and other plant species [7,10,12]. Except for the automatic genome annotation, homology analysis showed that no cDNA sequences of the 58 identified RcWRKY genes were reported in any public database. Nevertheless, 20 members had EST hits in NCBI GenBank (as of Apr 2015). Though most of them had only one hit, we still observed that three members (RcWRKY39, RcWRKY41 and RcWRKY05) matched more than 40 ESTs (Table 1). Further, read alignments against RNA sequencing data of root, leaf, flower, seed and endosperm supported the expression of other 38 RcWRKY genes. In addition, alternative splicing isoforms existing in 7 or 31 RcWRKY-encoding loci were supported by Sanger ESTs or RNA sequencing reads, respectively (Table 1). As described above, the 1019-bp scaffold43951 was predicted to encode a WRKY domain-containing peptide. However, since it can be anchored to the 2696182-bp scaffold30131 sharing a 300-bp overlapping sequence, thus the scaffold30131 instead of scaffold43951 was counted as one WRKY-encoding scaffold. Among these 41 WRKY-encoding scaffolds, nine of them, i.e., scaffold30174 (5), scaffold29848 (4), scaffold30190 (4), scaffold30076 (3), scaffold29729 (3), scaffold29929 (2), scaffold30147 (2), scaffold29644 (2) and scaffold30169 (2), were shown to encode more than one WRKY genes, whereas the remainings encode a single one (Table 1). The exon-intron structures of the 58 RcWRKY genes were investigated based on the optimized gene models. Though all the deduced polypeptides of the RcWRKY genes contain one or two complete WRKY domains (Fig 1), the length of these amino acid sequences is highly distinct (Table 1). Compared with the CDS length (354–2202 bp), the gene length (from start to stop codons) of RcWRKYs is even more variable (633–6280 bp) (Fig 4). All RcWRKY genes contain at least one intron in their CDSs: 5 have one intron; 30 (more than 51.7%) have two introns, which include all members of (sub)groups IId, IIe and III; 7 have three introns; 11 have four introns; and 5 have five introns (Fig 4). Except for RcWRKY29, similar exon-intron structures were also observed in physic nut [7], a plant species also belonging to the Euphorbiaceae family and having diverged from castor bean approximately 49.4 million years ago [20]. Although the peptide length is very similar, RcWRKY29 (CDS, 993 bp) was shown to contain three introns (Fig 4); in contrast, its physic nut ortholog (JcWRKY28, CDS, 996 bp) has four introns [7]. Sequence analysis indicated that RcWRKY29 has lost the second intron as observed in physic nut. Without any exception, all RcWRKY genes harbor one intron in the WRKY domain-coding sequences (the C-terminal WRKY domain of group I members) (Fig 1). In members of subgroups a and b, the conserved intron presents in the zinc finger motif (24 codons further towards the C-terminus), whereas in groups I and III, and subgroups c–e, the intron is located after the second base of the arginine codon close to the N-termini of the zinc finger motif (Fig 1). Similar results were also observed in Arabidopsis and other plant species [6,10], suggesting that this is a general feature of the entire gene family.
Fig 4

Exon-intron structures of the 58 identified RcWRKY genes.

The graphic representation of the optimized gene models is displayed using GSDS.

Exon-intron structures of the 58 identified RcWRKY genes.

The graphic representation of the optimized gene models is displayed using GSDS.

Phylogenetic analysis of RcWRKY proteins

The homology analysis via Blastp showed that the 58 RcWRKYs have 56 or 36 counterparts in physic nut and Arabidopsis, respectively (Table 1), suggesting specific gene expansion and gene loss occurred in these plant species. Since the amino acid sequences beyond the WRKY domain are highly variable, the WRKY domain sequences were extracted from D. discoideum, Arabidopsis, physic nut and castor bean WRKY proteins, and used for the phylogenetic tree construction. D. discoideum, a slime mold closely related to the lineage of animals and fungi, was shown to encode a single group I-like WRKY gene which appears to be obtained via lateral gene transfer having occurred pre-date the formation of the WRKY groups in flowering plants [10,38]. The tree adopting DdWRKY1C as an outgroup was shown in Fig 2. According to the phylogenetic tree, a high number of Arabidopsis WRKY family members were grouped in pairs (Fig 2), corresponding to the occurrence of one whole-genome triplication event and two recent doubling events [39,40]. In contrast, few gene pairs were identified in castor bean as seen in physic nut (Fig 2). RcWRKY55 and RcWRKY56 were clustered together with their closest homolog in physic nut (JcWRKY55) (Fig 2). Both of them were clustered in scaffold29729 (spaced by 39 loci) (Table 1), indicating that they were resulted from proximal duplication after the divergence of castor bean and physic nut. In addition, the C-terminal WRKY domains of RcWRKY08 and RcWRKY09 were also clustered together apart from that of JcWRKY05 and JcWRKY06, however, the N-terminal WRKY domains of RcWRKY08 and RcWRKY09 were clustered with that of JcWRKY05 and JcWRKY04, respectively; moreover, the Blastp analysis indicated the ortholog of RcWRKY08 and RcWRKY09 is JcWRKY05 or JcWRKY04, respectively. Thereby, RcWRKY08 and RcWRKY09 are promised to emerge before the divergence of castor bean from physic nut. The homology analysis also suggested that the castor bean has lost the ortholog of JcWRKY25, since its ortholog was detected in another two Euphorbiaceae plants, i.e., cassava (Manihot esculenta) and rubber tree (Hevea brasiliensis) ([41,42] Zou et al., unpublished data).

Protein properties and conserved motifs beyond the WRKY domain

The predicted RcWRKY proteins have an average length of about 383 residues, with the minimum of 117 residues for RcWRKY27 and the maximum of 733 residues for RcWRKY04, whereas the average molecular weight is about 42.22 kDa, with the minimum of 13.57 kDa for RcWRKY27 and the maximum of 79.82 kDa for RcWRKY04, which is consistent with their peptide length. Although harboring an average pI value of 7.08, more than 58.62% RcWRKY proteins have a pI value of less than 7, indicating that most of them are acid. All RcWRKY proteins were predicted to harbor a GRAVY value (average: -0.78) of less than 0, indicating their hydrophilic feather. According to the 2ZIP analysis, two RcWRKY proteins (i.e. RcWRKY29 and RcWRKY33) were predicted to harbor a conserved Leu zipper motif, which was shown to be involved in dimerization and DNA binding [43,44]. The HARF motif was identified in three subgroup IId members, RcWRKY39, RcWRKY41 and RcWRKY43, although little is known about its exact function. LxxLL, a coactivator motif, was not found in any of the 58 RcWRKY proteins. In contrast, the active repressor motif LxLxLx were identified in two out of the three subgroup IIa members (i.e. RcWRKY28 and RcWRKY30) and four out of eight subgroup IIb members (i.e. RcWRKY35, RcWRKY36, RcWRKY37 and RcWRKY38). To better understand the similarity and diversity of motif compositions among different RcWRKYs, a phylogenetic tree based on the full-length RcWRKY proteins was constructed (Fig 5) and the motifs in RcWRKY protein sequences were predicted using MEME (Fig 5, Table 2). Among 15 identified motifs, motifs 1, 2, 3 and 10 were characterized as WRKY domains that are broadly distributed across the RcWRKYs; the motif 9, characterized as the nuclear localization signal (NLS) sequence, was found in all members of subgroups IIa and IIb. In contrast, little information is available for other motifs: the motif 4 was found in most members of the group I, subgroups IIb and IIc; motifs 5 and 10 were found in most members of groups I and III; the motif 13 was found in the subgroup IId and group III; motifs 6, 7, 8 and 11 are limited to subgroups IIa and IIb members; motifs 12 and 15 are unique in the group III or I, respectively.
Fig 5

Structural and phylogenetic analysis of RcWRKY proteins.

The unrooted phylogenetic tree resulting from the full-length amino acid alignment of all the RcWRKY proteins is shown on the left side of the figure. The different colored balls at the bottom of the figure indicate different groups. The distribution of conserved motifs among the RcWRKY proteins is shown on the right side of the figure. Different motif types are represented by different color blocks as indicated at the bottom of the figure. The same color in different proteins indicates the same group or motif.

Table 2

Motif sequences of 58 RcWRKY proteins identified by the MEME tools.

MotifE-valueSitesWidthBest possible match
Motif13.0E-10475026DGYRWRKYGQKMVKGNPYPRSYYRCT
Motif25.2E-8715029GCPVRKHVERCAEDPTMVITTYEGEHNH
Motif39.7E-3261731DILDDGYRWRKYGQKPIKNSPHPRGYYRCTH
Motif48.80E-1142621KKKGEKKIREPRFAFQTRSEV
Motif58.80E-891721ERDHDGQIFEIIYKGTHNCPK
Motif63.90E-951141LEVLQAELERMKEENERLRQMLTQMCKNYNALQMHFCELMQ
Motif76.20E-781027VEAATAAITADPNFTAALAAAITSIIG
Motif81.20E-72836AAMAMASTTSAAASMLLSGSSSSADGIMNHNTF
Motif95.90E-58829CASSGRCHCSKRRKMRVKRVIRVPAISNK
Motif104.20E-30188NCPAKKKV
Motif113.40E-29821MASISASAPFPTITLDLTHS
Motif123.00E-24625WEQHTLVGELIQGRELARQLRIHLN
Motif131.10E-221418LVQKIVSKFKKVLSLLNW
Motif141.80E-17107NHHHHHH
Motif154.00E-20721SPYITIPPGLSPTALLDSPVF

Structural and phylogenetic analysis of RcWRKY proteins.

The unrooted phylogenetic tree resulting from the full-length amino acid alignment of all the RcWRKY proteins is shown on the left side of the figure. The different colored balls at the bottom of the figure indicate different groups. The distribution of conserved motifs among the RcWRKY proteins is shown on the right side of the figure. Different motif types are represented by different color blocks as indicated at the bottom of the figure. The same color in different proteins indicates the same group or motif.

Distinct expression profiles of RcWRKY family members in various tissues

To gain more information on the role of WRKY genes in castor bean, RNA sequencing data of leaf, male flower, endosperm and seed were investigated. The expanding true leaves, appearing after the first cotyledons and leaf-pair, represent the leaf tissue; the male flower tissue includes pollen and anthers but excludes sepals; the germinating seed tissue was obtained by soaking dry seeds in running water overnight followed by germination in the dark for 3 days; and the endosperm tissue includes two representative stages termed stages II/III (endosperm free-nuclear stage) and V/VI (onset of cellular endosperm development) [24]. Results showed that the expression of all 58 RcWRKY genes were detected in at least one of the examined tissues, i.e., 55 in leaf, 51 in male flower, 51 in endosperm and 51 in seed (Fig 6). And the cluster analysis showed that the expression pattern of RcWRKY genes was more similar between flower and seed, and two stages of endosperm (Fig 6), corresponding their biological characteristics. Among three genes not detected in leaves, RcWRKY14 was only and lowly expressed in male flowers, although previous qRT-PCR analysis showed that it was also expressed in roots and fruits at 50 days post-anthesis [21]. In contrast, its ortholog JcWRKY14 in physic nut was shown to be highly expressed in stems (shoot cortex), roots and seeds of late development (i.e. filling and maturation) stage as well as leaves [8]. RcWRKY16 was expressed in male flowers, germinating seeds and stage V/VI endosperm, and the expression levels were considerably low in seeds and endosperm, which is consistent with the qRT-PCR result [21]. Similar expression profile of its ortholog JcWRKY18 in physic nut was also observed [8]. RcWRKY36 was detected in stage II/III endosperm and germinating seeds, and the previous qRT-PCR analysis indicated that this gene was highly expressed in roots [21]. In physic nut, the expression of its ortholog JcWRKY37 was also shown to be restricted to roots [8]. Among seven genes not detected in male flowers, all of them were also not detected in stage V/VI endosperm; except for RcWRKY36, RcWRKY21, RcWRKY26, RcWRKY27, RcWRKY30, RcWRKY53 and RcWRKY56 were all detected in leaves; RcWRKY21 and RcWRKY27 were detected only in leaves; besides leaves, RcWRKY26 and RcWRKY30 were also detected in stage II/III endosperm, though the expression level was extremely low; RcWRKY53 was also detected lowly in stage II/III endosperm and germinating seeds; and RcWRKY56 was also detected in germinating seeds. Among seven genes not detected in endosperm, RcWRKY14, RcWRKY21, RcWRKY27 and RcWRKY56 were discussed above; RcWRKY12 was detected in leaves and male flowers which is consistent with the qRT-PCR result [21] and the expression pattern of its ortholog JcWRKY12 in physic nut [8]; RcWRKY15 and RcWRKY56 were lowly expressed in all other samples examined, in contrast, their physic nut orthologs JcWRKY17 and JcWRKY45 were shown to be highly expressed in roots, lowly expressed in stems and leaves, but not detected seeds of both early and late development stages [8]. Among seven genes (i.e. RcWRKY12, RcWRKY14, RcWRKY18, RcWRKY21, RcWRKY27, RcWRKY30 and RcWRKY55) not detected in germinating seeds, RcWRKY12, RcWRKY14, RcWRKY21 and RcWRKY27 were discussed above. RcWRKY18 was lowly expressed in all other samples examined. Compared with other tissues [21], qRT-PCR analysis showed that RcWRKY18 was considerably more expressed in roots, which is consistent with the root-preferred expression of its ortholog JcWRKY21 in physic nut [8]. RcWRKY30 was only detected in leaves and stage II/III endosperm, whereas its physic nut ortholog JcWRKY29 was shown to be lowly expressed in leaves, stems and roots, but not seeds [8]. RcWRKY55 was lowly expressed in all other examined samples except for stage V/VI endosperm, in contrast, the expression of its physic nut ortholog JcWRKY55 was shown to be restricted to roots and the expression level was extremely low [8].
Fig 6

Expression profiles of the 58 RcWRKY genes in leaf, flower, endosperm II/III, endosperm V/VI and seed.

Color scale represents RPKM normalized log2 transformed counts and red indicates low expression and yellow indicates high expression.

Expression profiles of the 58 RcWRKY genes in leaf, flower, endosperm II/III, endosperm V/VI and seed.

Color scale represents RPKM normalized log2 transformed counts and red indicates low expression and yellow indicates high expression. Based on the RPKM annotation, the total transcript abundance of RcWRKY genes in endosperm tissue (including both stages II/III and V/VI, with RPKM = 337.14 or 123.03, respectively) was relatively lower than that in other three tissues, i.e., leaf (RPKM = 585.83), male flower (RPKM = 576.19) and germinating seed (RPKM = 560.44) (Fig 6). RcWRKY58 (RPKM = 139.35), the most abundant WRKY family member in leaves, was detected in all other tissues examined, though the expression levels were considerably low. Similarly, its ortholog AtWRKY70 in Arabidopsis was also shown to be constitutively expressed during all leaf development stages [45,46]. Functional analysis indicated that AtWRKY70 plays a pivotal role in salicylic acid (SA)- and jasmonic acid (JA)-dependent defense signaling [47,48]. Moreover, AtWRKY70 together with AtWRKY54 co-operate as negative regulators of leaf senescence and modulate osmotic stress tolerance by regulating stomatal movement [46,49,50]. Besides highly expressed in leaves, its ortholog JcWRKY56 in physic nut was even more abundant in seeds of early development stage, and the expression levels in roots, stems and leaves were up-regulated by stresses such as drought and salinity [8]. RcWRKY49 (RPKM = 49.08), the most expressed RcWRKY gene in male flowers, was also lowly detected in other tissues, which is consistent with the qRT-PCR result [21]. In contrast, its ortholog JcWRKY50 in physic nut was expressed highly in roots, moderately in leaves and lowly in stems, and the expression levels were regulated by at least one of tested abiotic stresses, i.e. drought, salinity, phosphate starvation and nitrogen starvation [8]. Among two highly abundant RcWRKY genes in germinating seeds, RcWRKY42 (RPKM = 49.46) also represented the most expressed member in stages II/III (RPKM = 81.26) and V/VI (RPKM = 21.86) endosperm, whereas RcWRKY05 (RPKM = 47.04) was expressed moderately in male flowers (RPKM = 19.96) and leaves (RPKM = 7.90), lowly in stages II/III (RPKM = 2.61) and V/VI (RPKM = 0.62) (Fig 6). Although not detected in two seed development stages, the physic nut ortholog (JcWRKY39) of RcWRKY42 was highly expressed in roots, leaves and stems, and the expression levels were regulated by nitrogen starvation [8]. The expression levels of the physic nut ortholog (JcWRKY07) of RcWRKY05 were shown to be high in roots, leaves and early developmental seeds, and extremely low in stems [8]. The response of JcWRKY07 to drought, salinity and phosphate starvation stresses was observed in roots [8]. AtWRKY33, an Arabidopsis ortholog of RcWRKY05 was shown to function as a positive regulator of resistance toward the necrotrophic fungi Alternaria brassicicola and Botrytis cinerea [51,52], and gene overexpression can increases salt and heat tolerance [53,54]. As mentioned above, the total RcWRKY transcripts in stage II/III endosperm was two folds more than that in stage V/VI endosperm. Among 51 RcWRKY genes detected in endosperm, 34 members had a RPKM value exceeding 0.5 in at least one stage of developing endosperm (stages II/III and V/VI). Differential expression analysis indicated that 23 out of the 32 down-regulated RcWRKY genes and one out of two up-regulated genes exceeded two folds (Fig 6), suggesting their putative regulatory role in early endosperm development. In addition, RcWRKY genes are promised to be involved in the ABA-mediated seed filling. In vivo experiment showed that endogenous ABA levels were closely associated with storage material accumulation in developing castor bean seeds [55]. In vitro, exogenous ABA also enhanced the dry weight (including the accumulation of soluble sugar and total lipid content) of developing seeds cultured in a nutrient medium [56]. After the application of 10 μM ABA for 24 h, differential gene expression analysis indicated that 2568 genes were up or down-regulated at least two folds [56], which was shown to include 13 out of the 58 RcWRKY genes (S21 File). Among them, eleven (four group I members, two subgroup IId members, one subgroup IIa member, one subgroup IIb member, one subgroup IIc member, one subgroup IIe member and one group III member) were significantly up-regulated, whereas only two (one subgroup IIe member and one group III member) were down-regulated. RcWRKY41, the most up-regulated gene (more than 250 folds) (S21 File), was highly expressed in germinating seeds, leaves and male flowers (Fig 6), which is consistent with its high representative in Genbank EST database (Table 1); its ortholog AtWRKY11 in Arabidopsis, was also shown to be constitutively expressed and act as negative regulators of basal resistance to Pseudomonas syringae [57]. RcWRKY28, the second highly up-regulated gene (more than 15 folds) (S21 File), was expressed more in male flowers and germinating seeds than in leaves and endosperm, though its expression level was considerably lower in stage V/VI endosperm as compared with stage II/III (Fig 6); AtWRKY40, its ortholog in Arabidopsis, was also induced by ABA and acts as a transcriptional repressor in ABA signaling and abiotic stress but a positive regulator in effector-triggered immunity [58-63]. RcWRKY17, a group IIc member preferring to express in male flowers, female flowers and germinating seeds, was up-regulated for more than nine folds upon the ABA application; its ortholog AtWRKY75 in Arabidopsis, was shown to response to phosphate starvation, water deprivation, ethylene stimulus and biotic stress, and participate in lateral root development, leaf senescence and galactolipid biosynthesis [64-67]. RcWRKY45, a group IIe member preferring to express in germinating seeds and fruits at 50 days post-anthesis [21], was up-regulated for more than seven folds by ABA; AtWRKY22, its ortholog in Arabidopsis, was involved in dark-induced leaf senescence and submergence-mediated immunity [68-69]. These results suggested the putative role of RcWRKYs in the ABA signaling.

Conclusions

Based on the genome and transcriptome datasets, in the current study, a total of 58 WRKY genes were identified from castor bean, one of the most important non-food oilseed crops in the Euphorbiaceae family. According to the structural features and evolutionary relationships of the present WRKY domains, the identified RcWRKY genes were assigned to the group I, group II (subgroup a-e) and group III. The WRKY domain pattern was characterized as WRKYGQ/KKx13Cx4-7Cx22-23HxH/C. Compared with Arabidopsis that feathers a high number of duplicate genes, few gene pairs were identified in the RcWRKY gene family, corresponding to no recent whole-genome duplication event occurred in castor bean. Comparative genomics analysis also indicated that one gene loss, one intron loss and one recent proximal duplication occurred in the RcWRKY gene family as compared with physic nut, another Euphorbiaceae plant species underwent no recent whole-genome duplication event. Although only 20 family members had EST hits in public database, the expression of all 58 RcWRKY genes was supported by RNA sequencing reads derived from root, leaf, flower, seed and endosperm. Compared with tissues such as leaf, male flower and germinating seed, the total expression level of RcWRKY genes in endosperm tissue was shown to be relatively low. Distinct gene expression profiles were also observed in different developmental endosperm. Compared with stage II/III endosperm, 23 out of the 54 endosperm-expressed RcWRKY genes were down-regulated at least two folds at stage V/VI, whereas only one member was shown to be significantly up-regulated, suggesting their key regulatory role in early endosperm development. In a word, results obtained from this study not only provide global information in understanding the molecular basis of the WRKY gene family in castor bean, but also provide a useful reference to investigate the gene family expansion and evolution in Euphorbiaceus plants such as Hevea brasiliensis and Manihot esculenta, and other plant species that underwent recent whole-genome duplication events.

The gene model for RcWRKY03.

(PDF) Click here for additional data file.

The gene model for RcWRKY06.

(PDF) Click here for additional data file.

The gene model for RcWRKY07.

(PDF) Click here for additional data file.

The gene model for RcWRKY08.

(PDF) Click here for additional data file.

The gene model for RcWRKY10.

(PDF) Click here for additional data file.

The gene model for RcWRKY11.

(PDF) Click here for additional data file.

The gene model for RcWRKY18.

(PDF) Click here for additional data file.

The gene model for RcWRKY20.

(PDF) Click here for additional data file.

The gene model for RcWRKY21.

(PDF) Click here for additional data file.

The gene model for RcWRKY22.

(PDF) Click here for additional data file.

The gene model for RcWRKY25.

(PDF) Click here for additional data file.

The gene model for RcWRKY28.

(PDF) Click here for additional data file.

The gene model for RcWRKY29.

(PDF) Click here for additional data file.

The gene model for RcWRKY30.

(PDF) Click here for additional data file.

The gene model for RcWRKY35.

(PDF) Click here for additional data file.

The gene model for RcWRKY40.

(PDF) Click here for additional data file.

The gene model for RcWRKY41.

(PDF) Click here for additional data file.

The gene model for RcWRKY44.

(PDF) Click here for additional data file.

The gene model for RcWRKY50.

(PDF) Click here for additional data file.

The gene model for RcWRKY54.

(PDF) Click here for additional data file.

List of 13 differentially expressed RcWRKY genes upon the ABA treatment.

(PDF) Click here for additional data file.

List of the accession numbers of the WRKYs identified in Arabidopsis (72) and physic nut (58).

(XLSX) Click here for additional data file.
  66 in total

Review 1.  The role of WRKY transcription factors in plant abiotic stresses.

Authors:  Ligang Chen; Yu Song; Shujia Li; Liping Zhang; Changsong Zou; Diqiu Yu
Journal:  Biochim Biophys Acta       Date:  2011-09-20

2.  Integrated genome sequence and linkage map of physic nut (Jatropha curcas L.), a biodiesel plant.

Authors:  Pingzhi Wu; Changpin Zhou; Shifeng Cheng; Zhenying Wu; Wenjia Lu; Jinli Han; Yanbo Chen; Yan Chen; Peixiang Ni; Ying Wang; Xun Xu; Ying Huang; Chi Song; Zhiwen Wang; Nan Shi; Xudong Zhang; Xiaohua Fang; Qing Yang; Huawu Jiang; Yaping Chen; Meiru Li; Ying Wang; Fan Chen; Jun Wang; Guojiang Wu
Journal:  Plant J       Date:  2015-03       Impact factor: 6.417

3.  The Mg-chelatase H subunit of Arabidopsis antagonizes a group of WRKY transcription repressors to relieve ABA-responsive genes of inhibition.

Authors:  Yi Shang; Lu Yan; Zhi-Qiang Liu; Zheng Cao; Chao Mei; Qi Xin; Fu-Qing Wu; Xiao-Fang Wang; Shu-Yuan Du; Tao Jiang; Xiao-Feng Zhang; Rui Zhao; Hai-Li Sun; Rui Liu; Yong-Tao Yu; Da-Peng Zhang
Journal:  Plant Cell       Date:  2010-06-11       Impact factor: 11.277

4.  Members of a new family of DNA-binding proteins bind to a conserved cis-element in the promoters of alpha-Amy2 genes.

Authors:  P J Rushton; H Macdonald; A K Huttly; C M Lazarus; R Hooley
Journal:  Plant Mol Biol       Date:  1995-11       Impact factor: 4.076

5.  Characterization of a cDNA encoding a novel DNA-binding protein, SPF1, that recognizes SP8 sequences in the 5' upstream regions of genes coding for sporamin and beta-amylase from sweet potato.

Authors:  S Ishiguro; K Nakamura
Journal:  Mol Gen Genet       Date:  1994-09-28

6.  Functional divergence of duplicated genes formed by polyploidy during Arabidopsis evolution.

Authors:  Guillaume Blanc; Kenneth H Wolfe
Journal:  Plant Cell       Date:  2004-06-18       Impact factor: 11.277

7.  Identification and expression profiles of the WRKY transcription factor family in Ricinus communis.

Authors:  Hui-Liang Li; Liang-Bo Zhang; Dong Guo; Chang-Zhu Li; Shi-Qing Peng
Journal:  Gene       Date:  2012-05-01       Impact factor: 3.688

8.  The WRKY transcription factor superfamily: its origin in eukaryotes and expansion in plants.

Authors:  Yuanji Zhang; Liangjiang Wang
Journal:  BMC Evol Biol       Date:  2005-01-03       Impact factor: 3.260

9.  SMART: recent updates, new developments and status in 2015.

Authors:  Ivica Letunic; Tobias Doerks; Peer Bork
Journal:  Nucleic Acids Res       Date:  2014-10-09       Impact factor: 16.971

10.  Global analysis of WRKY transcription factor superfamily in Setaria identifies potential candidates involved in abiotic stress signaling.

Authors:  Mehanathan Muthamilarasan; Venkata S Bonthala; Rohit Khandelwal; Jananee Jaishankar; Shweta Shweta; Kashif Nawaz; Manoj Prasad
Journal:  Front Plant Sci       Date:  2015-10-26       Impact factor: 5.753

View more
  16 in total

1.  Genome-wide identification of WRKY transcription factors in kiwifruit (Actinidia spp.) and analysis of WRKY expression in responses to biotic and abiotic stresses.

Authors:  Zhaobin Jing; Zhande Liu
Journal:  Genes Genomics       Date:  2018-01-06       Impact factor: 1.839

2.  Influence of phosphorous fertilization on copper phytoextraction and antioxidant defenses in castor bean (Ricinus communis L.).

Authors:  Guoyong Huang; Muhammad Shahid Rizwan; Chao Ren; Guangguang Guo; Qingling Fu; Jun Zhu; Hongqing Hu
Journal:  Environ Sci Pollut Res Int       Date:  2016-11-23       Impact factor: 4.223

3.  Genome-wide identification and expression analyses of WRKY transcription factor family members from chickpea (Cicer arietinum L.) reveal their role in abiotic stress-responses.

Authors:  Muhammad Waqas; Muhammad Tehseen Azhar; Iqrar Ahmad Rana; Farrukh Azeem; Muhammad Amjad Ali; Muhammad Amjad Nawaz; Gyuhwa Chung; Rana Muhammad Atif
Journal:  Genes Genomics       Date:  2019-01-12       Impact factor: 1.839

4.  Genome-wide identification and comparative evolutionary analysis of the Dof transcription factor family in physic nut and castor bean.

Authors:  Zhi Zou; Xicai Zhang
Journal:  PeerJ       Date:  2019-02-05       Impact factor: 2.984

5.  Genome-wide identification and expression analysis of the phosphatase 2A family in rubber tree (Hevea brasiliensis).

Authors:  Jinquan Chao; Zhejun Huang; Shuguang Yang; Xiaomin Deng; Weimin Tian
Journal:  PLoS One       Date:  2020-02-05       Impact factor: 3.240

6.  Identification of the Group III WRKY Subfamily and the Functional Analysis of GhWRKY53 in Gossypium hirsutum L.

Authors:  Dongjie Yang; Yuanyuan Liu; Hailiang Cheng; Qiaolian Wang; Limin Lv; Youping Zhang; Guoli Song; Dongyun Zuo
Journal:  Plants (Basel)       Date:  2021-06-17

7.  Genome-wide identification and characterization of WRKY gene family in Salix suchowensis.

Authors:  Changwei Bi; Yiqing Xu; Qiaolin Ye; Tongming Yin; Ning Ye
Journal:  PeerJ       Date:  2016-09-07       Impact factor: 2.984

8.  Identification of the group IIa WRKY subfamily and the functional analysis of GhWRKY17 in upland cotton (Gossypium hirsutum L.).

Authors:  Lijiao Gu; Libei Li; Hengling Wei; Hantao Wang; Junji Su; Yaning Guo; Shuxun Yu
Journal:  PLoS One       Date:  2018-01-25       Impact factor: 3.240

9.  Genome-wide identification of the potato WRKY transcription factor family.

Authors:  Chao Zhang; Dongdong Wang; Chenghui Yang; Nana Kong; Zheng Shi; Peng Zhao; Yunyou Nan; Tengkun Nie; Ruoqiu Wang; Haoli Ma; Qin Chen
Journal:  PLoS One       Date:  2017-07-20       Impact factor: 3.240

10.  Identification of WRKY Gene Family from Dimocarpus longan and Its Expression Analysis during Flower Induction and Abiotic Stress Responses.

Authors:  Dengwei Jue; Xuelian Sang; Liqin Liu; Bo Shu; Yicheng Wang; Chengming Liu; Jianghui Xie; Shengyou Shi
Journal:  Int J Mol Sci       Date:  2018-07-25       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.