Literature DB >> 28280499

Deep Sequencing of Suppression Subtractive Hybridisation Drought and Recovery Libraries of the Non-model Crop Trifolium repens L.

Maciej Bisaga1, Matthew Lowe1, Matthew Hegarty1, Michael Abberton1, Adriana Ravagnani1.   

Abstract

White clover is a short-lived perennial whose persistence is greatly affected by abiotic stresses, particularly drought. The aim of this work was to characterize its molecular response to water deficit and recovery following re-hydration to identify targets for the breeding of tolerant varieties. We created a white clover reference transcriptome of 16,193 contigs by deep sequencing (mean base coverage 387x) four Suppression Subtractive Hybridization (SSH) libraries (a forward and a reverse library for each treatment) constructed from young leaf tissue of white clover at the onset of the response to drought and recovery. Reads from individual libraries were then mapped to the reference transcriptome and processed comparing expression level data. The pipeline generated four robust sets of transcripts induced and repressed in the leaves of plants subjected to water deficit stress (6,937 and 3,142, respectively) and following re-hydration (6,695 and 4,897, respectively). Semi-quantitative polymerase chain reaction was used to verify the expression pattern of 16 genes. The differentially expressed transcripts were functionally annotated and mapped to biological processes and pathways. In agreement with similar studies in other crops, the majority of transcripts up-regulated in response to drought belonged to metabolic processes, such as amino acid, carbohydrate, and lipid metabolism, while transcripts involved in photosynthesis, such as components of the photosystem and the biosynthesis of photosynthetic pigments, were up-regulated during recovery. The data also highlighted the role of raffinose family oligosaccharides (RFOs) and the possible delayed response of the flavonoid pathways in the initial response of white clover to water withdrawal. The work presented in this paper is to our knowledge the first large scale molecular analysis of the white clover response to drought stress and re-hydration. The data generated provide a valuable genomic resource for marker discovery and ultimately for the improvement of white clover.

Entities:  

Keywords:  MiSeq; SSH libraries; drought; recovery; white clover

Year:  2017        PMID: 28280499      PMCID: PMC5322231          DOI: 10.3389/fpls.2017.00213

Source DB:  PubMed          Journal:  Front Plant Sci        ISSN: 1664-462X            Impact factor:   5.753


Introduction

White clover (Trifolium repens L.) is an important component of sustainable agricultural systems, because of its high nutritional value and the ability of fixing atmospheric nitrogen. It is grown in temperate regions, mainly in mixed swards with companion ryegrass. White clover propagates and persists primarily by stolons. As the plant grows, the stolons break generating a large number of smaller plants. At this stage, the plants become more vulnerable to biotic and abiotic stresses (Brock et al., 1988; Fothergill et al., 1997; Sanderson et al., 2003) and the white clover component within a sward can undergo a dramatic decline, the so called “clover crash” (Fothergill et al., 1996). This phenomenon makes the crop performance unreliable and therefore less attractive to farmers, as the crop benefits are of little use, unless coupled with good persistence. Water deficiency is one of the environmental factors most effecting crop productivity and its negative effects on white clover are well documented. A 30 year study in Australia showed that water availability was the main limiting factor in white clover persistence (Hutchinson et al., 1995), while Belaygue and colleagues observed an 80% decrease in stolon numbers as a result of just a 30% decrease in relative water content in five different genotypes of white clover (Belaygue et al., 1996). White clover is recognized to be not as adapted to drought conditions as other forage legumes, such as lucerne. This is mainly due to its shallow rooted system and its poor control of stomata closure (Hart, 1987). Additionally, very limited variability in terms of drought tolerance has been found amongst the white clover genotypes available (Abberton and Marshall, 2005), hence the need to develop molecular tools that make the identification of source of variation more efficient than methods based entirely on phenotypic characterization (Prohens, 2011). The development of genomic resources for forage legumes has been slower than for grain legumes and other crops because of the complexity of their genome, heterozygosity and polyploidy (Annicchiarico et al., 2015). However, the number of transcription profiling is steadily growing and is available not only for model species, such as Medicago truncatula, but also for legume crops, such as soybean, chickpea, pigeonpea (Chen et al., 2008; Pandey et al., 2016 and references therein). In particular, transcriptome profiling of red clover under prolonged drought stress has recently become available (Yates et al., 2014). This work has provided useful insights in the response to drought of a crop phylogenetically close to white clover. However, red and white clovers are quite different in their growth habits and there was a need for investigating white clover response at an early stage of droughting, when the crop is most affected (Fothergill et al., 1996). Several approaches are available for studying transcription profiling from early methods, such as Expressed Sequence Tags (EST) and Suppression Subtractive Hybridization (SSH), through to microarray technology and more recently Next Generation Sequencing (NGS) platforms (Deshmukh et al., 2014). SSH is a powerful technique for the amplification of differentially expressed genes by simultaneous amplification and suppression of target and non-target sequences, respectively (Diatchenko et al., 1996). Despite the advent of next generation sequencing, SSH is still a popular technique because of the relatively low cost, low amount of starting material and the relatively low rate of false positives. It is however quite laborious and costly if the number of sequenced clones is increased to several thousands. We have revisited four SSH libraries constructed from leaf tissue of white clover plants subjected to and recovering from drought stress (Bisaga, 2013). In this work, only the forward drought and recovery libraries were sequenced, yielding 1,127 and 2,405 unigenes, respectively. By replacing the cloning step with Illumina technology sequencing, we were able to exploit the already available SSH libraries more efficiently by sequencing both forward and reverse libraries at a deep coverage (387x). This allowed us to identify 6,937 and 3,142 transcripts up- and down-regulated, respectively, in response to drought, and 6,695 and 4,897, in response to re-hydration, providing a global picture of transcriptional changes occurring in white clover under the two treatments. Most published studies analyze the effects of water deprivation on white clover from an agronomic and physiological point of view, but no studies are available in which the effect of water deprivation is characterized at a molecular level. This work is, to our knowledge, the first report of a large scale molecular study on the response of white clover to drought and re-hydration.

Materials and methods

Plant material and growth conditions

The white clover (T. repens L.) R3R4 genotype, isolated at IBERS for research purposes, was used in this study (Febrer et al., 2007). Forty eight clonal cuttings of R3R4 were grown in 3.5 inch pots filled with a John Innes #3 potting soil mix (JI#3 Base: sterilized loam: peat: grit: fertilizer = 1:1:1:1 by volume). The plants were grown at 20/10°C (day/night) under daylight with additional illumination provided by high pressure sodium lamps (Philips: 400 watt Son-T Agro) to give a photoperiod of 10 h, under glasshouse condition. After root formation, clones were transferred into 5 inch pots and grown for 5 weeks. Following re-potting into 8 inch pots, the plants were grown for further 6 weeks before applying the stress treatment.

Stress treatment and measurements

The plants were organized in four complete blocks. Each experimental block consisted of nine water-stressed and three watered control plants. Before dehydration treatment, plants were watered to full capacity (ca. 500 ml). Under our experimental condition the average water loss was ca. 200 ml/day (estimated by weighing the pots; Image 1 of Supplemental Data). This volume of water was added to the control plants daily to maintain them at a uniform non-stressed water status, while the plants nominated for droughting were subject to 9 days of restricted water regime (decreasing from 100 ml to nil), followed by three recovering days in which they were watered to full capacity again. Leaf material was collected daily between 8.00 am and 9.00 am, immediately frozen in liquid nitrogen and stored at −80°C for RNA extraction. Two young fully expanded leaves were used for relative water content (RWC) measurement. Leaves were weighed immediately after collection (fresh weight–FW), placed in water to hydrate to full turgidity and weighed again (turgid weight–TW). Samples were then dried overnight at 80°C and weighed (dry weight–DW). The RWC was calculated according to Smart and Bingham equation (Smart and Bingham, 1974): RWC = [(FW–DW)/(RW–DW)] × 100.

RNA extraction, cDNA synthesis and sqRT-PCR

Total RNA was extracted from 0.2 grams of leaf tissue from day1, 4, and 10 using an RNeasy® Plant Mini Kit (Qiagen) according to the manufacturer's instructions. cDNA was synthesized from 1 μg of total RNA using Oligo (dT)25 primer (0.5 mM final) and 200 U of RevertAid™ H Minus M-MuLV Reverse Transcriptase (Fermentas) in 20 μl reaction, according to manufacturer's instructions. All primers used for sqRT-PCR were designed using Roche Universal Probe Library Assay Design Software (https://lifescience.roche.com/webapp/wcs/stores/servlet/CategoryDisplay?tab=Assay+Design+Center&identifier=Universal+Probe+Library&langId=-1) and are listed in Table S1. The NCED gene was amplified from T. repens using a pair of degenerate primers (NCED5 and NCED3) designed on publicly available nucleotide sequences (Acc. No.: FF391502.1, FF393344.1, BB931293.1, BB929043.1, 5g025250.1, 5g025270.1, and 5g025290.1). Primers NCEDmb2F and NCEDmb2R suitable for sqRT-PCR were designed on the newly sequenced T. repens NCED gene. The housekeeping actin gene used as a control was amplified using primers TrACT5 and TrACT3 designed on the publicly available T. repens gene (Acc. No. AM419900.1). sqRT-PCR was performed in triplicate using Power SYBR® Green PCR Master Mix (Applied Biosystems) on a LightCycler® 480 Real-Time Instrument (Roche) according to manufacturer's instruction. A 10 μl reaction containing 100 ng of cDNA template and 100 nM of each gene-specific primer was subject to 40 cycles of 15 s at 95°C, 1 min at 57°C and 40 s at 72°C. PCR products were subjected to dissociation curve analysis by incubating at 15 s at 95°C, 15 s at 60°C, 15 s at (60°–5°C) and data were analyzed using the the 2−ΔΔCT method to quantify relative transcript abundance (Livak and Schmittgen, 2001). For internal control, the housekeeping actin gene was amplified from cDNA of control, drought and recovery and samples.

Construction of subtractive libraries

Four Suppression Subtractive Hybridisation (SSH) libraries were generated using the SMARTer™ PCR cDNA Synthesis Kit (Clontech) and the PCR-Select cDNA Subtraction Kit (Clontech) according to manufacturer's instructions. Two libraries were constructed by subtracting driver RNA sampled from plants at day 0 of the experiment from tester RNA sampled from stressed plants at day 4 of the drought treatment (Drought Forward library–DF) and vice-versa (Drought Reverse library–DR). The other two libraries were constructed by subtracting the same driver RNA at day 0 from tester RNA sampled from plants at day 1 of the rehydration process (Recovery Forward library - RF) and vice-versa (Recovery Reverse library–RR). Products of the secondary hybridisations were re-amplified using Nested PCR primer 1 and R2 (from PCR-Select cDNA Subtraction Kit; Clontech), purified using QIAquick PCR Purification Kit (Qiagen) and digested with RsaI to remove the adaptors prior to the library preparation for sequencing.

Sequencing and data analysis

RsaI digested samples were purified with AMpure XP beads (Agencourt) according to manufacturer's instructions and used for the construction of indexed sequencing libraries using the Illumina Nextera XT sample preparation kit as per the manufacturer's instructions. Libraries were pooled and diluted to a final concentration of 6 pM prior to sequencing on two independent runs using the Illumina MiSeq platform. De novo assembly and read mappings were carried out using CLC Genomics Workbench 5.5.2 (CLC bio A/S, Aarhus, Denmark). Statistical comparison of libraries was carried out using SAGEstat V4.2 (Ruijter et al., 2002) and q-value for false discovery rate was estimated using the R package q-value V1.0 (Storey, 2002). Functional annotation was carried out using Blast2GO 2.8.0 (Conesa et al., 2005). Statistical analysis of KEGG pathways was carried out using EC2KEGG (Porollo, 2014) and full results are shown in Table S5. MapMan analysis was carried out using MapMan 3.5.1R2 (Thimm et al., 2004) and Mercator (Lohse et al., 2014).

Data availability

Next Generation Sequencing (NGS) runs have been deposited at National Centre for Biotechnology Information (NCBI) in the Short Read Archive (SRA) database under accessions numbers SRR3621208, SRR3621729, SRR3621829 and SRR3621860. Contig sequences have been deposited in the Transcriptome Shotgun Assembly (TSA) database GEST00000000. The version described in this paper is the first version, GEST01000000. Since TSA does not accept sequences shorter than 200 bp and containing a stretch of “N” > 14 nucleotides, the complete set of assembled sequences is provided as Data Sheet 1 of Supplemental Data.

Results and discussion

Fast drought experiment

To identify genes showing changes in expression in response to water deficit and subsequent re-hydration, 48 plants of T. repens (genotype R3R4; Febrer et al., 2007) were subjected to a fast drought experiment as described in Material and Methods. The conditions were chosen following a number of pilot studies that tested pot size, soil volume, watering regime, and sampling technique/timing. The amount of water applied during the experiment decreased daily as shown in Figure 1. The leaf relative water content (RWC) was measured every day of the experiment to monitor the stress status of the plants. Plants whose RWC consistently declined every day and showed a rapid response to re-watering were selected for the RNA extraction. Figure 2 shows the RWC of the stressed plants in comparison to the control. The first significant decline in RWC is observed at day 4 (9.5% drop compared to un-stressed plants) and it reverted back to the same level as the control plants within 24 h of restoring normal watering (day 1 of recovery treatment).
Figure 1

Diagram of the fast drought experiment set up. Time of sampling and time and volume of watering are shown for each day of the experiment.

Figure 2

The effect of the drought (Day 1 to Day 9) and recovery (Day R1 to R3) treatment on RWC. Each value is the mean of two technical replicates. Error bars denote standard error.

Diagram of the fast drought experiment set up. Time of sampling and time and volume of watering are shown for each day of the experiment. The effect of the drought (Day 1 to Day 9) and recovery (Day R1 to R3) treatment on RWC. Each value is the mean of two technical replicates. Error bars denote standard error. The stress status of the experimental plants was also monitored at molecular level in semi-quantitative Real Time PCR (sqRT-PCR) using the 9-cis-epoxycarotenoid dioxygenase (NCED) gene and the actin gene as internal control. The product of NCED gene catalyzes the first step of ABA biosynthesis from carotenoids in chloroplasts and it is therefore activated very quickly in response water deficit stress, making it a useful marker for monitoring the onset of drought response (Iuchi et al., 2000). Figure 3 shows the expression profile of the NCED transcript during the course of the experiment. The level of expression increases sharply between day 3 and 4 (~6.4-folds). The expression starts decreasing after day 5 and, after a brief increase on day 9, returns to basal level following re-watering (day 9). This expression profile seems to be typical of NCED, as also observed in M. truncatula under drought stress (probe Mtr.35044.1.S1_at of the microarray experiment by Zhang et al. unpublished at http://mtgea.noble.org/v3/probeset.php?id=Mtr.35044.1.S1_at).
Figure 3

Expression profile of . Error bars denote standard deviation.

Expression profile of . Error bars denote standard deviation. The concomitant decrease in RWC and increase in NCED expression was a good indication of the beginning of the stress response.

Construction and sequencing of SSH libraries

Based on the RWC monitoring and the NCED expression time course, plant material from day 4 (Tester D4) and day 10 (Tester R1–day 1 of the recovery treatment) were chosen for the library preparation. In both cases, the plant material from day 1 was used as control (Driver D1). Four SSH libraries were constructed. For the drought treatment a forward library (DF) was constructed by using the RNA from D4 plant material as tester and the one from D1 as driver and a reverse library (DR) using D1 RNA as tester and D4 as driver. Similarly, for the recovery treatment, a forward (RF) library was constructed subtracting D1 RNA from R1 RNA and a reverse (RR) subtracting R1 RNA from D1 RNA. The libraries were constructed according to the Diatchenko‘s method (Diatchenko et al., 1996). In the original method, the libraries were cloned into E. coli and the individual clones are then Sanger sequenced. This laborious procedure was originally employed to sequence the drought forward library (Bisaga, 2013). The availability of next generation sequencing technology can however speed up the procedure and generate a much greater amount of data at a fraction of the cost per clone. We therefore omitted the cloning step and sequenced the products of the secondary amplification of the four libraries directly using the Illumina MiSeq platform.

De novo and reference-based assembly of a reference transcriptome

The T. repens genome is not yet available, however, at the time this work was carried out, a transcriptome sequence (Nagy et al., 2013) and the genome of the closely related M. truncatula were publicly available (http://www.jcvi.org/medicago/display.php?pageName=General§ion=Download). Neither resource was on its own optimal as a reference sequence to aid the reads assembly. The published transcriptome of 71,545 sequences was generated by de novo assembly of 454-pyrosequencing reads. It was obtained from a mixture of above ground tissues of white clover grown under un-stressed conditions, while ours was generated from stressed plants and, like all transcriptomes, could contain mis-assembled and chimeric transcripts. For M. truncatula, the whole transcript set is available from the genome annotation and the average similarity between T. repens and M. truncatula at transcript level is around 90%. However, when the white clover transcriptome is BLASTed against the Medicago CDS library (Mt4.0v1), using a cutoff E-value of 10−20, only 60% of the sequences had hits. We therefore decided to use a combination of de novo and reference-based assembly. Table 1 shows the number of reads generated for each library before and after quality checks. DR and RF libraries generated a larger number of reads because they were run with an updated MiSeq system. Reads were trimmed for quality (quality limit = 0.01), length (≥36 bp) and for Nextera and SSH adaptors using CLC Genome Workbench 5.5.2. Reads from all four libraries (50,756,845 in total) were de novo assembled into 20,981 contigs of an average length of 482 bp (word size = 24; bubble size 111; minimum contig length = 200; mismatch cost = 2; insertion cost = 3; length fraction = 0.5; similarity fraction = 0.9).
Table 1

Summary of quality and adaptor trimming.

DFDRRFRR
No. paired reads5,801,54820,157,26223,573,6186,324,432
Average length151123131151
No. paired reads after trimming4,905,54217,645,41820,549,0485,360,350
% trimmed85888785
Average length105110115108
No. broken pairs228,878750,5181,066,277250,814
Summary of quality and adaptor trimming. Given the highly fragmented nature of the SSH library, most genes are represented by multiple non-overlapping contigs, which makes a comparative quantitative analysis of expression levels problematic. To reduce redundancy, the de novo assembled contigs were therefore further assembled by mapping to reference sequences. The contigs were first mapped to the M. truncatula CDSs (Mt4.0v1_GenesCDSSeq_20130731_1800.fasta). Consensus sequences were extracted inserting “N” for ambiguity symbols and IUPAC ambiguity codes to resolve conflicts. This generated a set of 10,413 contigs named SSHrefseqM. The un-mapped contigs were then mapped to the white clover transcriptome (Nagy et al., 2013) and consensus sequences extracted in the same way to generate a set of 3,322 contigs named SSHrefseqW. The remaining un-mapped contigs (2,458) were named SSHrefseqA. The reference-based assembly was carried out using a mismatch cost of 2, insertion and deletion costs of 3, length fraction of 0.5 and similarity fraction of 0.8. The three sets were combined in a single reference transcriptome of 16,193 contigs of average length 625 bp named SSHrefseqAMW (Data Sheet 1 of Supplemental Data). A summary of the de novo and reference-based assembly is shown in Tables 2A,B, respectively. Summary of . The total number of contigs in SSHrefseqAMW (in bold) is the sum of the consensus contigs (marked with an asterisk) mapped to M. truncatula (SSHrefseqM), T. repens (SSHrefseqW) and un-mapped (SSHrefseqA). Mean base coverage was calculated as (number of reads mapped X average read length) / total length of contigs. The SSHrefseqAMW transcriptome was functionally annotated using Blast2GO (Conesa et al., 2005) (Table S2). In total 78% of the contigs had BLASTx hits in the NCBI database and 65% could be annotated. The great majority of the top BLAST hits were from M. truncatula and Cicer arietinum (43%), followed by Glycine max and Phaseolus vulgaris (6 and 3%, respectively).

Mapping of the four SSH libraries to SSHrefseqAMW

The reads from the four SSH libraries DF, DR, RF, and RR were independently mapped to the newly generated reference transcriptome SSHrefseqAMW using the same parameters as above, except for the similarity fraction, which was increased to 0.9. Table 3 shows a summary of the results from this mapping.
Table 3

Summary of the library-specific read mapping to the SSHrefseqAMW transcriptome reference.

Counts% of readsAverage lengthNumber of bases
SSHrefseqAMW16,193-625.0010,119,208.00
DF mapped reads3,151,99861.39102.07321,718,463.00
DF contigs11,095-310.003,437,391.00
DR mapped reads14,041,20776.33110.401,550,159,323.00
DR contigs10,118-315.003,187,663.00
RF mapped reads14,975,12969.28113.871,705,150,875.00
RF contigs15,829-467.007,385,140.00
RR mapped reads3,511,60962.58105.10369,084,303.00
RR contigs9,387-291.002,732,913.00
Summary of the library-specific read mapping to the SSHrefseqAMW transcriptome reference. A considerable overlapping between the four libraries was found (Figure 4). The SSH procedure is known to produce a number of false positives. The SSH procedure is based on two rounds of hybridisation and PCR amplification, with the rate of hybridization depending heavily on the transcript abundance, the degree of differential expression and the transcript length. Most genes are differentially regulated from a basal level, rather than being switched on and off, with the majority displaying limited differential expression. With a mathematical model (Gadgil et al., 2002), showed that under recommended conditions the SSH procedure is biased toward transcripts differentially regulated to a large degree and that most false-positives are contributed by rare sequences that are not differentially expressed. Furthermore, Bui et al. (2005) showed experimentally that abundant transcript may escape both subtraction and normalization. A major source of contamination may also come from the non-specific hybridisation between strands of partially homologous genes (Gadgil et al., 2002). While the effect of these factors could be trivial when the sequencing is carried out on a limited number of bacterial clones, it becomes significant if the SSH procedure is followed by deep sequencing, which in this case yielded a base coverage of nearly 400x (Table 2A). To resolve this contamination, expression levels in RPKM (Reads Per Kilobase per Million; Mortazavi et al., 2008) were analyzed. First contigs shorter than 50 bp were filtered out. Contigs shared between forward and reverse drought libraries were also discarded if the difference in RPKM was not significant (cut off of q = 0.05). The remaining contigs were assigned to the forward library if the log2 fold change ≥ 2 or to the reverse if ≥ −2. The DF library built up in this way was then subtracted in silico from the RF library and the remaining contigs were compared to the RR library in the same way as DF/DR comparison.
Figure 4

Venn diagram showing the extent of redundancy between libraries. The number of contigs in each library is expressed as absolute value and percent (in brackets).

Venn diagram showing the extent of redundancy between libraries. The number of contigs in each library is expressed as absolute value and percent (in brackets). Three sets of putative differentially expressed genes were generated. The first set comprises all contigs mapped in each library (non-Subtracted set, nS); the second set comprises only contigs unique to forward or reverse library (Subtracted and non-Enriched set, SnE); the third set comprises the subtracted contigs enriched for contigs with log2 fold change ≥ ±2 (Subtracted and Enriched set, SE). Results are summarized in Table 4 and full data set can be found in Table S3.
Table 4

Total number of contigs ≥50 bp in each library set and results of the functional annotation of individual libraries.

Total No. of contigsContigs with BLAST hits %Contigs with annotation %Contigs assigned to KEGG pathway
nS
DF9,58781.8969.13140
DR8,12483.0172.07115
RF15,66578.7465.11142
RR7,47781.8269.18134
SnE
DF4,07776.8060.95116
DR2,61477.4365.49115
RF6,52875.5461.35129
RR2,32576.7760.69106
SE
DF6,93780.3965.93135
DR3,14279.2568.95120
RF6,69576.2562.79127
RR4,89780.8466.71128

nS, non-Subtracted; SnE, Subtracted non-Enriched (i.e., unique to each library); SE, Subtracted and log2-fold change Enriched.

Total number of contigs ≥50 bp in each library set and results of the functional annotation of individual libraries. nS, non-Subtracted; SnE, Subtracted non-Enriched (i.e., unique to each library); SE, Subtracted and log2-fold change Enriched.

Functional annotation by Go terms and KEGG pathways

Contigs in each sets were compared for their enrichment in GO (Gene Ontology) terms and KEGG (Kyoto Encyclopedia of Genes and Genomes) pathways representation using Blast2GO. In each library BLASTx hits were found for 75–83% of the contigs and functional annotations could be assigned to 61–72% of the contigs (Table 4). Go terms associated with each library in the Biological Process category where compared in the three sets. Figure 5 shows the results for level 3 using a node score filter of 10% (full data available in Table S4). In the nS set all libraries appear to be similarly enriched in the GO term of this category. With the exception of the “response to chemical,” “cellular component organization” and “single-organism localization” that are not represented in the RF library, all other terms appear to be over-represented and following the same pattern. When the in silico subtraction and/or enrichment are applied (SnE and SE sets) some GO terms appear to be more specific to some libraries. In particular the “response to stress” term (marked with a red asterisk), which is expected to be over-represented in a library of material subjected to drought, shows the expected pattern only in the SE set: present in DF (up-regulated under drought) and in RR (down-regulated during recovery).
Figure 5

GO term assignment in the biological process category. Go terms at levels 2 and 3 are shown in the non-Subtracted (nS), Subtracted non-Enriched (SnE), and Subtracted Enriched (SE) sets. Values on the x axis are scores computed by Blast2GO as per Conesa et al. (2005).

GO term assignment in the biological process category. Go terms at levels 2 and 3 are shown in the non-Subtracted (nS), Subtracted non-Enriched (SnE), and Subtracted Enriched (SE) sets. Values on the x axis are scores computed by Blast2GO as per Conesa et al. (2005). As part of the Blast2GO analysis contigs in each library were assigned with an EC (Enzyme Code) number and mapped to the relevant KEGG pathway. Contigs were assigned to 106–142 pathways depending on the size of the library (Table 4). Tables 5, 6 summarize the results of the KEGG analysis for the drought and recovery library, respectively. Only pathways significantly enriched (p < 0.05), showing at least 50% of the number of enzymes present in the equivalent pathway of M. truncatula (mtr) and at least 1.5 folds difference between forward and reverse library are shown (full set of data available in Table S5). As expected, the largest number of differentially expressed contigs was associated with metabolic processes, such as amino acid, carbohydrate, nucleotide and lipid metabolism. All pathways appear to be over-represented in both drought and recovery forward libraries of the non-subtracted set (marked in bold), including pathways that are well-known to be downregulated under drought, such as those related to translation and energy metabolism. Those pathways are indeed under-represented in both SnE and SE sets from the drought treatment (Table 5) and over-represented in all sets (energy metabolism) or just in the SnE and SE sets (translation) from the recovery treatment (Table 6).
Table 5

List of KEGG pathways represented in each set as involved in response to drought.

CategorynSSnESE
PathwayDFDRmtrPathwayDFDRmtrPathwayDFDRmtr
Amino acid metabolismCysteine and methionine metabolism291628Valine, leucine and isoleucine degradation12518Arginine and proline metabolism15722
Glycine, serine and threonine metabolism211431Arginine and proline metabolism12522Valine, leucine and isoleucine degradation13618
Phenylalanine metabolism181215Tyrosine metabolism9513Tyrosine metabolism11513
Alanine, aspartate and glutamate metabolism171019Lysine degradation8315Arginine biosynthesis11514
Arginine and proline metabolism17822Lysine biosynthesis358Lysine degradation9315
Valine, leucine and isoleucine degradation14918
Tyrosine metabolism13813
Arginine biosynthesis13714
Lysine degradation9415
Lysine biosynthesis848
Valine, leucine and isoleucine biosynthesis739
Carbohydrate metabolismAmino sugar and nucleotide sugar metabolism331637Pentose phosphate pathway12514Galactose metabolism13615
Starch and sucrose metabolism322033Galactose metabolism12615Propanoate metabolism12215
Glyoxylate and dicarboxylate metabolism221314Propanoate metabolism8215Citrate cycle (TCA cycle)11716
Glycolysis / Gluconeogenesis221425Pyruvate metabolism61121Pentose and glucuronate interconversions10511
Pentose phosphate pathway16914Ascorbate and aldarate metabolism3612Butanoate metabolism10312
Fructose and mannose metabolism161023
Galactose metabolism15815
Inositol phosphate metabolism15821
Propanoate metabolism14515
Citrate cycle (TCA cycle)14816
Pentose and glucuronate interconversions12711
Butanoate metabolism12712
Ascorbate and aldarate metabolism11412
Nucleotide metabolismPurine metabolism35839Purine metabolism211439Purine metabolism271439
Pyrimidine metabolism18631Pyrimidine metabolism16931
Lipid metabolismGlycerolipid metabolism19818Fatty acid degradation949Fatty acid degradation1259
Glycerophospholipid metabolism19829Fatty acid elongation6010Sphingolipid metabolism9416
Fatty acid degradation1269Glycerolipid metabolism61118Fatty acid elongation6110
Sphingolipid metabolism10116Synthesis and degradation of ketone bodies123Arachidonic acid metabolism518
Steroid biosynthesis8411Linoleic acid metabolism325
Fatty acid biosynthesis7413
Arachidonic acid metabolism638
Fatty acid elongation6310
Metabolism of cofactors and vitaminsPantothenate and CoA biosynthesis12514Pantothenate and CoA biosynthesis8214Pantothenate and CoA biosynthesis11314
One carbon pool by folate818Nicotinate and nicotinamide metabolism5210Nicotinate and nicotinamide metabolism7210
Nicotinate and nicotinamide metabolism8110Porphyrin and chlorophyll metabolism41325Vitamin B6 metabolism428
Riboflavin metabolism518Thiamine metabolism428
Thiamine metabolism418Porphyrin and chlorophyll metabolism81725
Metabolism of other amino acidsbeta-Alanine metabolism12714beta-Alanine metabolism9314beta-Alanine metabolism12314
Selenocompound metabolism838Cyanoamino acid metabolism146Taurine and hypotaurine metabolism213
Cyanoamino acid metabolism726Selenocompound metabolism248
Biosynthesis of other secondary metabolitesPhenylpropanoid biosynthesis12818Tropane, piperidine and pyridine alkaloid biosynthesis646Tropane, piperidine and pyridine alkaloid biosynthesis746
Monobactam biosynthesis416Caffeine metabolism212Caffeine metabolism312
Stilbenoid, diarylheptanoid and gingerol biosynthesis427Monobactam biosynthesis046Flavone and flavonol biosynthesis122
Flavone and flavonol biosynthesis012Isoflavonoid biosynthesis123
TranslationAminoacyl-tRNA biosynthesis181220Aminoacyl-tRNA biosynthesis11820Aminoacyl-tRNA biosynthesis91820
Energy metabolismCarbon fixation in photosynthetic organisms211118Nitrogen metabolism158Carbon fixation in photosynthetic organisms81518
Sulfur metabolism1347
Oxidative phosphorylation727

Only pathways significantly enriched (p < 0.05), containing at least 50% of the number of enzymes present in the equivalent pathway of M. truncatula (mtr) and with at least 1.5-folds difference between forward and reverse library are shown. Pathways up-regulated under drought are marked in bold.

Table 6

List of KEGG pathways represented in each set as involved in recovery from drought.

CategorynSSnESE
PathwayRFRRmtrPathwayRFRRmtrPathwayRFRRmtr
Carbohydrate metabolismAscorbate and aldarate metabolism11712Starch and sucrose metabolism291233Glyoxylate and dicarboxylate metabolism22922
Amino sugar and nucleotide sugar metabolism21637Pyruvate metabolism211121
Pyruvate metabolism20521Glycolysis/Gluconeogenesis201120
Glyoxylate and dicarboxylate metabolism17414Fructose and mannose metabolism171017
Glycolysis/Gluconeogenesis16825Propanoate metabolism4114
Galactose metabolism15815
Fructose and mannose metabolism15523
Citrate cycle (TCA cycle)14216
Butanoate metabolism9212
Ascorbate and aldarate metabolism6112
C5-Branched dibasic acid metabolism204
Biosynthesis of other secondary metabolitesFlavonoid biosynthesis12813Flavonoid biosynthesis8313Phenylpropanoid biosynthesis10610
Isoquinoline alkaloid biosynthesis6411Isoquinoline alkaloid biosynthesis6411Flavonoid biosynthesis969
Monobactam biosynthesis426Monobactam biosynthesis406Isoquinoline alkaloid biosynthesis646
Stilbenoid, diarylheptanoid and gingerol biosynthesis427Flavone and flavonol biosynthesis202Monobactam biosynthesis404
Flavone and flavonol biosynthesis322Flavone and flavonol biosynthesis303
Energy metabolismSulfur metabolism1587Sulfur metabolism1167Carbon fixation in photosynthetic organisms20620
Nitrogen metabolism1168Carbon fixation in photosynthetic organisms10418Sulfur metabolism12612
Metabolism of cofactors and vitaminsPorphyrin and chlorophyll metabolism241325Porphyrin and chlorophyll metabolism14425Porphyrin and chlorophyll metabolism18718
Nicotinate and nicotinamide metabolism9610One carbon pool by folate828One carbon pool by folate848
Riboflavin metabolism418Pantothenate and CoA biosynthesis3103
Pantothenate and CoA biosynthesis3714Thiamine metabolism242
Amino acid metabolismPhenylalanine, tyrosine and tryptophan biosynthesis231420Cysteine and methionine metabolism191128Glycine, serine and threonine metabolism181018
Alanine, aspartate and glutamate metabolism211319Phenylalanine, tyrosine and tryptophan biosynthesis15420Valine, leucine and isoleucine degradation6136
Alanine, aspartate and glutamate metabolism12319Lysine degradation484
Arginine biosynthesis8314
Histidine metabolism539
Nucleotide metabolismPurine metabolism422739Purine metabolism271139Pyrimidine metabolism201320
Pyrimidine metabolism251531Pyrimidine metabolism19631
Metabolism of other amino acidsSelenocompound metabolism1068Glutathione metabolism10417Selenocompound metabolism838
Selenocompound metabolism808beta-Alanine metabolism6116
Glycan biosynthesis and metabolismN-Glycan biosynthesis9417Cyanoamino acid metabolism416Other glycan degradation535
Other glycan degradation647Taurine and hypotaurine metabolism213Glycosaminoglycan degradation313
Glycosaminoglycan degradation316Zeatin biosynthesis303Glycosphingolipid biosynthesis - ganglio series212
Glycosphingolipid biosynthesis - ganglio series214Glycosphingolipid biosynthesis - globo series212
Glycosphingolipid biosynthesis - globo series214
Lipid metabolismSphingolipid metabolism11616Glycerolipid metabolism13618Fatty acid degradation4124
Arachidonic acid metabolism858Sphingolipid metabolism8416Fatty acid elongation262
Synthesis and degradation of ketone bodies423Fatty acid elongation5210Linoleic acid metabolism232
Arachidonic acid metabolism408
Linoleic acid metabolism305
Synthesis and degradation of ketone bodies213
Metabolism of terpenoids and polyketidesTerpenoid backbone biosynthesis19923
Zeatin biosynthesis303

Only pathways significantly enriched (p < 0.05), containing at least 50% of the number of enzymes present in the equivalent pathway of M. truncatula (mtr) and with at least 1.5 folds difference between forward and reverse library are shown. Pathways up-regulated during recovery are marked in bold.

List of KEGG pathways represented in each set as involved in response to drought. Only pathways significantly enriched (p < 0.05), containing at least 50% of the number of enzymes present in the equivalent pathway of M. truncatula (mtr) and with at least 1.5-folds difference between forward and reverse library are shown. Pathways up-regulated under drought are marked in bold. List of KEGG pathways represented in each set as involved in recovery from drought. Only pathways significantly enriched (p < 0.05), containing at least 50% of the number of enzymes present in the equivalent pathway of M. truncatula (mtr) and with at least 1.5 folds difference between forward and reverse library are shown. Pathways up-regulated during recovery are marked in bold. In plants, such as white clover, that employ drought tolerance as the main strategy for coping with stress, the response is primarily focused on maintaining the cell water potential. This is achieved by the accumulation of soluble small molecular weight osmoprotectants, such as amino acids (proline, asparagine, serine), polyamines (putrescine, spermidine, spermine), glycine betaine and γ-amino-N-butyric acid (GABA) (Bartels and Sunkar, 2005). This is in agreement with the “arginine and proline metabolism” pathway, involved in the biosynthesis of proline, putrescine, spermidine and spermine, being one of the most represented in the DF library. It is also in line with the work of Li et al. (2015a,b), which showed the positive effect of both exogenous and endogenous polyamines on white clover tolerance to water deficit. Also significantly over-represented in the DF library is the “butanoate metabolism” pathway, which is the main route of GABA biosynthesis via decarboxylation of L-glutamate. Regulation of osmotic adjustment can also be achieved via carbohydrates, which are accumulated to varying degrees in different plants (Singh et al., 2015). Glucose, sucrose and fructose are the most common sugars to be accumulated under stress, however raffinose family oligosaccharides (RFOs), such as raffinose, stachyose, and verbascose, have also been shown to play a role in a variety of stress responses (Castonguay and Nadeau, 1998; Taji et al., 2002; Peters et al., 2007; Nishizawa et al., 2008; Egert et al., 2015). The first report of RFOs involvement in the response to drought came from the observed increased tolerance of Arabidopsis plants over-producing galactinol synthase, which catalyzes the first step in the synthesis of RFOs (Taji et al., 2002). An increased concentration of RFOs has also been associated with drought tolerance in alfalfa (Kang et al., 2011) and in various resurrection plants (Peters et al., 2007; Egert et al., 2015). It has been suggested that RFOs not only act as osmoprotectants, but also as ROS (Reactive Oxygen Species) scavenger providing protection from oxidative damage (Nishizawa et al., 2008). White clover is known to increase the concentration of soluble sugars under water deficit (Turner, 1990; Lee et al., 2008; Li et al., 2013) but their composition is not known. The over-representation of the “galactose metabolism” pathway (87 and 80% of the enzymes in the equivalent of M. truncatula in SnE and SE, respectively) suggests that RFOs play an important role at the onset of white clover response to drought. It has to be noted that other pathways in the “carbohydrate metabolism” category that are related to the accumulation of sugars, such as “starch and sucrose metabolism,” are also significantly over-represented in the DF library (see Table S5), but the difference between DF and DR is <1.5 folds and therefore have not been shown in Table 5. However, it is quite possible these pathways will appear more enriched at later stages of the drought treatment. The “fructose and mannose metabolism” pathway, on the other hand, is non-significantly over-represented in DR compared to DF, but significantly over-represented in RF compared to RR in the SnE and SE sets. It could be extrapolated that, following a delayed activation at a later stage of drought, the expression of the related genes is still elevated at day 1 of recovery (Table 6 and Table S5). It is well-known that inhibition of photosynthesis is among the early effects of drought stress. This is reflected in the down-regulation of genes encoding enzymes related to the “carbon fixation in photosynthetic organisms” and “porphyrin and chlorophyll metabolism” pathways (83 and 68% of the enzymes in the equivalent pathways of M. truncatula, respectively, in the SE set). As expected, these pathways are enriched in the RF library compared to DR in both SnE and SE set (Table 7). Also downregulated under drought treatment are 90% of the genes encoding enzymes related to the “aminoacyl-tRNA biosynthesis” pathways. This trend has been observed in other plants (Yamakawa and Hakata, 2010; Merewitz et al., 2011; Mohanty et al., 2016) and it has been related to the aminoacyl-tRNA molecules being associated with other processes in addition to protein synthesis, such as synthesis of porphyrin ring structure (Mocibob et al., 2010), and to the accumulation of osmolytes in the form of free amino acids (Yamakawa and Hakata, 2010). Comparison between RPKM values (A) and sq-RT-PCR results (B). In each library (–) denotes absence of the contig, small case denotes loss of the contig following statistical analysis; capital letter denotes retaining of the contig following statistical analysis. Contigs failed to amplify in sqRT-PCR are marked with a cross (. Contigs rescued by the enrichment of the SnE set are marked with an asterisk (. It was however unexpected to find a higher representation of the “flavone and flavonol biosynthesis” and “isoflavonoid biosynthesis” pathways in the DR library, and non-significant difference between the DF and DR libraries in the enrichment of the “flavonoid biosynthesis” pathway (Table S5). Flavonoids are secondary metabolites with strong antioxidant activity that are involved in plant protection against biotic and abiotic stresses, including drought (Tattini et al., 2004; Mouradov and Spangenberg, 2014). Flavonoids are classified in several sub-groups, one of which, isoflavonoids, is predominantly found in leguminous plants. (Iso) flavonoids have been shown to accumulate in shoots of alfalfa and leaves of Lotus japonicus (Kang et al., 2011; Garcia-Calderon et al., 2015). Ballizany et al. (2012b) showed an increase of flavonol glycosides of quercetin in white clover subject to water deficit and its positive association with retention of higher level of dry matter production under stress. In our study the pathways related to flavonoid biosynthesis are all significantly over-represented in the DF library (Table S5), but not compared to the DR library. In Ballizany et al. (2012b), the water deficit imposed was much harsher then in our experiment (9, 13, and 17 weeks above wilting point). Our library was constructed from material at the onset of the drought response and it is therefore conceivable that the biosynthesis of flavonoid is also at an early stage. In agreement with this hypothesis is the enrichment of these pathways in all three sets of the recovery treatment (Table 6). As hypothesized above for the “fructose and mannose metabolism,” transcripts associated with the synthesis of flavonoids could still be abundant following activation during advanced stages of drought stress. A time course of the expression of the genes involved in the different branches of the phenolic biosynthetic pathway in white clover under drought stress will help characterizing the stress response in more details and will clarify the potential of flavonoids as a target for improving drought tolerance (Kang et al., 2011; Ballizany et al., 2012a,b).

Mapman functional annotation

The differentially expressed contigs were also functionally annotated using MapMan (Thimm et al., 2004). Unlike GO, MapMan has the advantage of being developed for plant-specific pathways and of not separating different functional categories, allowing a global overview of high-throughput data in the context of pathways and processes. MapMan uses a different approach from GO in that it assigns genes to functional categories (BINs), rather than to a GO terms, that are represented in a hierarchically structured tree. For this analysis MapMan bins were assigned to the SSHrefseqAMW reference transcriptome using the Mercator pipeline (Lohse et al., 2014), so that it could be used as a mapping file. The data files were generated by calculating the log2 fold change of expression levels (in RPKM) in forward and reverse libraries. A metabolic overview of nS, SnE, and SE set is shown in Figure 6.
Figure 6

MapMan metabolic overview of white clover gene expression at the onset of drought (DF/DR) and recovery (RF/RR) in the nS (A), SnE (B), and SE (C) sets. Each square represents a transcript whose expression can be the same (white square), up-regulated (red squares) or down-regulated (blue squares) in the forward as compared to the reverse libraries.

MapMan metabolic overview of white clover gene expression at the onset of drought (DF/DR) and recovery (RF/RR) in the nS (A), SnE (B), and SE (C) sets. Each square represents a transcript whose expression can be the same (white square), up-regulated (red squares) or down-regulated (blue squares) in the forward as compared to the reverse libraries. The effect of the in silico subtraction and/or enrichment is clearly shown by the different patterns of expression in the nS (Figure 6A), SnE (Figure 6B), and SE (Figure 6C) sets. The nS set shows an elevated number of transcripts whose level does not significantly (p ≥ 0.05) change between forward and reverse library (white squares), especially in the recovery samples. In the SnE set almost all transcripts appear to be up-regulated (red squares) under drought and down-regulated (blue squares) during recovery. On the other end, the SE set shows a biologically meaningful pattern in which processes related to photosynthesis, such as components of the photosystem (“Light reaction”) and the biosynthesis of photosynthetic pigments (“Tetrapyrrole”) are mostly down-regulated under drought and up-regulated during recovery (Figure 6C). As the rate of photosynthesis slows down the cellular energy is provided by glycolysis and TCA cycle (Fernie et al., 2004) which appear to be up-regulated and down-regulated in drought and recovery, respectively. As discussed in the previous section, carbohydrate and amino acids metabolism are also oppositely regulated under the two treatments. Notably, the flavonoids and phenylpropanoids pathways show an overall up-regulation under water deficit, suggesting that the hierarchical bin system strategy adopted by MapMan is more likely to provide a more accurate representation. Also significantly up- and down-regulated under drought and recovery, respectively, are the transcripts related to the ascorbate-glutathione pathway, involved in ROS scavenging, to cell wall metabolism and to lipids biosynthesis and degradation. Lipids are important components of biological membranes, the initial site of cellular perception of the signals and major targets of environmental stresses. The change in lipids composition in response to stress is well documented (Gigon et al., 2004; Torres-Franklin et al., 2007; Toumi et al., 2008) and plays a primary role not only in the sensing and triggering of the signaling cascade (Wang, 2004) but also in maintaining membrane integrity and preserve cell compartmentalisation.

Validation of expression profiles by sqRT-PCR

To validate the data obtained from the statistical analysis of the SSH libraries, we carried out a sqRT-PCR. Twenty contigs were semi-randomly chosen to cover low, medium, and high level of expression (in RPKM) in the DF library. The sqRT-PCR data from 16 of the contigs that were successfully amplified validated the expression data obtained from the SSH libraries (Table 7). With the exception of SSHrefM3_contig_1729, whose RT-PCR fit only the nS data set, all the other results fit the dataset SE best. This means that when the sqRT-PCR values are higher or lower than the internal control C on Day D4 (Table 7B), the contig is included in the DF or DR libraries of the SE set, respectively (Table 7A). The same results are observed with the values of Day R1 and the RF or RR libraries. In particular, half of the contigs (marked with an asterisk) with RPKM values ranging from medium to high, would have been lost if the enrichment of the subtracted libraries had not been carried out. Incidentally, the “false negative” contig, SSHrefM3_contig_1729, is an enzyme from the “Flavone and flavonol biosynthesis” pathway and its expression appears to increase at day 9 of the stress treatment, as hypothesized above.

Conclusions

In this work we generated the first large scale molecular data related to white clover early response to drought and re-hydration by deep sequencing of SSH libraries. In a previous study only the forward libraries were sequenced using the standard protocol, generating 1,127 and 2,405 sequences for drought and recovery, respectively (Bisaga, 2013). By replacing the cloning step with a MiSeq sequencing run we were able to increase the number of transcripts to 9,587 and 15,665, respectively (Table 4; nS set). Reverse libraries were also sequenced, generating 8,124 and 7,477 transcripts down-regulated during drought and recovery, respectively (Table 4; nS set). These sequences were further processed using a procedure involving in silico subtraction of forward and reverse libraries, followed by enrichment with transcripts whose expression level is significantly different in the two libraries. When the three sets of data, non-subtracted (nS), subtracted non-enriched (SnE) and subtracted enriched (SE), were compared by functional annotation only the latter fit all three mapping approaches used, GO term, KEGG pathways and MapMan BINs assignment. The DF and DR libraries of the SE set, however, are not only quantitatively, but also qualitatively different from the sets of transcripts identified in the original study. About 25% of the transcripts sequenced in the original drought library were not found in the DF library of the SE set. A good example is a set of transcripts associated with the GO term “photosynthesis” (GO:0015979; GO:0019684), which in our study were recovered in every library, and after SE processing, in the DR and RF libraries. This type of artifact has been observed in similar studies, where the same transcript was recovered in both forward and reverse library (Bui et al., 2005; Norelli et al., 2009; Hall et al., 2011). It is believed to be associated with abundant transcript (such as photosynthetic genes), which appear to escape both subtraction and normalization (Bui et al., 2005). The original recovery forward library was found to be more similar to the one in this study, with only 10% of the transcripts being recovered in libraries different from SE-RF. One example is a set of genes associated with the GO term “response to osmotic stress” (GO:0006970), which in our study was found, as it would be expected, in the DF and RR libraries. In this case, however, only few transcripts appeared to have a high level of expression. The cause of this artifact is therefore different from the one observed for the photosynthetic genes recovered in the drought library, but it is difficult interpret without suitable kinetic studies. Overall the newly generated sets of data provide a biologically meaningful overview of the changes in gene expression associated with the early response of white clover to drought and rehydration. It provides a valuable resource for candidate genes and molecular markers discovery for improving the crop performance and persistence during the short intermittent drought spells most likely to affect cultivated white clover.

Author contributions

MB participated in planning the study, carried out the fast drought experiment, the SSH procedure and the libraries construction. ML helped with the fast drought experiment. MH carried out the sequencing and helped with the analysis. MA participated in planning the study. AR conceived and planned the study, directed the work, carried out the analyses and carried out the writing of the manuscript.

Funding

This work was funded by BBSRC as part of the Institute Strategic Programme Grant.

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
(A)
Total number of input reads50,756,845
Total number of de novo contigs20,981
Average contig length in bp482
N50546
Number of mapped reads35,823,839
Average length of mapped reads in bp110
Mean base coverage387x
(B)
CountAverage lengthNo. of bases
Reference transcriptome (M. truncatula)62,3191,06066,028,174
No. input contigs20,98148210,119,208
No. mapped contigs14,7855037,442,262
No. un-mapped contigs6,1964322,676,946
No. mapped contig consensuses (SSHrefseqM)10,413*7157,442,262
References transcriptome (T. repens)71,54556340,246,931
No. input contigs6,1964322,676,946
No. mapped contigs3,7384161,554,812
No. un-mapped contigs (SSHrefseqA)2,458*4571,122,134
No. mapped contig consensuses (SSHrefseqW)3,322*4681,554,812
SSHrefseqAMW transcriptome reference16,19362510,119,208

The total number of contigs in SSHrefseqAMW (in bold) is the sum of the consensus contigs (marked with an asterisk) mapped to M. truncatula (SSHrefseqM), T. repens (SSHrefseqW) and un-mapped (SSHrefseqA). Mean base coverage was calculated as (number of reads mapped X average read length) / total length of contigs.

(A)
ContigRPKMDATA set
DFDRRFRRnSSnESE
SSHrefM3_contig_86380.006.930.950.00DRRFDRRFDRRF
SSHrefM3_contig_95920.0072.412.550.00DRRFDRRFDRRF
SSHrefA3_contig_5071.830.0028.376.04DFRFRRDFrfRRDFrfRR
SSHrefM3_contig_6981.780.002.081.75DFRFRRDFrfRRDFrfRR
SSHrefM3_contig_78603.750.000.162.02DFRFRRDFrfRRDFrfRR
SSHrefM3_contig_17294.0838.6530.94.44DFDRRFRRdfdrrfrrdfDRRFrr
SSHrefM3_contig_24646.8472.9350.445.03DFDRRFRRdfdrrfrrdfDRRFrr
SSHrefM3_contig_768514.120.001.8312.68DFRFRRDFrfRRDFrfRR
SSHrefM3_contig_694518.830.001.225.20DFRFRRDFrfRRDFrfRR
SSHrefM3_contig_56424.3124.5858.0332.24DFDRRFRRdfdrrfrrdfDRRFrr
SSHrefW3_contig_1414*26.383.1845.6595.31DFDRRFRRdfdrrfrrDFdrrfRR
SSHrefM3_contig_2299*42.160.7138.9975.21DFDRRFRRdfdrrfrrDFdrrfRR
SSHrefM3_contig_3336*48.950.8131.29120.42DFDRRFRRdfdrrfrrDFdrrfRR
SSHrefA3_contig_859517.360.0024.8336.76DFRFRRDFrfRRDFrfRR
SSHrefM3_contig_3263*734.405.33427.96824.25DFDRRFRRdfdrrfrrDFdrrfRR
SSHrefW3_contig_2289*1803.009.89625.812253.85DFDRRFRRdfdrrfrrDFdrrfRR
SSHrefM3_contig_3795*3441.0039.971314.083279.21DFDRRFRRdfdrrfrrDFdrrfRR
SSHrefM3_contig_3892*8341.000.21380.126350.87DFDRRFRRdfdrrfrrDFdrrfRR
SSHrefM3_contig_611*34441.00194.2044141.6040870.28DFDRRFRRdfdrrfrrDFdrrfRR
SSHrefM3_contig_32971963.7616.7818516.3764957.88DFDRRFRRdfdrrfrrDFdrrfRR
(B)
ContigFunctionCDay D4Day D9Day R1
SSHrefM3_contig_8638proline dehydrogenase mitochondrial-like (EC:1.5.5.2)11.06±0.170.56±0.139.26±0.35
SSHrefM3_contig_9592uncharacterized calcium-binding protein at1g02270-like10.68±0.211.13±0.211.7±0.34
SSHrefA3_contig_507alpha beta hydrolase domain-containing protein 11-like12.69±0.612.01±0.321.17±0.65
SSHrefM3_contig_698abc transporter b family member 19-like (EC:3.6.3.44)11.63±0.292.17±0.290.41±0.21
SSHrefM3_contig_7860formate–tetrahydrofolate ligase-like (EC:6.3.4.3)12.68±0.281.58±0.401.36±0.23
SSHrefM3_contig_1729caffeic acid 3-o-methyltransferase-like (EC:2.1.1.76)13.61±0.147.52±0.182.07±0.41
SSHrefM3_contig_2464heat shock cognate protein 80-like-
SSHrefM3_contig_7685isoflavone reductase homolog (EC:1.3.1.45)14.67±0.382.3±0.410.84±0.42
SSHrefM3_contig_6945acid beta-fructofuranosidase-like (EC:3.2.1.48, EC:3.2.1.26, EC:3.2.1.80)11.88±0.287.39±0.21.05±0.62
SSHrefM3_contig_564ubiquitin 11-
SSHrefW3_contig_1414*6-phosphogluconate decarboxylating 3 (EC:1.1.1.44)14.47±0.23.82±0.560.71±0.27
SSHrefM3_contig_2299*aconitate hydratase 1 (EC:4.2.1.3)13.15±0.413.54±0.340.57±0.21
SSHrefM3_contig_3336*bifunctional nuclease 1122.48±0.4316.68±0.40.38±0.26
SSHrefA3_contig_859unknown-
SSHrefM3_contig_3263*26s proteasome non-atpase regulatory subunit rpn12a-like112.77±0.583.1±0.41.75±0.36
SSHrefW3_contig_2289*cinnamoyl- reductase 1-like13.67±0.175.23±0.160.74±0.32
SSHrefM3_contig_3795*pathogenesis-related protein pr-4-like153.32±0.39274.31±0.4113.98±0.43
SSHrefM3_contig_3892*universal stress protein15.92±0.6740.38±0.21.05±0.41
SSHrefM3_contig_611*thiol protease aleurain-like17.15±0.3812.01±0.970.44±0.72
SSHrefM3_contig_329abscisic acid stress ripening protein-

In each library (–) denotes absence of the contig, small case denotes loss of the contig following statistical analysis; capital letter denotes retaining of the contig following statistical analysis. Contigs failed to amplify in sqRT-PCR are marked with a cross (.

Contigs rescued by the enrichment of the SnE set are marked with an asterisk (.

  37 in total

1.  Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method.

Authors:  K J Livak; T D Schmittgen
Journal:  Methods       Date:  2001-12       Impact factor: 3.608

2.  Homologs of aminoacyl-tRNA synthetases acylate carrier proteins and provide a link between ribosomal and nonribosomal peptide synthesis.

Authors:  Marko Mocibob; Nives Ivic; Silvija Bilokapic; Timm Maier; Marija Luic; Nenad Ban; Ivana Weygand-Durasevic
Journal:  Proc Natl Acad Sci U S A       Date:  2010-07-27       Impact factor: 11.205

3.  Mapping and quantifying mammalian transcriptomes by RNA-Seq.

Authors:  Ali Mortazavi; Brian A Williams; Kenneth McCue; Lorian Schaeffer; Barbara Wold
Journal:  Nat Methods       Date:  2008-05-30       Impact factor: 28.547

4.  A stress-inducible gene for 9-cis-epoxycarotenoid dioxygenase involved in abscisic acid biosynthesis under water stress in drought-tolerant cowpea.

Authors:  S Iuchi; M Kobayashi; K Yamaguchi-Shinozaki; K Shinozaki
Journal:  Plant Physiol       Date:  2000-06       Impact factor: 8.340

5.  Effect of drought stress on lipid metabolism in the leaves of Arabidopsis thaliana (ecotype Columbia).

Authors:  Agnès Gigon; Ana-Rita Matos; Daniel Laffray; Yasmine Zuily-Fodil; Anh-Thu Pham-Thi
Journal:  Ann Bot       Date:  2004-07-26       Impact factor: 4.357

6.  MAPMAN: a user-driven tool to display genomics data sets onto diagrams of metabolic pathways and other biological processes.

Authors:  Oliver Thimm; Oliver Bläsing; Yves Gibon; Axel Nagel; Svenja Meyer; Peter Krüger; Joachim Selbig; Lukas A Müller; Seung Y Rhee; Mark Stitt
Journal:  Plant J       Date:  2004-03       Impact factor: 6.417

7.  Protein accumulation in leaves and roots associated with improved drought tolerance in creeping bentgrass expressing an ipt gene for cytokinin synthesis.

Authors:  Emily B Merewitz; Thomas Gianfagna; Bingru Huang
Journal:  J Exp Bot       Date:  2011-08-09       Impact factor: 6.992

8.  Modulation of phenolic metabolism under stress conditions in a Lotus japonicus mutant lacking plastidic glutamine synthetase.

Authors:  Margarita García-Calderón; Teresa Pons-Ferrer; Anna Mrázova; Peter Pal'ove-Balang; Mária Vilková; Carmen M Pérez-Delgado; José M Vega; Adriana Eliášová; Miroslav Repčák; Antonio J Márquez; Marco Betti
Journal:  Front Plant Sci       Date:  2015-09-25       Impact factor: 5.753

Review 9.  Integrating omic approaches for abiotic stress tolerance in soybean.

Authors:  Rupesh Deshmukh; Humira Sonah; Gunvant Patil; Wei Chen; Silvas Prince; Raymond Mutava; Tri Vuong; Babu Valliyodan; Henry T Nguyen
Journal:  Front Plant Sci       Date:  2014-06-03       Impact factor: 5.753

10.  EC2KEGG: a command line tool for comparison of metabolic pathways.

Authors:  Aleksey Porollo
Journal:  Source Code Biol Med       Date:  2014-09-02
View more
  2 in total

1.  AhGLK1 affects chlorophyll biosynthesis and photosynthesis in peanut leaves during recovery from drought.

Authors:  Xing Liu; Limei Li; Meijuan Li; Liangchen Su; Siman Lian; Baihong Zhang; Xiaoyun Li; Kui Ge; Ling Li
Journal:  Sci Rep       Date:  2018-02-02       Impact factor: 4.379

2.  Transcriptome Analysis in Chinese Cabbage (Brassica rapa ssp. pekinensis) Provides the Role of Glucosinolate Metabolism in Response to Drought Stress.

Authors:  Seung Hee Eom; Seung-A Baek; Jae Kwang Kim; Tae Kyung Hyun
Journal:  Molecules       Date:  2018-05-15       Impact factor: 4.411

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.