| Literature DB >> 25604693 |
Ankur R Bhardwaj1, Gopal Joshi2, Bharti Kukreja3, Vidhi Malik4, Priyanka Arora5, Ritu Pandey6, Rohit N Shukla7, Kiran G Bankar8, Surekha Katiyar-Agarwal9, Shailendra Goel10, Arun Jagannath11, Amar Kumar12, Manu Agarwal13.
Abstract
BACKGROUND: Brassica juncea var. Varuna is an economically important oilseed crop of family Brassicaceae which is vulnerable to abiotic stresses at specific stages in its life cycle. Till date no attempts have been made to elucidate genome-wide changes in its transcriptome against high temperature or drought stress. To gain global insights into genes, transcription factors and kinases regulated by these stresses and to explore information on coding transcripts that are associated with traits of agronomic importance, we utilized a combinatorial approach of next generation sequencing and de-novo assembly to discover B. juncea transcriptome associated with high temperature and drought stresses.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25604693 PMCID: PMC4310166 DOI: 10.1186/s12870-014-0405-1
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Filtering of raw reads obtained through high throughput sequencing of RNA-Seq libraries
|
|
|
|
|
|---|---|---|---|
|
|
|
| |
|
|
|
| |
|
| 77926818 (100%) | 65644688 (100%) | 40181314 (100%) |
|
| 155835 (0.2%) | 4872907 (7.4%) | 889239 (2.2%) |
|
| 11662189 (15.0%) | 9706889 (14.8%) | 3747523 (9.3%) |
|
| 58438630 (75.0%) | 41320578 (62.9%) | 32342960 (80.5%) |
|
| 7670164 (9.8%) | 9744314 (14.8%) | 3201592 (8.0%) |
|
| 66108794 (84.8%) | 51064892 (77.8%) | 35544552 (88.5%) |
Raw reads from control (BC), high temperature (BHS) and drought (BDS) stress libraries were subjected to various quality control parameters and reads that had contamination of adapter sequence or of low quality were eliminated. Only high quality paired and orphan reads were pooled for assembly.
Figure 1Schematic overview of the methodology employed for data quality control (QC), assembly and downstream analysis. Name of tool used in each step of assembly or analysis is indicated in parenthesis.
Assembly statistics of high quality reads
|
|
| |||||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| |
|
| 204991 | 248954 |
| 220102 | 170941 | 134378 | 99899 | 68700 |
|
| 69.8 | 96.1 | 111.1 |
| 104.4 | 91.9 | 72.4 | 47.0 |
|
| 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 |
|
| 10071 |
| 11901 | 11782 | 11856 | 9105 | 8870 | 7678 |
|
| 340 | 385 | 423 | 506 | 610 | 683 |
| 684 |
|
| 665 | 832 | 989 | 1144 | 1265 |
| 1241 | 1057 |
Pooled high quality reads were assembled at various K-mers using SOAPdenovo. For each of the K-mer various assembly parameters (such as number of contigs, assembly length, minimum, maximum and average transcript length and N50) were evaluated. The maximum value for each of the parameter in their respective k-mers has been italicized.
Output of clustered assembly
|
|
|
|---|---|
|
| 97175 |
|
| 79.4 |
|
| 817 |
Assemblies from all the K-mer lengths were subjected to clustering. The number of contigs after clustering, total length of assembly and average length of transcripts is shown.
Figure 2Investigation of assembly performance and annotation. (A) Length-wise distribution of contigs. The number of contigs present in each of the length category in clustered transcriptome of B. juncea is shown. Contig numbers gradually decreases with respect to increasing contig length. (B) Number of B. juncea transcripts (Y-axis) that were annotated on the basis of homology with genes from closely related species (X-axis). Transcripts were searched against EMBL plant protein database and based on BLAST score annotations were derived for each transcript. The number of transcripts hitting the protein dataset of various plant species is indicated.
Figure 3Estimation of normalization and expression changes in different libraries. (A) Box-and-whisker plot of log10 FPKM values in RNA-Seq libraries of control (BC), high temperature (BHS) and drought stress (BDS). The entire range is divided in 4 quartiles (Q1-Q4) each representing 25% of genes in the particular range. (B) Scatter plot and (C) Volcano plot of the transcriptome in high temperature (BHS) and drought (BDS) stress. In scatter plot, log10 FPKM values in control (X-axis) have been plotted against log10 FPKM values of stress treated sample (Y-axis) sample. In volcano plot, statistical significance (−log10 of p-value; Y- axis) has been plotted against log2 fold change (X-axis).
Figure 4Expression analysis of differentially expressed transcripts. (A) Unsupervised hierarchical clustering of differentially expressed transcripts in high temperature (BHS) and drought stress (BDS) conditions. Comparison was made against control sample using Pearson uncentered algorithm with an average linkage rule to identify clusters of genes based on their expression levels across samples. (B) Number of transcripts (C) transcription factors and (D) kinases that were regulated by high temperature stress, drought stress or by both stresses. The up-regulation, down-regulation and inverse corelation (up-regulated in one condition and down-regulated in other condition or vice versa) is indicated by arrows pointing upwards, downwards and upwards-downwards, respectively.
List of top 10 dysregulated pathways
|
|
|
|
|
|
|---|---|---|---|---|
|
| ABC transporters | Environmental Information Processing | Membrane transport | 87 |
|
| Ribosome | Genetic Information Processing | Translation | 76 |
|
| Purine metabolism | Metabolism | Nucleotide metabolism | 43 |
|
| Porphyrin and chlorophyll metabolism | Metabolism | Metabolism of cofactors and vitamins | 41 |
|
| Glycolysis/Gluconeogenesis | Metabolism | Carbohydrate metabolism | 37 |
|
| Amino sugar and nucleotide sugar metabolism | Metabolism | Carbohydrate metabolism | 36 |
|
| Two-component system | Environmental Information Processing | Signal transduction | 36 |
|
| Amino sugar and nucleotide sugar metabolism | Metabolism | Carbohydrate metabolism | 34 |
|
| Lipopolysaccharide biosynthesis | Metabolism | Glycan biosynthesis and metabolism | 33 |
|
| Purine metabolism | Metabolism | Nucleotide metabolism | 31 |
Differentially regulated transcripts were mapped on various metabolic pathways using corresponding KEGG identifiers. Derived pathway and associated BRITE Class with number of dysregulated genes are indicated.
Figure 5Gene ontology classification of differentially expressed transcripts under the ‘biological process’ category. Significant GO terms (having atleast 40 genes) associated with differentially expressed transcripts in high temperature (BHS) and drought (BDS) stress samples along with the number of genes is indicated.
Differential expression of transcripts annotated as transcription factors and kinases
|
|
|
|
| |||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| |
| Transcription factors | ||||||||
| MYB | 122 | 34 | 4 | 13 | 17 | 12 | 14 | 26 |
| HSF | 39 | 24 | 7 | 2 | 9 | 21 | 2 | 23 |
| ERF | 26 | 22 | 2 | 9 | 11 | 6 | 9 | 15 |
| WRKY | 118 | 21 | 5 | 7 | 12 | 3 | 11 | 14 |
| bHLH | 101 | 18 | 1 | 12 | 13 | 1 | 9 | 10 |
| AP2 | 32 | 14 | 4 | 2 | 6 | 5 | 7 | 12 |
| DREB | 15 | 11 | 9 | 0 | 9 | 10 | 0 | 10 |
| JUMONJI | 37 | 8 | 0 | 7 | 7 | 0 | 4 | 4 |
| GATA | 29 | 7 | 0 | 5 | 5 | 2 | 2 | 4 |
| bZIP | 21 | 6 | 1 | 4 | 5 | 0 | 3 | 3 |
| PLATZ | 21 | 4 | 3 | 0 | 3 | 1 | 0 | 1 |
| TCP | 8 | 3 | 1 | 0 | 1 | 1 | 0 | 1 |
| CCAAT | 48 | 2 | 0 | 1 | 1 | 0 | 1 | 1 |
| HD | 5 | 2 | 0 | 1 | 1 | 0 | 1 | 1 |
| SCARECROW | 5 | 1 | 0 | 1 | 1 | 0 | 1 | 1 |
| GRAS | 5 | 1 | 1 | 0 | 1 | 1 | 0 | 1 |
| NFY | 37 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| C2H2 | 22 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Kinases | ||||||||
| Receptor-like kinases | 240 | 86 | 4 | 59 | 63 | 2 | 52 | 54 |
| MAP kinases | 116 | 29 | 6 | 14 | 20 | 2 | 10 | 12 |
| Casein kinases | 80 | 15 | 1 | 9 | 10 | 2 | 7 | 9 |
| Calcium-dependent protein kinases | 62 | 11 | 2 | 7 | 9 | 1 | 8 | 9 |
| CBL-interacting kinases | 59 | 6 | 0 | 3 | 3 | 1 | 3 | 4 |
| Cyclin-dependent kinases | 40 | 6 | 0 | 6 | 6 | 0 | 3 | 3 |
The members of various transcription factor and kinase families were fetched from assembled transcriptome data and analyzed for expression pattern under conditions of drought (BHS) and high temperature (BHS). The details of total and differentially regulated transcripts in respective families along with categorization as up-regulated, down-regulated and total regulated transcripts in BDS and BHS is presented.
Figure 6Relative abundance of selected transcripts as determined by qPCR. Expression profiling of a few differentially regulated transcripts was performed using quantitative real time PCR. The relative abundance (Y-axis) was calculated using ΔΔCt method. B. juncea seedlings were subjected for varied durations to either high temperature stress (BHS) at 42°C for 30 min, 2 h and 4 h or drought stress (BDS) by using 300 mM mannitol for 1 h, 3 h and 6 h. The mean of three independent biological replicates is presented.