| Literature DB >> 26335208 |
Anne de Jong1,2,3, Sjoerd van der Meulen4, Oscar P Kuipers4,5, Jan Kok4,5.
Abstract
BACKGROUND: Transcriptomics analyses of bacteria (and other organisms) provide global as well as detailed information on gene expression levels and, consequently, on other processes in the cell. RNA sequencing (RNA-seq) has over the past few years become the most accurate method for global transcriptome measurements and for the identification of novel RNAs. This development has been accompanied by advances in the bioinformatics methods, tools and software packages that deal with the analysis of the large data sets resulting from RNA-seq efforts.Entities:
Mesh:
Year: 2015 PMID: 26335208 PMCID: PMC4558784 DOI: 10.1186/s12864-015-1834-4
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Input files for the RNA-seq analysis pipeline
| A) Factors | B) Contrasts | C) Classes | ||||
|---|---|---|---|---|---|---|
| Experiment | Strain | Time | A_F71Y-WT | BSU00490 | green | CodY |
| WT1 | WT | T1 | B_R61K-WT | BSU01650 | green | CodY |
| WT2 | WT | T1 | C_R61H-WT | BSU01660 | green | CodY |
| F71Y1 | A_F71Y | T1 | Null-WT | BSU01670 | green | CodY |
| F71Y2 | A_F71Y | T1 | BSU01680 | green | CodY | |
| R61K1 | B_R61K | T2 | etc… | … | … | |
| R61K2 | B_R61K | T2 | BSU03981 | red | CcpA | |
| R61H1 | C_R61K | T2 | BSU03982 | red | CcpA | |
| R61H2 | C_R61K | T2 | BSU03990 | red | CcpA | |
| null1 | Null | T2 | BSU04160 | red | CcpA | |
| null2 | Null | T2 | BSU04470 | red | CcpA | |
| etc… | … | … | ||||
A) File describing the experiments and containing information of experiment replicates, B) File with the comparisons (contrasts) to be made, C) File with groups of genes/RNAs of interest
Figure 1Flow chart of the RNA-seq analysis pipeline. User input consists of the four data files defined in Table 1 and a project name. Parameters such as thresholds, p-value cutoffs and k-means settings are predefined or will be estimated by the analysis pipeline
Figure 2An illustration of images obtained by T-REx after analysis of the CodY dataset of Brinsmade et al.. a) Library sizes, b) Box plots of signals in each sample, c) MDS plot, d) Bar graph of up- and down-regulated genes, e) One of the k-means clusters, f) One of the Volcano plots, g) Network of genes and experiments, h) Correlation matrix of experiments, i) Heatmaps of Class genes to experiments, j) Correlation matrix of Class genes to Class genes. For the same images in high-resolution, see Additional file 4: Figure S2A – S2J. A tutorial for interpretation of T-REx results is given on the T-REx webserver
Overview table of the analysis of differential gene expression
| Contrast | Total number of genes | Up-regulated | Down-regulated |
|---|---|---|---|
| A_F71Y-WT | 4176 | 72 | 9 |
| B_R61K-WT | 4176 | 126 | 15 |
| C_R61H-WT | 4176 | 219 | 25 |
| Null-WT | 4176 | 282 (196/212) | 47 (27/29) |
The numbers of up- and down-regulated genes were determined using default cutoffs p-value ≤ 0.05 and fold-change ≥ 2. Within brackets p-value ≤ 0.05 and fold change ≥ 3 as was mentioned in Brinsmade et al. and our pipeline, respectively