| Literature DB >> 32426025 |
Ilana Lambert1, Christine Paysant-Le Roux2,3, Stefano Colella1, Marie-Laure Martin-Magniette2,3,4.
Abstract
BACKGROUND: RNAseq is nowadays the method of choice for transcriptome analysis. In the last decades, a high number of statistical methods, and associated bioinformatics tools, for RNAseq analysis were developed. More recently, statistical studies realised neutral comparison studies using benchmark datasets, shedding light on the most appropriate approaches for RNAseq data analysis.Entities:
Keywords: Analysis workspace; Co-expression; Contrasts; Differential expression; RNA-seq
Year: 2020 PMID: 32426025 PMCID: PMC7216733 DOI: 10.1186/s13007-020-00611-7
Source DB: PubMed Journal: Plant Methods ISSN: 1746-4811 Impact factor: 4.993
Fig. 1DiCoExpress workspace. A visual representation of the DiCoExpress Workspace. a Sources directory contains all R functions implemented in DiCoExpress. b Template_scripts directory is the directory where an analysis script for each project analyzed must be saved. c Data directory is the directory where for each project, the input files (target and count tables) must be saved. If an annotation file used to describe biologically the different result tables as well as an reference file for the enrichment analysis are available, they must be also be saved in this directory. d Results directory contains all results obtained for each project
Fig. 2Overview of DiCoExpress workflow. DiCoExpress is composed of seven functions written in R programming language and available in the directory Sources. After loading the input files with the (1) Load_Data_Files function, a data quality control is done with the (2) Quality_Control function. Differential expression analysis using generalized linear models is performed by using the (3) GLM_Contrasts and (4) DiffAnalysis_edgeR functions. The union or intersection lists of differentially expressed genes are generated by the (5) Venn_Intersection_Union function. A co-expression analysis is performed on these lists by the (6) Coexpression_coseq function. Finally, the functional characterisation of lists of genes is tested by using the (7) Enrichment function
Fig. 3Histograms of raw p-values Brassica napus analysis. Histograms of the raw p-value for the contrasts [MatureLeaf–Root] and [NoSi–Si] according to the CPM_Cutoff parameter when the filtering strategy is NbConditions a CPM_Cutoff = 1 (default argument). b CPM_Cutoff = 5
Target table of Brassica napus dataset in R
| Sample | Tissue | Treatment | Replicate |
|---|---|---|---|
| MatureLeaf_NoSi_R1 | MatureLeaf | NoSi | R1 |
| MatureLeaf_NoSi_R2 | MatureLeaf | NoSi | R2 |
| MatureLeaf_NoSi_R3 | MatureLeaf | NoSi | R3 |
| MatureLeaf_Si_R1 | MatureLeaf | Si | R1 |
| MatureLeaf_Si_R2 | MatureLeaf | Si | R2 |
| MatureLeaf_Si_R3 | MatureLeaf | Si | R3 |
| Root_NoSi_R1 | Root | NoSi | R1 |
| Root_NoSi_R2 | Root | NoSi | R2 |
| Root_NoSi_R3 | Root | NoSi | R3 |
| Root_Si_R1 | Root | Si | R1 |
| Root_Si_R2 | Root | Si | R2 |
| Root_Si_R3 | Root | Si | R3 |
The target table provides the experimental design. Each row describes a sample by specifying the level for each factor in the columns
Fig. 4Co-expression clusters for the Brassica napus analysis. Average expression profiles and size of the c co-expression clusters found by analysing of the 945 Differentially expressed genes (DEGs) from the union of three contrasts: [MatureLeaf_NoSi–MatureLeaf_Si], [Root_NoSi–Root_Si] and [MatureLeaf_NoSi–MatureLeaf_Si]-[Root_NoSi–Root_Si]