| Literature DB >> 33199699 |
Jimmy Vandel1, Céline Gheeraert2, Bart Staels2, Jérôme Eeckhoute2, Philippe Lefebvre2, Julie Dubois-Chevalier3.
Abstract
Transcriptomic analyses are broadly used in biomedical research calling for tools allowing biologists to be directly involved in data mining and interpretation. We present here GIANT, a Galaxy-based tool for Interactive ANalysis of Transcriptomic data, which consists of biologist-friendly tools dedicated to analyses of transcriptomic data from microarray or RNA-seq analyses. GIANT is organized into modules allowing researchers to tailor their analyses by choosing the specific set of tool(s) to analyse any type of preprocessed transcriptomic data. It also includes a series of tools dedicated to the handling of raw Affymetrix microarray data. GIANT brings easy-to-use solutions to biologists for transcriptomic data mining and interpretation.Entities:
Mesh:
Year: 2020 PMID: 33199699 PMCID: PMC7670435 DOI: 10.1038/s41598-020-76769-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Transcriptomic analysis workflows using GIANT Galaxy tools. The general steps of the workflows are indicated on the left. Two workflows depending on the initial raw data are represented, both starting from the design definition (at the center bottom) to generic data mining analyses (purple dashed area at the top). Specific steps from quality check to differential analysis are indicated for microarray (left, green dashed area) and RNA-seq (right, yellow dashed area). Steps in which GIANT tools can be used are coloured in red, specific RNA-seq steps with available Galaxy tools are coloured in blue. Arrows - and - indicate the tools which should be used in consecutive steps for microarray and RNA-seq data analysis, respectively. Note that running the Quality Check tool both before and after data normalization is recommended (*marked).
Figure 2Extract of a factor file describing experimental design (GEO:GSE46495). For each sample listed in the first column, associated values for 3 experimental factors (Diet, Tissue and Mouse ID) are given in the 3 following columns.
Figure 3Partial view of the differential expression tool form showing input files selection, definition of contrasts and auto-generation of complex interaction contrasts. Both normalized data and study design files are selected input files. Definition of contrasts requires selection of factors among those automatically extracted from the design file and definition of groups (to compare first group to second group) as a selection of one or several factor value combinations (dynamically generated based on selected factors). Interaction contrasts are automatically defined as a function of the control value selected by the user.
Figure 4Partial view of the differential expression tool form showing tuning parameters and optional outputs. False Discovery Rate (FDR) and Fold Change cutoffs are tuned to filter out genes/probes from the output file. P-value histograms and volcano plots for each contrast are added to the output upon user request, as well as additional gene information extracted from public databases.
Figure 5Graphics produced by the Quality Check tool. Before normalization: (a) boxplots and (b) histograms of raw data including all .CEL files and (c) MA-plot of a single .CEL file. After normalization: (d) histograms and (e) 3D PCA of normalized microarray expression data.
Figure 6Results issued from the Differential expression tool: (a) differential statistics, (b) p-value distribution for a given contrast and (c) F-ratio bar plot for differential model factors; Graphic generated by the Volcano plot tool: (d) volcano plot generated from statistics computed by the Differential expression tool.
Figure 7Results issued from the Heatmap and clustering tool: (a) cluster information added to differential statistics, (b) normalized microarray expression heatmap with hierarchical clustering of genes and samples and (c) scree plot showing within-clusters variance as a function of cluster number to assist in the cluster number choice.
Figure 8Graphics issued from the Quality Check tool: (a) 3D PCA of normalized RNA-seq expression data and the Heatmap and clustering tool: (b) normalized RNA-seq expression heatmap with hierarchical clustering of genes and samples.
Comparison of existing Galaxy tool suites.
| Tool suite | Tunable | Modularity | Design definition | QC plots | Interactive ouput | Filter options | Cross studies | Param. clustering |
|---|---|---|---|---|---|---|---|---|
| GIANT suite | ||||||||
| SARTools[ | ||||||||
| LIMMA-voom[ |
Compared functionalities from left to right are: tunable tool parameters, specific tool for each analysis step insuring modularity, possibility to build a design file, generation of QC plots, interactivity in generated files, input filtering options, possibility to cross information with another dataset and advanced clustering parameters. and signs indicate that the corresponding functionality is fully and partially available in the tool suite respectively.
Comparison of existing Galaxy volcano plot tools.
| Volcano plot tool | Tunable | Generic input | Interactive plot | Interactive table | Filter options | Gene labeling |
|---|---|---|---|---|---|---|
| GIANT volcano tool | ||||||
| Volcanoplot | ||||||
| LIMMA-voom[ |
Compared functionalities from left to right are: tunable tool parameters, generic input, interactivity in generated plots, interactivity in generated tables, input filtering options and labeling of genes in the volcano plot. and signs indicate that the corresponding functionality is fully and partially available in the volcano tool respectively.(Volcanoplot is available on Galaxy-toolshed : https://toolshed.g2.bx.psu.edu/view/iuc/volcanoplot/73b8cb5bddcd).
Comparison of some existing Galaxy heatmap and clustering tools.
| Heatmap and clustering tool | Generic input | Interactive output | Filter options | Cross studies | Param. clustering | Cluster assignation | Side colors | Colors definition |
|---|---|---|---|---|---|---|---|---|
| GIANT heatmap tool | ||||||||
| LIMMA-voom[ | ||||||||
| heatmap | ||||||||
| heatmap_colormanipulation | ||||||||
| plotHeatmap | ||||||||
| ggplot2_heatmap2[ |
Compared functionalities from left to right are: generic input, interactivity in generated files, input filtering options, possibility to cross information with another dataset, advanced clustering parameters, retrieve cluster assignation, display colored side bar and color personalization. and signs indicate that the corresponding functionality is fully and partially available in the heatmap and clustering tool respectively. (heatmap available at https://toolshed.g2.bx.psu.edu/view/guru-ananda/heatmap/dbd447fcd3e4 ; heatmap_colormanipulation available at https://toolshed.g2.bx.psu.edu/view/mir-bioinf/heatmap_colormanipulation/58772ebbeb9f ; plotHeatmap available at https://toolshed.g2.bx.psu.edu/view/earlhaminst/plotheatmap/bd8fd161908b).