| Literature DB >> 32373287 |
Chiara Damiani1,2,3, Lorenzo Rovida2, Davide Maspero2,4, Irene Sala2, Luca Rosato2, Marzia Di Filippo3,5, Dario Pescini3,5, Alex Graudenzi6, Marco Antoniotti2, Giancarlo Mauri2,3.
Abstract
We present MaREA4Galaxy, a user-friendly tool that allows a user to characterize and to graphically compare groups of samples with different transcriptional regulation of metabolism, as estimated from cross-sectional RNA-seq data. The tool is available as plug-in for the widely-used Galaxy platform for comparative genomics and bioinformatics analyses. MaREA4Galaxy combines three modules. The Expression2RAS module, which, for each reaction of a specified set, computes a Reaction Activity Score (RAS) as a function of the expression level of genes encoding for the associated enzyme. The MaREA (Metabolic Reaction Enrichment Analysis) module that allows to highlight significant differences in reaction activities between specified groups of samples. The Clustering module which employs the RAS computed before as a metric for unsupervised clustering of samples into distinct metabolic subgroups; the Clustering tool provides different clustering techniques and implements standard methods to evaluate the goodness of the results.Entities:
Keywords: Galaxy; Metabolism; RNA-seq; Sample stratification; TCGA
Year: 2020 PMID: 32373287 PMCID: PMC7191582 DOI: 10.1016/j.csbj.2020.04.008
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1Screenshot of the MaREA4Galaxy interface. The module for RAS computation is illustrated. In particular, the built-in (default) HMRcore GPR rules are chosen. In the ‘add dataset’ field there is the RNA-seq dataset which has been previously uploaded and that appears in green in the History panel on the right.
Fig. 2Screenshot of the MaREA4Galaxy interface. The module for metabolic reaction enrichment analysis is illustrated. The input format option ‘RNAseq dataset of all samples + sample group specification’ has been selected and the best clustering obtained with the k-means algorithm in the History has been selected as sample group specification.
Fig. 3Screenshot of the MaREA4Galaxy interface. The module for cluster analysis is illustrated. The RAS computed by the MaREA tool have been selected as input dataset and K-means has been chosen as clustering method. 2 to 5 number of clusters will be tested. The elbow and silhouette plots will be generated.
Fig. 4Evaluation of clustering goodness by MaREA4Galaxy. Left panel: elbow plot generated by the Clustering module, showing an elbow for . Right panel: silhouette plot generated by the Clustering module for , which has been returned as best clustering according to the average silhouette score reported in the plot’s title.
Fig. 5Example of metabolic map generated by MaREA4Galaxy. In the example, red arrows indicate reactions up-regulated, whereas blue arrows reactions down-regulated, in a subgroup of liver hepatocellular carcinoma patients. Black arrows refer to reactions without information about the corresponding gene-enzyme rule. Dashed gray arrows refer to non significant disregulations according Kolmogorov-Smirnov test with p-value 0.01. Solid gray arrows refer to reactions with a variation lower then 20%. As output maps are provided as vector graphics (in svg/pdf file formats), they can be zoomed-in at will.