| Literature DB >> 32670572 |
Matthias Pietzke1, Alexei Vazquez1,2.
Abstract
BACKGROUND: Metabolomics is gaining popularity as a standard tool for the investigation of biological systems. Yet, parsing metabolomics data in the absence of in-house computational scientists can be overwhelming and time-consuming. As a consequence of manual data processing, the results are often not analysed in full depth, so potential novel findings might get lost.Entities:
Keywords: Automatic; Graphs; Metabolites; Metabolomics; Plots; Processing; Visualisation
Year: 2020 PMID: 32670572 PMCID: PMC7350678 DOI: 10.1186/s40170-020-00220-x
Source DB: PubMed Journal: Cancer Metab ISSN: 2049-3002
Data structure for the different input layouts
| Input style | Layout | File types | Essential columns | Additional columns |
|---|---|---|---|---|
| Tracefinder | Both, metabolites and samples along rows | .csv or .xls(x) | Compound1, Filename, Area | Actual RT2, Formula2+3, Adduct2, m/z (Apex)2, m/z (Delta (ppm))2 or m/z (Delta)2 |
| Compound Discoverer | Metabolites in rows, samples in columns | .csv or .xls(x) | Compound, RT [min]4+5, Filename as: "Area: "+ "filename"+".raw (F"number")" or "Norm. Area: "+ "filename"+".raw (F"number")" | Molecular Weight2+5 ,Mass5, Formula3 |
| Compounds in Columns | Samples in rows (sample names in first column) metabolite names in first row (column 2-n) | .csv or .xls(x) | Samples in first column, name of column is ignored | Not allowed as they will be interpreted as metabolites |
| Other - List | Both, metabolites and samples along rows | .csv, .xls(x), tab-, comma, semicolon separated files | 4 columns in defined order, describing the Metabolite1, RT6, Filename and Area | Will be ignored |
| Other - Matrix | Metabolites in rows, samples in columns (inverse of compounds in columns) | First 2 columns containing Metabolite7 and RT6+8, following columns containing the samples | Not allowed as they will be interpreted as metabolites |
All imports need to have (only) one header row containing the column names. When Excel files are used, the data are expected in the first sheet. Remarks: 1-cannot contain empty values or duplicated names. 2-used for metabolite summary. 3-used for subtracting naturally occurring isotopes. 4-used to combine compound names with retention time (RT) as compound name typically contains duplicates. 5-used to label unknowns. 6-column RT can be empty but needs to be present. 7-Compounds can contain duplicate names as long as RT is supported to merge names and RT
Fig. 1The visual design of the tool. Top: Scheme illustrating the workflow of the application. Bottom: Screenshot of the application, showing the plot-design step. The plot in the middle is interactively updated based on the settings defined by the user, allowing a high level of flexibility
Fig. 2Illustration of plotting capabilities. a–f Standard plots included in the tool, with default parameters and colours. a Bar plot showing mean and standard deviation (SD). b Individual data-points with mean and SD. c Violin plot showing the distribution and the median (only available for at least 5 replicates). d Replication validation with points, showing the experimental replicates (only when experimental replicates are defined). e Boxplot (only available for at least 10 replicates). f Replication validation with bars, showing the experimental replicates (only when experimental replicates are defined). g–j Modifications on the plots allowing more flexibility, including user defined order and colours. g Modified text elements. h Hollow bars with added data points, removed grid lines, increased height and 90° direction on x-axis. i Hollow violin plot with statistics (t test with multiple pairwise comparisons). j Bar plot with decreased height and statistics (t test against spleen sample). k Points and error with increased width, legend instead of x-axis labels and added sample counts
Fig. 3Illustration of tracing experiments processed and visualised with Metabolite AutoPlotter. Plots show the incorporation of 13C either from glucose or from glutamine (highlighted on the left side) into some intracellular metabolites of HCT116 cells at different time points ranging from 1 to 24 h. Top: Absolute intensity plots and short colour scale, showing the peak area (arbitrary units). Bottom: relative intensity plots and full colour scale, showing the relative peak area (percentage)
Fig. 4Illustration of the natural abundance correction. The left panels show peak areas (arbitrary units) of ATP and GSH (glutathione), of the same experiment as in Fig. 3, before the natural abundance correction was performed. The interpretation of the results is difficult, particularly when no unlabelled reference is reported. The right panel shows the same compounds after the correction, here the amount of 13C incorporation can be deduced much easier. For the legend, see Fig. 3, top