| Literature DB >> 21461033 |
Maud M Koek, Frans M van der Kloet, Robert Kleemann, Teake Kooistra, Elwin R Verheij, Thomas Hankemeier.
Abstract
Due to the complexity of typical metabolomics samples and the many steps required to obtain quantitative data in GC × GC-MS consisting of deconvolution, peak picking, peak merging, and integration, the unbiased non-target quantification of GC × GC-MS data still poses a major challenge in metabolomics analysis. The feasibility of using commercially available software for non-target processing of GC × GC-MS data was assessed. For this purpose a set of mouse liver samples (24 study samples and five quality control (QC) samples prepared from the study samples) were measured with GC × GC-MS and GC-MS to study the development and progression of insulin resistance, a primary characteristic of diabetes type 2. A total of 170 and 691 peaks were quantified in, respectively, the GC-MS and GC × GC-MS data for all study and QC samples. The quantitative results for the QC samples were compared to assess the quality of semi-automated GC × GC-MS processing compared to targeted GC-MS processing which involved time-consuming manual correction of all wrongly integrated metabolites and was considered as golden standard. The relative standard deviations (RSDs) obtained with GC × GC-MS were somewhat higher than with GC-MS, due to less accurate processing. Still, the biological information in the study samples was preserved and the added value of GC × GC-MS was demonstrated; many additional candidate biomarkers were found with GC × GC-MS compared to GC-MS. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s11306-010-0219-6) contains supplementary material, which is available to authorized users.Entities:
Year: 2010 PMID: 21461033 PMCID: PMC3040320 DOI: 10.1007/s11306-010-0219-6
Source DB: PubMed Journal: Metabolomics ISSN: 1573-3882 Impact factor: 4.290
Workflow for optimizing and carrying out GC × GC–MS data processing
| Task | Samples | Analyst time (h) | Computer time (h) |
|---|---|---|---|
| 1. Optimize processing method (peak width, smoothing, match required to combine 2D peaks from one entry) | 5 QCs | 8 | 20 |
| 2. Processing for construction of target table | 1 QC | 0 | 1 |
| 3. Construction target table (removing artifacts from, i.e. solvent and reagents) | 1 QC | 1 | 0 |
| 4. Find targets from GC–MS in GC × GC–MS target table | 1 QC | 16 | 0 |
| 5. Processing of samples using constructed target table | 29 (all) | 0 | 40 |
| 6. Copy data to spreadsheet | 29 (all) | 1 | 0 |
| 7. Removing entries with too many blanks | 29 (all) | 1 | 0 |
| 8. Assigning peaks of remaining blank valuesa | 29 (all) | 20 | 0 |
| Total time required | 29 (all) | 47 | 61 |
aReprocessing of all the data with a low match factor (200) takes about 20 h of extra computer time
Number of entries in a GC–MS and GC × GC–MS chromatogram of a pooled mouse liver sample
| S/N ratioa | GC–MS | GC × GC–MSb |
|---|---|---|
| S/N >3 | 435 | 3770 |
| S/N >50 | 165 | 1905 |
| S/N >100 | 96 | 1223 |
| S/N >200 | 52 | 835 |
| S/N >500 | 11 | 518 |
aSignal-to-noise ratio in the total ion current
bNumber of entries obtained by deconvolution software (see text for details) and after removal of artifacts from, i.e. solvents and reagents (eluting at low 2tR)
Comparing the RSDs of normalized MS responsea for the internal standards for GC–MS and GC × GC–MS in all samples (QC and study samples)
| Compound | RSD of MS response (%) | |
|---|---|---|
| GC–MS | GC × GC–MS | |
| Alanine-d4 | 7 | 8 |
| Leucine-d3 | 9 | 8 |
| Glutamic acid-d3 | 17 | 8 |
| Phenylalanine-d5 | 13 | 7 |
| Cholic acid-d4 | 4 | 6 |
aMS responses of the internal standards were corrected for variations in injection volume and MS response by normalization on the response of dicyclohexylphthalate (used as injection standard)
Fig. 1Comparison of RSDs for metabolites in the QC samples of GC–MS and GC × GC–MS data after processing; only metabolites detected in both methods are shown
Fig. 2Overview of the number of entries in the QC samples per RSD-range for GC–MS and GC × GC–MS
Fig. 3Total-ion GC–MS chromatogram of a pooled mouse-liver sample
Fig. 4Two-dimensional colour plot of a total ion GC × GC–MS chromatogram of a pooled mouse liver sample
Fig. 5PCA analysis of the overlap data (107 entries) for GC–MS (a) and GC × GC–MS (b) of mouse liver after 0 (red, +), 6 (blue, ∆) and 12 (green, ●) weeks (Color figure online)
Mahalanobis distances in the overlap data after PCA analysis of GC–MS and GC × GC–MS
| Groups | Mahalanobis distance between groups | |
|---|---|---|
| GC–MS | GC × GC–MS | |
|
| 36 | 47 |
|
| 44 | 74 |
|
| 13 | 10 |
Fig. 6PCA analysis of additional entries (compared to GC–MS; 584 peaks) in GC × GC–MS of mouse liver after 0 (red, +), 6 (blue, ∆) and 12 (green, ●) weeks (Color figure online)
Top 20 metabolites with highest loading in LD1 in PCDA analysisa
| Rank PCDA | GC × GC–MS | GC–MS | ||
|---|---|---|---|---|
| Compoundb | Loading LD1 | Compound | Loading LD1 | |
| 1 |
| −4.41 |
| −4.55 |
| 2 | M0617 (unsaturated fatty acid methyl ester) | −4.36 | GA0502 | −4.37 |
| 3 | M0480 | −4.29 | 1,2-Diglyceridec | 4.28 |
| 4 | M0071 | −4.27 | 1-Palmitoyl- | −4.22 |
| 5 |
| −4.17 |
| −4.18 |
| 6 | M0535 (purine) | −4.04 |
| −4.03 |
| 7 | Taurine* | 4.00 |
| −4.00 |
| 8 | Tyrosine* | −3.99 | GA0123c | −3.91 |
| 9 | M0221 (amino-organic acid) | −3.98 |
| −3.83 |
| 10 | M0444 | −3.92 |
| −3.82 |
| 11 | M0651 | −3.91 |
| −3.81 |
| 12 | M0593 (monoglyceride) | −3.90 | C20:1 fatty acidc | 3.80 |
| 13 |
| −3.85 | Pipecolinic acidc | −3.78 |
| 14 | M0182 (piperidine) | −3.84 |
| −3.77 |
| 15 | M0550 | −3.84 |
| −3.77 |
| 16 | M0283 (pyrrolidinone) | −3.81 |
| 3.76 |
| 17 | 1,5-Anhydro- | −3.80 |
| −3.75 |
| 18 | M0600 (polyunsaturated fatty acid) | −3.79 | GA0520c | −3.71 |
| 19 | M0597 | −3.75 |
| −3.71 |
| 20 | M0307 (deoxyglucose or isomer) | −3.72 |
| −3.70 |
aIn bold: metabolites present in both top 20s, in italics: metabolites present in both datasets, but not the top 20 of GC × GC–MS data. Metabolites marked with an asterisk are identified with authentic standards, other metabolites given a name are annotated via their mass spectra (more information in Sect. 2). The retention times and quantification masses of unknowns (M-coded and GA-coded metabolites) are listed in the Supplement (S-Table 2)
bTentative annotation via MS library match or characterization of type of metabolite based on the mass spectrum are given in brackets
cMetabolites were not identified in the GC × GC–MS data set, most likely due to uncertainty in the assignment of the identity (no reference standard available and mass spectrum not unique enough). Only number 3 and 4 from GC–MS were not measured with GC × GC–MS, because their elution temperature was too high. Most probably all other metabolites are present in the 2D data set, but under a different name (M-code). However, none of the unknown top 20 metabolites from the GC–MS data set were present in the top 20 of the GC × GC–MS data (checked manually via mass spectra)
Fig. 7Box plots of the relative concentrations of the PCDA variables in LD1: Campesterol (a; P < 0.0001;), M0617 (poly unsaturated fatty acid methyl ester; P < 0.0001) (b) and linoleic acid (c; P < 0.0001) M0600 (poly-unsaturated fatty acid) (d; P = 0.0002), tyrosine (e; P < 0.0001) and spermidine (f; P = 0.0015). P-values calculated with one-way ANOVA with α = 0.05. Analysis of differences between groups with Tukey post-hoc testing resulted in significant differences between groups t = 0 vs. t = 6 and t = 0 vs. t = 12 for all metabolites