| Literature DB >> 21949491 |
Maud M Koek, Renger H Jellema, Jan van der Greef, Albert C Tas, Thomas Hankemeier.
Abstract
Metabolomics involves the unbiased quantitative and qualitative analysis of the complete set of metabolites present in cells, body fluids and tissues (the metabolome). By analyzing differences between metabolomes using biostatistics (multivariate data analysis; pattern recognition), metabolites relevant to a specific phenotypic characteristic can be identified. However, the reliability of the analytical data is a prerequisite for correct biological interpretation in metabolomics analysis. In this review the challenges in quantitative metabolomics analysis with regards to analytical as well as data preprocessing steps are discussed. Recommendations are given on how to optimize and validate comprehensive silylation-based methods from sample extraction and derivatization up to data preprocessing and how to perform quality control during metabolomics studies. The current state of method validation and data preprocessing methods used in published literature are discussed and a perspective on the future research necessary to obtain accurate quantitative data from comprehensive GC-MS data is provided.Entities:
Year: 2010 PMID: 21949491 PMCID: PMC3155681 DOI: 10.1007/s11306-010-0254-3
Source DB: PubMed Journal: Metabolomics ISSN: 1573-3882 Impact factor: 4.290
Fig. 1Schematic of a typical workflow in metabolomics
Analytical strategies for metabolic research
| Metabolite target analysis | Quantitative (absolute or relative) analysis of one or few target metabolites. Typical strategy: selective sample pretreatment followed by separation (GC, LC, CE) coupled to sensitive selective detection |
| Metabolic profiling | Quantitative (absolute or relative) and qualitative (identification) multi-component analyses that define or describe metabolic patterns for a group of metabolically or analytically related metabolites (Horning and Horning |
| Metabolic fingerprinting | High throughput screening of samples to provide sample classification. Generally no quantification (or only relative quantification) and no identification of individual metabolites (Dunn and Ellis |
| Metabolomics (Metabonomics) | Quantitative (mostly relative quantification) and qualitative analysis of the complete set of metabolites present in a biological system (cells, body fluids, tissues). Typical strategy: generic sample pretreatment followed by separation coupled to MS detection |
Overview of GC(-MS) based metabolomics papers
| Authors | Technique | Focusb | Matrix | Validation parametersc |
|---|---|---|---|---|
| Aura et al. ( | S-GC-MS & S-GC×GC-MS | 7 | feaces | – |
| Birkemeyer et al. ( | GC-MS | 4 | microbial | – |
| Chang et al. ( | OS-GC-MS | 7 | plant | – |
| Coucheney et al. ( | OS-GC-MS | 3, 6, 7 | microbial | 4 |
| De Souza et al. ( | OS-GC-MS | 6 | microbial | – |
| Fan et al. ( | S-GC-MS | 7 | plant | 3, 4 |
| Fan et al. ( | GC-MS | 7 | plant | – |
| Fiehn et al. ( | OS-GC-MS | 5 | plant | – |
| Fiehn et al. ( | OS-GC-MS | 1, 2, 3, 4, 6 | plant | 2, 4 |
| Fiehn ( | OS-GC-MS | 6 | plant | – |
| Fiehn et al. ( | OS-GC-MS | 3, 4, 6, 7 | plant | – |
| Gullberg et al. ( | OS-GC-MS | 1 | plant | 4 |
| Guo and Lidstrom ( | OS-GC × GC-MS | 5, 6 | microbial | 2 |
| Hiller et al. ( | OS-GC-MS | 6 | microbial | 2a |
| Hope et al. ( | S-GC × GC-MS | 7 | plant | – |
| Huang and Regnier ( | OS-GC × GC-MS | 4 | serum | – |
| Humston et al. ( | SPME(HS)- GC × GC-MS | 6, 7 | plant | – |
| Humston et al. ( | OS-GC × GC-MS | 6, 7 | microbial | – |
| Jeong et al. ( | OS-GC-MS | 7 | plant | – |
| Jiye et al. ( | OS-GC-MS | 6, 7 | urine | – |
| Jonsson et al. ( | OS-GC-MS | 6 | plant | – |
| Jonsson et al. ( | OS-GC-MS | 6 | plant | – |
| Jonsson et al. ( | OS-GC-MS | 6 | urine | – |
| Koek et al. ( | OS-GC-MS | 1, 2, 3, 4 | microbial | 2, 3, 4, 5, 6, 7 |
| Koek et al. ( | OS-GC × GC-MS | 1, 2, 3, 4 | serum/plasma | 1, 2, 3, 4, 6 |
| Koek et al. ( | S-GC-MS | 2, 3, 7 | mouse CSF | 2, 3, 4, 5 |
| Koek et al. ( | OS-GC × GC-MS | 6, 7 | liver | 4 |
| Kuhara ( | OS-GC-MS | 7 | urine | – |
| Kusano et al. ( | OS-GC × GC-MS | 6 | plant | 4 |
| Lee and Fiehn ( | OS-GC-MS | 1, 7 | microbial | – |
| Li et al. ( | S-GC × GC-MS | 6 | plasma | 4 |
| Lu et al. ( | OS-GC-MS | 6, 7 | plasma | 4 |
| Ma et al. ( | S-GC-MS | 2, 6 | plant | – |
| Martins et al. ( | OS-GC-MS | 2, 3 | microbial | 4 |
| Matsumoto and Kuhara ( | S-GC-MS | 7 | urine | – |
| Mills and Walker ( | SPME(HS)-GC-MS | 7 | urine | – |
| Mohler et al. ( | OS- GC × GC-MS | 6, 7 | yeast | – |
| Mohler et al. ( | OS-GC × GC-MS | 1, 2, 3, 6 | yeast | 4 |
| Mohler et al. ( | OS-GC × GC-MS | 6 | microbial | – |
| Morgenthal et al. ( | OS-GC-MS | 1, 2, 3, 6 | plant | 4 |
| O’Hagan et al. ( | OS-GC-MS | 3, 4 | serum/yeast | 4 |
| O’Hagan et al. ( | OS-GC × GC-MS | 3 | serum | – |
| Oh et al. ( | S-GC × GC-MS | 6 | serum | – |
| Ong et al. ( | S-GC-MS | 7 | liver | 4 |
| Pan et al. ( | OS-GC-MS | 1 | liver | 2, 3, 4, 6, 7 |
| Pasikanti et al. ( | S-GC-MS | 4, 6 | urine | 2a, 4, 5, 7 |
| Pauling et al. ( | HS-GC-FID | 3, 4 | urine, breath | 4 |
| Pierce et al. ( | S-GC × GC-MS | 6 | plant | – |
| Pierce et al. ( | S-GC × GC-MS | 6 | urine | – |
| Qiu et al. ( | ECF-GC-MS | 1, 2, 3, 4 | urine | 2, 3, 4, 6 |
| Ralston-Hooper et al. ( | OS-GC × GC-MS | 7 | invertebrates | – |
| Roessner et al. ( | OS-GC-MS | 1, 2, 3, 4 | plant | 2, 3, 4 |
| Roessner et al. ( | OS-GC-MS | 6 | plant | – |
| Roessner et al. ( | OS-GC-MS | 6 | plant | 4 |
| Schauer et al. ( | OS-GC-MS | 5 | all | – |
| Sangster et al. ( | OS-GC-MS | 6 | plasma | – |
| Schmarr and Bernhardt ( | SPME(HS)- GC × GC-MS | 6 | plant | – |
| Shellie et al. ( | OS-GC × GC-MS | 6 | mouse spleen | – |
| Sinha et al. ( | OS-GC × GC-MS | 6 | urine | – |
| Strelkov et al. ( | OS-GC-MS (polar) + S-GC-MS (apolar metabolites) | 1, 2, 3, 4 | microbial | 4 |
| Styczynski et al. ( | MCF-GC-MS | 6 | microbial | – |
| Tian et al. ( | S-GC-FID/MS | 3, 6 | microbial | 2a, 3a, 4a, 6a |
| Tianniam et al. ( | OS-GC-MS | 7 | plant | – |
| Vikram et al. ( | HS-GC-MS | 7 | apples | – |
| Villas-Bôas et al. ( | MCF-GC-MS | 2, 3 | microbial | 2, 4, 6 |
| Villas-Bôas et al. ( | OS-GC-MS/MCF-GC-MS | 1 | yeast | 3 |
| Wagner et al. ( | OS-GC-MS | 5, 6 | plant | – |
| Weckwerth et al. ( | OS-GC-MS | 4, 6 | plant | – |
| Weckwerth et al. ( | OS-GC-MS | 1, 2, 3, 4, 6 | plant | 3, 4 |
| Welthagen et al. ( | OS-GC × GC-MS | 3, 6 | mouse spleen | 4 |
| Wishart et al. ( | OS-GC-MS | 7 | CSF | – |
| Zhang et al. ( | OS-GC-MS | 3, 4 | urine | 2, 4, 5, 6, 7 |
O oximation, S silylation, CF chloroformate derivatization, HS headspace sampling, SPME solid phase micro extraction
aValidation parameter only assessed in academic standard, i.e. standard without matrix
bFocus: 1 = extraction, 2 = derivatization, 3 = analysis, 4 = detection/quantification, 5 = identification, 6 = data preprocessing and analysis, 7 = application
cAnalytical validation parameters: 1 = selectivity (peak capacity), 2 = calibration model, 3 = accuracy (recovery), 4 = repeatability, 5 = intermediate precision, 6 = LLOQ/LLOD, 7 = stability
Fig. 2Illustration of the matrix enhancement effect of glucose on different metabolites measured with two different GC × GC-MS configurations. a ‘conventional’ setup with 30 m × 0.25 mm × 0.25 μm HP5-MS in the first and 1 m × 0.1 mm × 0.1 μm BPX-50 in the second dimension. b ‘high capacity’ setup with 30 m × 0.25 mm × 0.25 μm HP5-MS in the first and 2 m × 0.32 mm × 0.25 μm BPX-50 in the second dimension. In the ‘conventional’ setup in extracts with smaller amounts of glucose, the class-2 metabolite lysine and, to a lesser extent, citric acid adsorb and/or degrade on active sites present in the analytical system. In the extracts with high levels of glucose, the response for these metabolites increases, most probably because active sites are blocked. In the ‘high capacity’ setup using the more inert thicker film second dimension column the absorption is not present even at low levels of glucose in the matrix (Koek et al. 2008)
Fig. 3Example of deconvolution: three overlapping peaks were separated, making use of the mass spectral information. This results in a peak table with the response for all three individual metabolites and their corresponding mass spectrum
Definitions of validation parameters
| Selectivity | The ability of an analytical method to differentiate and quantify an analyte in the presence of other components in the sample. One way to establish method selectivity is to prove the lack of response in blank matrix, an approach not suitable for metabolomics analysis. The second approach is based on the assumption that small interferences can be accepted as long as precision and bias (at LLOQ level) remain within certain acceptance limits |
| Calibration model | The relationship between the concentration of an analyte in the sample and the corresponding detector response. There is general agreement that calibration samples should be prepared in blank matrix and that their concentrations must cover the whole calibration range. Recommendations on how many concentration levels should be studied with how many replicates per concentration level differ significantly. To establish a calibration model, we suggest measuring at least six different calibration levels, evenly spread over the whole calibration range, in duplicate (Table |
| Accuracy | The closeness of mean test results obtained by the method to the true value (concentration) of the analyte. Accuracy is determined by replicate analysis of samples containing known amounts of the analyte. Ideally, the accuracy or trueness of an analytical method is assessed by comparing the value found with a certified reference value or ‘true’ value (Hartmann et al. |
| Precision | The closeness of individual measures of an analyte when the procedure is applied repeatedly to multiple aliquots of a single homogeneous volume of biological matrix. Three different levels of precision can be determined, i.e. repeatability, intermediate precision and reproducibility. The repeatability or intra-batch precision is the precision over a short period of time using the same operating condition and is determined by repeated injection of individually prepared samples of the same test material. Intermediate precision or inter-batch precision expresses the within-laboratories variations, e.g. different days, different analyst, different equipment, etc. Reproducibility describes the precision between different laboratories and only has to be studied when the method is to be used in different laboratories |
| Limit of quantification | The lowest amount of metabolite that can be quantified with suitable precision and accuracy (Hartmann et al. |
Proposed minimum validation of analytical metabolomics methodsa
| Sample characteristics used for validation experiments | Validation parameters investigated | |||||||
|---|---|---|---|---|---|---|---|---|
| Concentration | Biol. sample | Standard solution | Added prior to sample preparation | Calibration curve + repeatability | Intermediate precision | Recovery and matrix effecte | LLOQ | Total number |
| Number of samples on days 1–14 | ||||||||
| Day 1 | Day 2 & 3, (&7, 10 and 14)d | Day 1 | ||||||
| C0 | x | No | 2 | 2 | ||||
| C1 | x | Very low | 3b | (5 × 3) | (3 after sample preparation) | f | 3 (21) | |
| C2 | x | Low | 2 | 2 | ||||
| C3 | x | x | Intermediate | 3b | 2 × 3(+3 × 3) | 3 std + 3 after sample prep. prior to derivatization | 15 (24) | |
| C3 | x | Intermediate | 6 × 1c | 6 | ||||
| C4 | x | Higher | 2 | 2 | ||||
| C5 | x | High | 3b | (5 × 3) | (3 after sample preparation) | 3 (21) | ||
| C6 | x | Highest | 2 | 2 | ||||
| Total | 35 (80) | |||||||
aMinimum validation for initial validation of a method all samples (also the samples between brackets) should be measured. For studies with a limited number of samples, analyzed within a few days, the samples between brackets could be discarded
bIt is recommended to analyze 3 samples so that data for a calibration line can also be used for determining intermediate precision (C1, C3, C5), recovery (C1, C3, C5) and LLOQ (C1)
cDetermination of analytical repeatability, one sample injected six times
dDetermination of intermediate precision over 3 days or 14 days (between brackets), analysis of three samples per day including sample preparation
eThe recovery of the extraction (excluding derivatization) can be calculated by determining the ratio between the response of the metabolites spiked before and after extraction. The matrix effect is determined by determining the ratio of the response of the metabolites spiked after extraction and the metabolites in a standard solution. The matrix effect calculation covers matrix effects during derivatization (generally decreasing response) and matrix effects during analysis (increase (matrix enhancement) or decrease in response due to matrix present)
fCalculated from RSD of lowest concentration point of calibration line (LOQ = 10 × SD/slope)
Different quality control standards and their function
| External standards | Internal standards | ||||
|---|---|---|---|---|---|
| Academic standard (no matrix) | Pooled QC | Exogenous standardd | Spike isotopically labeled metabolites | Labeled standard for every metabolite | |
| Control/detect | |||||
| • Storage | − | − | − | + | + |
| • Extraction | − | − | − | + | + |
| • Derivatization | − | − | − | + | + |
| • Injection vol. | − | − | + | − | − |
| • Detector sensitivitya | − | − | + | − | − |
| • Detector driftb | − | + | − | − | − |
| • Inertness analytical system | + | + | − | ±c | ±c |
| Correction | |||||
| • Detector responsea | − | − | + | − | − |
| • Detector driftb | − | + | − | − | − |
| • Batch correction | − | + | + | − | − |
| • Recovery metabolites | − | ± | − | ± | + |
aOverall sensitivity of the detector
bDetector drift, i.e. the change in detector response with mass-to-charge ratio (m/z) can vary with different masses (e.g. due to fouling) and should be addressed separately from the overall sensitivity
cThe ratio of different labeled metabolites, e.g. class 3/class 1 (critical/good performing metabolite; §2.1), can be used as an indicator for the inertness of the analytical system. However, deviations in the ratio can also be caused by other deviations, e.g. during sample workup
dStable compound that is not derivatized and not present in biological samples
Fig. 4Suggested quality-control scheme for GC-MS metabolomics studies; IS internal standard(s), QC quality-control sample. a)Depending on the matrix analyzed, one can choose to add IS for correction or leave these standards out (see Sect. 2.5)
Fig. 5Example of the effects of correction of peak areas of phenylalanine (a) and glycolic acid (b) measured in 5 consecutive batches (approximately 35 samples per batch, total of 180 samples); on the x-axis: sample number, and on the y-axis: (normalized) peak areas. Upper: absolute peak areas of uncorrected data, middle: normalized peak areas after IS correction, lower: normalized peak areas after IS and QC correction. In blue: regular samples, in red: QC samples, in green: blank samples and in turquoise: QC validation samples (the same as QC samples, but not used for correctional purposes) (Color figure online)