| Literature DB >> 25160088 |
Nathaniel Guy Mahieu1, Xiaojing Huang, Ying-Jr Chen, Gary J Patti.
Abstract
The aim of untargeted metabolomics is to profile as many metabolites as possible, yet a major challenge is comparing experimental method performance on the basis of metabolome coverage. To date, most published approaches have compared experimental methods by counting the total number of features detected. Due to artifactual interference, however, this number is highly variable and therefore is a poor metric for comparing metabolomic methods. Here we introduce an alternative approach to benchmarking metabolome coverage which relies on mixed Escherichia coli extracts from cells cultured in regular and (13)C-enriched media. After mass spectrometry-based metabolomic analysis of these extracts, we "credential" features arising from E. coli metabolites on the basis of isotope spacing and intensity. This credentialing platform enables us to accurately compare the number of nonartifactual features yielded by different experimental approaches. We highlight the value of our platform by reoptimizing a published untargeted metabolomic method for XCMS data processing. Compared to the published parameters, the new XCMS parameters decrease the total number of features by 15% (a reduction in noise features) while increasing the number of true metabolites detected and grouped by 20%. Our credentialing platform relies on easily generated E. coli samples and a simple software algorithm that is freely available on our laboratory Web site (http://pattilab.wustl.edu/software/credential/). We have validated the credentialing platform with reversed-phase and hydrophilic interaction liquid chromatography as well as Agilent, Thermo Scientific, AB SCIEX, and LECO mass spectrometers. Thus, the credentialing platform can readily be applied by any laboratory to optimize their untargeted metabolomic pipeline for metabolite extraction, chromatographic separation, mass spectrometric detection, and bioinformatic processing.Entities:
Mesh:
Year: 2014 PMID: 25160088 PMCID: PMC4188275 DOI: 10.1021/ac503092d
Source DB: PubMed Journal: Anal Chem ISSN: 0003-2700 Impact factor: 6.986
Figure 1Overview of the feature credentialing process. A sample is generated from two cultures of E. coli grown in parallel, one grown on natural-abundance glucose and a second grown on 13C-glucose as the sole carbon source. These two cultures are mixed in distinct ratios prior to harvesting, here 1:1 and 1:2. Extraction and LC/MS analysis is then performed on the standard samples. The resulting data are searched for pairs of coeluting peaks which satisfy the following requirements: (i) the intensities of the peaks must reflect the mixing ratio, (ii) the U-13C peak must predict a feasible number of carbons for the mass in question, and (iii) the exact masses of the peaks must predict an integer number of carbons. These requirements define a “credentialed space” in which the apex of a second peak must be found to qualify as an acceptable isotope. These candidate peaks are then aligned and grouped between the two samples. Each peak pair is compared across samples and a second, stricter intensity check is performed. This requires that the ratios of each sample (Ia12/Ia13 and Ib12/Ib13) are proportional to the mixed ratios of each sample. Peaks that pass these filters are considered credentialed.
Performance of Feature Credentialinga
| sample type | total features | credentialed features | percentage credentialed (%) |
|---|---|---|---|
| no injection | 1564 | 13 | 0.8 |
| extraction blank | 2736 | 18 | 0.7 |
| natural-abundance | 18643 | 120 | 0.6 |
| 12C/13C standard sample | 23567 | 2192 | 9.3 |
A summary of the results of the credentialing process after being applied to several different data sets. The rows labeled “no injection” and “extraction blanks” represent credentialed peaks due to carryover from previous credentialing runs. Natural-abundance E. coli is a negative control that estimates the false positive rate of the credentialing process.
Reoptimization of Published XCMS Parametersa
| XCMS parameter | published parameters | with optimized peak finding | with optimized retcor and group |
|---|---|---|---|
| ppm | 15 | 12 | 12 |
| peak width | 10, 120 | 15, 140 | 15, 140 |
| mzwid | 0.015 | 0.015 | 0.015 |
| bw | 5 | 5 | 10 |
| gapInit | 0.6 | ||
| total features | 32010 | 27260 | 27260 |
| credentialed features | 1475 | 1776 | 1817 |
Parameters used and the results of each step in the optimization process are shown. Published parameters were taken from a previously published method (ref (6)). The column labeled “with optimized peak finding” shows results for the optimization of findPeaks.centWave().
Figure 2MS/MS spectra from six representative credentialed features. MS/MS spectra were collected at four collision energies (0, 10, 20, and 40 V) on six credentialed ions. Three of these ions (A) uracil, (B) ADP, and (C) UDP-GlcA were identified based on accurate mass, carbon number, and METLIN database hits. These identifications were confirmed by comparing the experimental MS/MS spectra to the METLIN MS/MS reference spectra as shown. The upper spectrum of each plot is the experimental data, and the lower spectrum is the METLIN reference data. Unmatched peaks are depicted in red. The second three ions (D) 578.0093, (E) 1169.3011, and (F) 848.7473 were classified as unknowns as they did not match any METLIN database entries as either a fragment or parent mass. The MS/MS spectrum of each ion is displayed as normalized intensity at the same four collision energies.