| Literature DB >> 23323971 |
Karan Uppal1, Quinlyn A Soltow, Frederick H Strobel, W Stephen Pittard, Kim M Gernert, Tianwei Yu, Dean P Jones.
Abstract
BACKGROUND: Detection of low abundance metabolites is important for de novo mapping of metabolic pathways related to diet, microbiome or environmental exposures. Multiple algorithms are available to extract m/z features from liquid chromatography-mass spectral data in a conservative manner, which tends to preclude detection of low abundance chemicals and chemicals found in small subsets of samples. The present study provides software to enhance such algorithms for feature detection, quality assessment, and annotation.Entities:
Mesh:
Year: 2013 PMID: 23323971 PMCID: PMC3562220 DOI: 10.1186/1471-2105-14-15
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1xMSwrapper workflow.
Figure 2Quantitative evaluation of LC/MS profile alignment using apLCMS. Top row shows Pearson correlation within sample duplicates in both datasets; bottom row shows the median PID of feature intensities within sample duplicates. The effect of re-aligning profiles after removing poor quality samples (correlation coefficient, R2 < 0.7) on the quantitative reproducibility of features is shown in the bottom right panel. A noticeable difference in median PID can be seen between alignment using all samples and alignment using only high quality samples for both columns of Sample Set 2.
Evaluating fitness of parameter combinations based on parameter sensitivity analysis
| 12, 0.5 | 1454 | 33.12 | 460.4 | −1858 |
| 3,0.3 | 2350 | 39.75 | 1157.5 | −1625 |
| 3,0.5 | 1940 | 34.72 | 898.4 | −1532 |
| 3,0.8 | 1653 | 30.60 | 735 | −1407 |
| 3,0.3 υ 3,0.5 | 2363 | 36.16 | 1278.2 | −1253 |
| 3,0.3 υ 3,0.8 | 2384 | 35.69 | 1313.3 | −1185 |
| 3,0.5 υ 3,0.8 | 2022 | 32.22 | 1055.4 | −1200 |
| 3,0.3 υ 12,0.5 | 2310 | 36.59 | 1214.4 | −1342 |
| 3,0.5 υ 12,0.5 | 2037 | 33.40 | 1035 | −1303 |
| 3,0.8 υ 12,0.5 | 1816 | 30.37 | 904.9 | −1221 |
Data was taken from Column A of Sample Set 1. aOnly a subset of the results is shown in the table. bMedian PID (mPID) averaged over all features. cScoring function weighs more importance to number of features; dScoring function weighs more importance to quality of features.
Figure 3Variation in stringency for feature detection in sample analyses. Using apLCMS, min.run was varied from 25, 20, 15, 12, 9, 6, 3 (panel a); min.pres was varied from 0.3, 0.5, 0.8 (panel b); and m/z were matched to Madison Metabolomics Consortium Database (MMCD) (panel c) and Metlin database (panel d) for Column A from Sample Set 1 at 5 and 10 ppm mass tolerance. Results at 10 ppm tolerance level are shown here.
Figure 4xMSanalyzer improves the sensitivity of feature detection without compromising data quality. a) Histograms showing number of peaks with ranges of percent intensity differences (PID) for LC/MS profile alignments using apLCMS (left) and xMSanalyzer (right). The results show that the xMSanalyzer routine allows detection of more quantitatively reproducible features; b) Histograms showing the average log2 intensity levels in features with median PID less than 30% detected using apLCMS (left) and xMSanalyzer (right) in Sample Set 2, Column A. xMSanalyzer not only improves the overall quantitative reproducibility of features, but also allows detection of reliable low abundance features.
Comparison of the number of quantitative reproducible features between apLCMS and xMSanalyzer
| Sample Set 1 (Column A) | 839 out of1454 (57.7%) | 1208 out of 2384 (50.6%) | 1115 out of 1816 (61.3%) |
| Sample Set 1 (Column B) | 791 out of1238 (63.89%) | 1236 out of 2201 (56.1%) | 1081 out of 1615 (66.9%) |
| Sample Set 2 (Column A) | 134 out of1324 (10.1%) | 470 out of 2677 (17.5%) | 424 out of 2256 (18.7%) |
| Sample Set 2 (Column B) | 474 out of1573 (30.1%) | 966 out of 2969 (32.5%) | 897 out of 2546 (35.2%) |
| Average over all datasets | 560 (40%) | 970 (37.9%) | 879 (42.7%) |
The number of reproducible features (median PID < 30%) identified by apLCMS at min.run = 12 and min.pres = 0.5 and xMSanalyzer at P1{3,0.3} υ P2{3,0.8} that weighs more importance to the number of features as compared to quality, and at P1{12,0.5} υ P2{3,0.8} that gives balanced importance to the quality and quantity of features.
Comparison of the number of features detected (total and known) using apLCMS, xMSanalyzer, and XCMS
| Sample Set 1 (Column A) | 1454 | 2384 | 1027 |
| | MMCD: 314 (21.6%) | MMCD: 534 (22.3%) | MMCD: 222 (21.6%) |
| | Metlin: 292 (20.1%) | Metlin: 433 (18.1%) | Metlin: 230 (22.4%) |
| Sample Set 1 (Column B) | 1238 | 2201 | 998 |
| | MMCD: 309 (25%) | MMCD: 557 (25.3%) | MMCD: 261 (26.1%) |
| | Metlin: 279 (22.5%) | Metlin: 468 (21.2%) | Metlin: 252 (25.2%) |
| Sample Set 2 (Column A) | 1324 | 2677 | 1262 |
| | MMCD: 408 (30.8%) | MMCD: 732 (27.3%) | MMCD: 324 (25.7%) |
| | Metlin: 497 (37.5%) | Metlin: 705 (26.3%) | Metlin: 431 (34.2%) |
| Sample Set 2 (Column B) | 1573 | 2969 | 1395 |
| | MMCD: 508 (32.3%) | MMCD: 794 (26.7%) | MMCD: 359 (25.7%) |
| | Metlin: 693 (44.1%) | Metlin: 848 (28.5%) | Metlin: 514 (36.8%) |
| Average over all datasets | Total: 1397 | Total: 2558 | Total: 1171 |
| Known metabolites: 413 (29.6%) | Known metabolites: 634 (24.8%) | Known metabolites: 324 (27.7%) |
xMSanalyzer doubles the number of features detected in human patient population
| 12,0.5 (default) | 6538 | 4337 |
| | MMCD: 2412 | MMCD: 1556 |
| 3,0.3 | 14004 | 10729 |
| | MMCD: 4206 | MMCD:2795 |
| 3,0.8 | 10837 | 8069 |
| | MMCD: 3624 | MMCD:2367 |
| 3,0.3 υ 3,0.8 | 15955 | 9396 |
| (xMSanalyzer) | MMCD: 5579 | MMCD: 2819 |
Dialysis patients are highly susceptible to environmental chemicals due to repeated exposures to pharmaceuticals, water, and plastics during dialysis sessions. A dataset from 10 plasma samples collected in dialysis patients, each with 2 analytical replicates, were analyzed by xMSanalyzer (at min.exp = 2) and resulted in a >2-fold increase in feature detection (from 6538 to 15955 features on the AE column), in which most m/z are unmatched in the metabolomics database MMCD.