| Literature DB >> 22369182 |
Leepika Tuli1, Tsung-Heng Tsai, Rency S Varghese, Jun Feng Xiao, Amrita Cheema, Habtom W Ressom.
Abstract
BACKGROUND: Recent advances in liquid chromatography-mass spectrometry (LC-MS) technology have led to more effective approaches for measuring changes in peptide/protein abundances in biological samples. Label-free LC-MS methods have been used for extraction of quantitative information and for detection of differentially abundant peptides/proteins. However, difference detection by analysis of data derived from label-free LC-MS methods requires various preprocessing steps including filtering, baseline correction, peak detection, alignment, and normalization. Although several specialized tools have been developed to analyze LC-MS data, determining the most appropriate computational pipeline remains challenging partly due to lack of established gold standards.Entities:
Year: 2012 PMID: 22369182 PMCID: PMC3311572 DOI: 10.1186/1477-5956-10-13
Source DB: PubMed Journal: Proteome Sci ISSN: 1477-5956 Impact factor: 2.480
Figure 1Spike-in experiment to evaluate analysis of LC-MS data generated by a label-free LC-MS method. The experiment design involves two groups of samples: (i) serum samples with spike-in peptides, and (ii) serum samples alone. LC-MS data were generated by UPLC-QTOF mass spectrometer. Four software tools (msInspect, MZmine 2, Progenesis LC-MS, and XCMS) were applied for LC-MS data analysis to evaluate workflows implemented.
The sequence information of the nine selected MassPrep peptides
| Component name | Molecular weight (g/mol) | pKa | Peptide sequence | |
|---|---|---|---|---|
| 1 | RASG-1 | 1000.4938 | 9.34 | RGDSPASSKP |
| 2 | Angiotensin frag 1-7 | 898.4661 | 7.35 | DRVYIHP |
| 3 | Bradykinin | 1059.5613 | 12.00 | RPPGFSPFR |
| 4 | Angiotensin II | 1045.5345 | 7.35 | DRVYIHPF |
| 5 | Angiotensin I | 1295.6775 | 7.51 | DRVYIHPFHL |
| 6 | Renin substrate | 1757.9253 | 7.61 | DRVYIHPFHLLVYS |
| 7 | Enolase T35 | 1871.9604 | 7.34 | WLTGPQLADLYHSLMK |
| 8 | Enolase T37 | 2827.2806 | 3.97 | YPIVSIEDPFAEDDWEAWSHFFK |
| 9 | Melitin | 2845.7381 | 12.06 | GIGAVLKVLTTGLPALISWIKRKRQQ |
Figure 2Base peak chromatograms of the MassPrep peptides. The chromatograms are zoomed into each of the 13 unique features whose m/z and retention time values match with those of the MassPrep peptides, in comparison of serum with spike-in peptides (+MassPrep) and serum alone (-MassPrep) groups.
Performance comparison of msInspect, MZmine 2, Progenesis LC-MS, and XCMS.
| msInspect (Number of selected features = 2099, sensitivity = 9/13, FP = 2090) | ||||
|---|---|---|---|---|
| 1 | RGDSPASSKP | 501.25 (2) | 0.0139 (38) | Inf (1) |
| 2 | DRVYIHP | 450.23 (2) | 0.0114 (1) | Inf (1) |
| 3a | RPPGFSPFR | 530.78 (2) | 0.0164 (43) | 90.5 (1911) |
| 3b | RPPGFSPFR | 354.19 (3) | 0.0114 (1) | 32.5 (1935) |
| 4 | DRVYIHPF | 523.77 (2) | 0.0166 (220) | 47.8 (1921) |
| 5a | DRVYIHPFHL | 648.84 (2) | 0.0120 (28) | Inf (1) |
| 5b | DRVYIHPFHL | 432.89 (3) | 0.0126 (31) | Inf (1) |
| 6 | DRVYIHPFHLLVYS | 586.98 (3) | 0.0164 (43) | Inf (1) |
| 7a | WLTGPQLADLYHSLMK | 624.99 (3) | ||
| 7b | WLTGPQLADLYHSL(M)K | 630.35 (3) | 0.0164 (43) | Inf (1) |
| 8 | YPIVSIEDPFAEDDWEAWSHFFK | 943.43 (3) | ||
| 9a | GIGAVLKVLTTGLPALISWIKRKRQQ | 712.43 (4) | ||
| 9b | GIGAVLKVLTTGLPALISWIKRKRQQ | 570.15 (5) | ||
| MZmine 2 (Number of selected features = 539, sensitivity = 7/13, FP = 532) | ||||
| Feature No. | Peptide | FC (rank) | ||
| 1 | RGDSPASSKP | 501.25 (2) | 0.0278 (18) | 40.0 (243) |
| 2 | DRVYIHP | 450.23 (2) | 0.0271 (1) | 97.9 (152) |
| 3a | RPPGFSPFR | 530.78 (2) | 0.0271 (1) | 54.1 (202) |
| 3b | RPPGFSPFR | 354.19 (3) | 0.0271 (1) | 41.4 (238) |
| 4 | DRVYIHPF | 523.77 (2) | 0.0278 (18) | 204.9 (107) |
| 5a | DRVYIHPFHL | 648.84 (2) | 0.0399 (173) | 11.2 (509) |
| 5b | DRVYIHPFHL | 432.89 (3) | 0.0301 (53) | 9636.3 (34) |
| 6 | DRVYIHPFHLLVYS | 586.98 (3) | ||
| 7a | WLTGPQLADLYHSLMK | 624.99 (3) | ||
| 7b | WLTGPQLADLYHSL(M)K | 630.35 (3) | ||
| 8 | YPIVSIEDPFAEDDWEAWSHFFK | 943.43 (3) | ||
| 9a | GIGAVLKVLTTGLPALISWIKRKRQQ | 712.43 (4) | ||
| 9b | GIGAVLKVLTTGLPALISWIKRKRQQ | 570.15 (5) | ||
| Progenesis LC-MS (Number of selected features = 467, sensitivity = 8/13, FP = 459) | ||||
| Feature No. | Peptide | FC (rank) | ||
| 1 | RGDSPASSKP | 501.25 (2) | 0.0308 (180) | 73.0 (178) |
| 2 | DRVYIHP | 450.23 (2) | 0.0103 (1) | 33.1 (240) |
| 3a | RPPGFSPFR | 530.78 (2) | 0.0146 (4) | 32.4 (243) |
| 3b | RPPGFSPFR | 354.19 (3) | 0.0103 (1) | 46.9 (210) |
| 4 | DRVYIHPF | 523.77 (2) | 0.0146 (4) | 65.8 (184) |
| 5a | DRVYIHPFHL | 648.84 (2) | ||
| 5b | DRVYIHPFHL | 432.89 (3) | 0.0146 (4) | Inf (1) |
| 6 | DRVYIHPFHLLVYS | 586.98 (3) | ||
| 7a | WLTGPQLADLYHSLMK | 624.99 (3) | ||
| 7b | WLTGPQLADLYHSL(M)K | 630.35 (3) | 0.0222 (33) | 50.6 (201) |
| 8 | YPIVSIEDPFAEDDWEAWSHFFK | 943.43 (3) | ||
| 9a | GIGAVLKVLTTGLPALISWIKRKRQQ | 712.43 (4) | ||
| 9b | GIGAVLKVLTTGLPALISWIKRKRQQ | 570.15 (5) | 0.0441 (365) | Inf (1) |
| XCMS (Number of selected features = 66, sensitivity = 7/13, FP = 59) | ||||
| Feature No. | Peptide | FC (rank) | ||
| 1 | RGDSPASSKP | 501.25 (2) | 0.0437 (3) | 40.0 (8) |
| 2 | DRVYIHP | 450.23 (2) | 0.0224 (1) | 25.3 (19) |
| 3a | RPPGFSPFR | 530.78 (2) | 0.0437 (3) | 52.2 (7) |
| 3b | RPPGFSPFR | 354.19 (3) | 0.0437 (3) | 38.0 (9) |
| 4 | DRVYIHPF | 523.77 (2) | 0.0437 (3) | 137.8 (2) |
| 5a | DRVYIHPFHL | 648.84 (2) | ||
| 5b | DRVYIHPFHL | 432.89 (3) | 0.0437 (3) | 488.4 (1) |
| 6 | DRVYIHPFHLLVYS | 586.98 (3) | ||
| 7a | WLTGPQLADLYHSLMK | 624.99 (3) | ||
| 7b | WLTGPQLADLYHSL(M)K | 630.35 (3) | 0.0491 (66) | 17.9 (27) |
| 8 | YPIVSIEDPFAEDDWEAWSHFFK | 943.43 (3) | ||
| 9a | GIGAVLKVLTTGLPALISWIKRKRQQ | 712.43 (4) | ||
| 9b | GIGAVLKVLTTGLPALISWIKRKRQQ | 570.15 (5) | ||
The comparison is based on q-value and fold-change (FC) of spike-in peptides in comparing spike-in serum samples versus serum samples. Ranking is provided on q-values and FC. The number of differentially abundant features, sensitivity, and false positives (FP) are calculated based on the criterion of q-value < 0.05 and FC > 10
Summary of difference detection results by combining different statistical tests.
| msInspect | MZmine 2 | Progenesis | XCMS | |
|---|---|---|---|---|
| Total number of features detected | 31168 (12) | 12271 (12) | 9267 (9) | 21486 (13) |
| Number of features used for statistical analysis | 6525 (9) | 12092 (9) | 8415 (9) | 8703 (10) |
| 4824 (9) | 3505 (7) | 4465 (9) | 1896 (7) | |
| Wilcoxon rank-sum test ( | 603 (8) | 1318 (6) | 1584 (8) | 812 (8) |
| 603 (8) | 967 (6) | 1379 (8) | 672 (7) | |
| 2099 (9) | 539 (7) | 467 (8) | 66 (7) | |
| 388 (8) | 323 (6) | 238 (7) | 55 (7) | |
The total number of features detected and those satisfying different criteria, i.e., t-test with multiple hypothesis testing (q-value < 0.05), Wilcoxon sum-rank test (p-value < 0.05), and fold change (FC > 10). The plus symbol denotes the combination of different criteria. Only features present in at least two replicates in each group were used for statistical analysis. The numbers in parenthesis indicate the number of MassPrep features identified in each category. We found six features (Features 1, 2, 3a, 3b, 4, and 5b) identified as differentially abundant by each of the four software tools in each category