Literature DB >> 19602524

Normalization of peak intensities in bottom-up MS-based proteomics using singular value decomposition.

Yuliya V Karpievitch1, Thomas Taverner, Joshua N Adkins, Stephen J Callister, Gordon A Anderson, Richard D Smith, Alan R Dabney.   

Abstract

MOTIVATION: LC-MS allows for the identification and quantification of proteins from biological samples. As with any high-throughput technology, systematic biases are often observed in LC-MS data, making normalization an important preprocessing step. Normalization models need to be flexible enough to capture biases of arbitrary complexity, while avoiding overfitting that would invalidate downstream statistical inference. Careful normalization of MS peak intensities would enable greater accuracy and precision in quantitative comparisons of protein abundance levels.
RESULTS: We propose an algorithm, called EigenMS, that uses singular value decomposition to capture and remove biases from LC-MS peak intensity measurements. EigenMS is an adaptation of the surrogate variable analysis (SVA) algorithm of Leek and Storey, with the adaptations including (i) the handling of the widespread missing measurements that are typical in LC-MS, and (ii) a novel approach to preventing overfitting that facilitates the incorporation of EigenMS into an existing proteomics analysis pipeline. EigenMS is demonstrated using both large-scale calibration measurements and simulations to perform well relative to existing alternatives. AVAILABILITY: The software has been made available in the open source proteomics platform DAnTE (Polpitiya et al., 2008)) (http://omics.pnl.gov/software/), as well as in standalone software available at SourceForge (http://sourceforge.net).

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19602524      PMCID: PMC2752608          DOI: 10.1093/bioinformatics/btp426

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  20 in total

1.  Robust algorithm for alignment of liquid chromatography-mass spectrometry analyses in an accurate mass and time tag data analysis pipeline.

Authors:  Navdeep Jaitly; Matthew E Monroe; Vladislav A Petyuk; Therese R W Clauss; Joshua N Adkins; Richard D Smith
Journal:  Anal Chem       Date:  2006-11-01       Impact factor: 6.986

Review 2.  Analysis and validation of proteomic data generated by tandem mass spectrometry.

Authors:  Alexey I Nesvizhskii; Olga Vitek; Ruedi Aebersold
Journal:  Nat Methods       Date:  2007-10       Impact factor: 28.547

3.  Label-free comparative analysis of proteomics mixtures using chromatographic alignment of high-resolution muLC-MS data.

Authors:  Gregory L Finney; Adele R Blackler; Michael R Hoopmann; Jesse D Canterbury; Christine C Wu; Michael J MacCoss
Journal:  Anal Chem       Date:  2008-01-12       Impact factor: 6.986

4.  Elimination of systematic mass measurement errors in liquid chromatography-mass spectrometry based proteomics using regression models and a priori partial knowledge of the sample content.

Authors:  Vladislav A Petyuk; Navdeep Jaitly; Ronald J Moore; Jie Ding; Thomas O Metz; Keqi Tang; Matthew E Monroe; Aleksey V Tolmachev; Joshua N Adkins; Mikhail E Belov; Alan R Dabney; Wei-Jun Qian; David G Camp; Richard D Smith
Journal:  Anal Chem       Date:  2007-12-29       Impact factor: 6.986

5.  A statistical framework for protein quantitation in bottom-up MS-based proteomics.

Authors:  Yuliya Karpievitch; Jeff Stanley; Thomas Taverner; Jianhua Huang; Joshua N Adkins; Charles Ansong; Fred Heffron; Thomas O Metz; Wei-Jun Qian; Hyunjin Yoon; Richard D Smith; Alan R Dabney
Journal:  Bioinformatics       Date:  2009-06-17       Impact factor: 6.937

6.  A statistical model for iTRAQ data analysis.

Authors:  Elizabeth G Hill; John H Schwacke; Susana Comte-Walters; Elizabeth H Slate; Ann L Oberg; Jeanette E Eckel-Passow; Terry M Therneau; Kevin L Schey
Journal:  J Proteome Res       Date:  2008-06-26       Impact factor: 4.466

7.  DAnTE: a statistical tool for quantitative analysis of -omics data.

Authors:  Ashoka D Polpitiya; Wei-Jun Qian; Navdeep Jaitly; Vladislav A Petyuk; Joshua N Adkins; David G Camp; Gordon A Anderson; Richard D Smith
Journal:  Bioinformatics       Date:  2008-05-03       Impact factor: 6.937

8.  A reanalysis of a published Affymetrix GeneChip control dataset.

Authors:  Alan R Dabney; John D Storey
Journal:  Genome Biol       Date:  2006-03-22       Impact factor: 13.583

9.  Capturing heterogeneity in gene expression studies by surrogate variable analysis.

Authors:  Jeffrey T Leek; John D Storey
Journal:  PLoS Genet       Date:  2007-08-01       Impact factor: 5.917

10.  Normalization of two-channel microarrays accounting for experimental design and intensity-dependent relationships.

Authors:  Alan R Dabney; John D Storey
Journal:  Genome Biol       Date:  2007       Impact factor: 13.583

View more
  33 in total

1.  Simultaneous Improvement in the Precision, Accuracy, and Robustness of Label-free Proteome Quantification by Optimizing Data Manipulation Chains.

Authors:  Jing Tang; Jianbo Fu; Yunxia Wang; Yongchao Luo; Qingxia Yang; Bo Li; Gao Tu; Jiajun Hong; Xuejiao Cui; Yuzong Chen; Lixia Yao; Weiwei Xue; Feng Zhu
Journal:  Mol Cell Proteomics       Date:  2019-05-16       Impact factor: 5.911

2.  Learning and Imputation for Mass-spec Bias Reduction (LIMBR).

Authors:  Alexander M Crowell; Casey S Greene; Jennifer J Loros; Jay C Dunlap
Journal:  Bioinformatics       Date:  2019-05-01       Impact factor: 6.937

3.  DanteR: an extensible R-based tool for quantitative analysis of -omics data.

Authors:  Tom Taverner; Yuliya V Karpievitch; Ashoka D Polpitiya; Joseph N Brown; Alan R Dabney; Gordon A Anderson; Richard D Smith
Journal:  Bioinformatics       Date:  2012-07-19       Impact factor: 6.937

4.  Pre-analytic Considerations for Mass Spectrometry-Based Untargeted Metabolomics Data.

Authors:  Dominik Reinhold; Harrison Pielke-Lombardo; Sean Jacobson; Debashis Ghosh; Katerina Kechris
Journal:  Methods Mol Biol       Date:  2019

5.  RobNorm: model-based robust normalization method for labeled quantitative mass spectrometry proteomics data.

Authors:  Meng Wang; Lihua Jiang; Ruiqi Jian; Joanne Y Chan; Qing Liu; Michael P Snyder; Hua Tang
Journal:  Bioinformatics       Date:  2021-05-05       Impact factor: 6.937

6.  A statistical selection strategy for normalization procedures in LC-MS proteomics experiments through dataset-dependent ranking of normalization scaling factors.

Authors:  Bobbie-Jo M Webb-Robertson; Melissa M Matzke; Jon M Jacobs; Joel G Pounds; Katrina M Waters
Journal:  Proteomics       Date:  2011-11-17       Impact factor: 3.984

7.  Label-free quantitative LC-MS proteomics of Alzheimer's disease and normally aged human brains.

Authors:  Victor P Andreev; Vladislav A Petyuk; Heather M Brewer; Yuliya V Karpievitch; Fang Xie; Jennifer Clarke; David Camp; Richard D Smith; Andrew P Lieberman; Roger L Albin; Zafar Nawaz; Jimmy El Hokayem; Amanda J Myers
Journal:  J Proteome Res       Date:  2012-05-17       Impact factor: 4.466

8.  Metabolomic biomarkers and novel dietary factors associated with gestational diabetes in China.

Authors:  Xuyang Chen; Jamie V de Seymour; Ting-Li Han; Yinyin Xia; Chang Chen; Ting Zhang; Hua Zhang; Philip N Baker
Journal:  Metabolomics       Date:  2018-11-03       Impact factor: 4.290

9.  Bioinformatics Tools for Mass Spectrometry-Based High-Throughput Quantitative Proteomics Platforms.

Authors:  Alexey V Nefedov; Miroslaw J Gilski; Rovshan G Sadygov
Journal:  Curr Proteomics       Date:  2011-07       Impact factor: 0.837

10.  An introspective comparison of random forest-based classifiers for the analysis of cluster-correlated data by way of RF++.

Authors:  Yuliya V Karpievitch; Elizabeth G Hill; Anthony P Leclerc; Alan R Dabney; Jonas S Almeida
Journal:  PLoS One       Date:  2009-09-18       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.