MOTIVATION: LC-MS allows for the identification and quantification of proteins from biological samples. As with any high-throughput technology, systematic biases are often observed in LC-MS data, making normalization an important preprocessing step. Normalization models need to be flexible enough to capture biases of arbitrary complexity, while avoiding overfitting that would invalidate downstream statistical inference. Careful normalization of MS peak intensities would enable greater accuracy and precision in quantitative comparisons of protein abundance levels. RESULTS: We propose an algorithm, called EigenMS, that uses singular value decomposition to capture and remove biases from LC-MS peak intensity measurements. EigenMS is an adaptation of the surrogate variable analysis (SVA) algorithm of Leek and Storey, with the adaptations including (i) the handling of the widespread missing measurements that are typical in LC-MS, and (ii) a novel approach to preventing overfitting that facilitates the incorporation of EigenMS into an existing proteomics analysis pipeline. EigenMS is demonstrated using both large-scale calibration measurements and simulations to perform well relative to existing alternatives. AVAILABILITY: The software has been made available in the open source proteomics platform DAnTE (Polpitiya et al., 2008)) (http://omics.pnl.gov/software/), as well as in standalone software available at SourceForge (http://sourceforge.net).
MOTIVATION: LC-MS allows for the identification and quantification of proteins from biological samples. As with any high-throughput technology, systematic biases are often observed in LC-MS data, making normalization an important preprocessing step. Normalization models need to be flexible enough to capture biases of arbitrary complexity, while avoiding overfitting that would invalidate downstream statistical inference. Careful normalization of MS peak intensities would enable greater accuracy and precision in quantitative comparisons of protein abundance levels. RESULTS: We propose an algorithm, called EigenMS, that uses singular value decomposition to capture and remove biases from LC-MS peak intensity measurements. EigenMS is an adaptation of the surrogate variable analysis (SVA) algorithm of Leek and Storey, with the adaptations including (i) the handling of the widespread missing measurements that are typical in LC-MS, and (ii) a novel approach to preventing overfitting that facilitates the incorporation of EigenMS into an existing proteomics analysis pipeline. EigenMS is demonstrated using both large-scale calibration measurements and simulations to perform well relative to existing alternatives. AVAILABILITY: The software has been made available in the open source proteomics platform DAnTE (Polpitiya et al., 2008)) (http://omics.pnl.gov/software/), as well as in standalone software available at SourceForge (http://sourceforge.net).
Authors: Navdeep Jaitly; Matthew E Monroe; Vladislav A Petyuk; Therese R W Clauss; Joshua N Adkins; Richard D Smith Journal: Anal Chem Date: 2006-11-01 Impact factor: 6.986
Authors: Gregory L Finney; Adele R Blackler; Michael R Hoopmann; Jesse D Canterbury; Christine C Wu; Michael J MacCoss Journal: Anal Chem Date: 2008-01-12 Impact factor: 6.986
Authors: Vladislav A Petyuk; Navdeep Jaitly; Ronald J Moore; Jie Ding; Thomas O Metz; Keqi Tang; Matthew E Monroe; Aleksey V Tolmachev; Joshua N Adkins; Mikhail E Belov; Alan R Dabney; Wei-Jun Qian; David G Camp; Richard D Smith Journal: Anal Chem Date: 2007-12-29 Impact factor: 6.986
Authors: Yuliya Karpievitch; Jeff Stanley; Thomas Taverner; Jianhua Huang; Joshua N Adkins; Charles Ansong; Fred Heffron; Thomas O Metz; Wei-Jun Qian; Hyunjin Yoon; Richard D Smith; Alan R Dabney Journal: Bioinformatics Date: 2009-06-17 Impact factor: 6.937
Authors: Elizabeth G Hill; John H Schwacke; Susana Comte-Walters; Elizabeth H Slate; Ann L Oberg; Jeanette E Eckel-Passow; Terry M Therneau; Kevin L Schey Journal: J Proteome Res Date: 2008-06-26 Impact factor: 4.466
Authors: Ashoka D Polpitiya; Wei-Jun Qian; Navdeep Jaitly; Vladislav A Petyuk; Joshua N Adkins; David G Camp; Gordon A Anderson; Richard D Smith Journal: Bioinformatics Date: 2008-05-03 Impact factor: 6.937
Authors: Tom Taverner; Yuliya V Karpievitch; Ashoka D Polpitiya; Joseph N Brown; Alan R Dabney; Gordon A Anderson; Richard D Smith Journal: Bioinformatics Date: 2012-07-19 Impact factor: 6.937
Authors: Bobbie-Jo M Webb-Robertson; Melissa M Matzke; Jon M Jacobs; Joel G Pounds; Katrina M Waters Journal: Proteomics Date: 2011-11-17 Impact factor: 3.984
Authors: Victor P Andreev; Vladislav A Petyuk; Heather M Brewer; Yuliya V Karpievitch; Fang Xie; Jennifer Clarke; David Camp; Richard D Smith; Andrew P Lieberman; Roger L Albin; Zafar Nawaz; Jimmy El Hokayem; Amanda J Myers Journal: J Proteome Res Date: 2012-05-17 Impact factor: 4.466
Authors: Yuliya V Karpievitch; Elizabeth G Hill; Anthony P Leclerc; Alan R Dabney; Jonas S Almeida Journal: PLoS One Date: 2009-09-18 Impact factor: 3.240