Literature DB >> 22628520

Application of survival analysis methodology to the quantitative analysis of LC-MS proteomics data.

Carmen D Tekwe1, Raymond J Carroll, Alan R Dabney.   

Abstract

MOTIVATION: Protein abundance in quantitative proteomics is often based on observed spectral features derived from liquid chromatography mass spectrometry (LC-MS) or LC-MS/MS experiments. Peak intensities are largely non-normal in distribution. Furthermore, LC-MS-based proteomics data frequently have large proportions of missing peak intensities due to censoring mechanisms on low-abundance spectral features. Recognizing that the observed peak intensities detected with the LC-MS method are all positive, skewed and often left-censored, we propose using survival methodology to carry out differential expression analysis of proteins. Various standard statistical techniques including non-parametric tests such as the Kolmogorov-Smirnov and Wilcoxon-Mann-Whitney rank sum tests, and the parametric survival model and accelerated failure time-model with log-normal, log-logistic and Weibull distributions were used to detect any differentially expressed proteins. The statistical operating characteristics of each method are explored using both real and simulated datasets.
RESULTS: Survival methods generally have greater statistical power than standard differential expression methods when the proportion of missing protein level data is 5% or more. In particular, the AFT models we consider consistently achieve greater statistical power than standard testing procedures, with the discrepancy widening with increasing missingness in the proportions. AVAILABILITY: The testing procedures discussed in this article can all be performed using readily available software such as R. The R codes are provided as supplemental materials. CONTACT: ctekwe@stat.tamu.edu.

Mesh:

Substances:

Year:  2012        PMID: 22628520      PMCID: PMC3400956          DOI: 10.1093/bioinformatics/bts306

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  26 in total

1.  Gaussian mixture clustering and imputation of microarray data.

Authors:  Ming Ouyang; William J Welsh; Panos Georgopoulos
Journal:  Bioinformatics       Date:  2004-01-29       Impact factor: 6.937

2.  Increased power for the analysis of label-free LC-MS/MS proteomics data by combining spectral counts and peptide peak attributes.

Authors:  Lee Dicker; Xihong Lin; Alexander R Ivanov
Journal:  Mol Cell Proteomics       Date:  2010-09-07       Impact factor: 5.911

3.  DNA microarray data imputation and significance analysis of differential expression.

Authors:  Rebecka Jörnsten; Hui-Yu Wang; William J Welsh; Ming Ouyang
Journal:  Bioinformatics       Date:  2005-08-23       Impact factor: 6.937

4.  Clinical proteomics: the promises and challenges of mass spectrometry-based biomarker discovery.

Authors:  Imelda E deVera; Jonathan E Katz; David B Agus
Journal:  Clin Adv Hematol Oncol       Date:  2006-07

5.  PRISM: a data management system for high-throughput proteomics.

Authors:  Gary R Kiebel; Ken J Auberry; Navdeep Jaitly; David A Clark; Matthew E Monroe; Elena S Peterson; Nikola Tolić; Gordon A Anderson; Richard D Smith
Journal:  Proteomics       Date:  2006-03       Impact factor: 3.984

6.  Association of the clusterin gene polymorphisms with type 2 diabetes mellitus.

Authors:  Makoto Daimon; Toshihide Oizumi; Shigeru Karasawa; Wataru Kaino; Kaoru Takase; Kyouko Tada; Yumi Jimbu; Kiriko Wada; Wataru Kameda; Shinji Susa; Masaaki Muramatsu; Isao Kubota; Sumio Kawata; Takeo Kato
Journal:  Metabolism       Date:  2010-09-20       Impact factor: 8.694

7.  Comparison of spectral counting and metabolic stable isotope labeling for use with quantitative microbial proteomics.

Authors:  Erik L Hendrickson; Qiangwei Xia; Tiansong Wang; John A Leigh; Murray Hackett
Journal:  Analyst       Date:  2006-10-11       Impact factor: 4.616

8.  Talin immunogold density increases in sciatic nerve of diabetic rats after nerve growth factor treatment.

Authors:  Waleed M Renno; Farid Saleh; Ivo Klepacek; Farida Al-Awadi
Journal:  Medicina (Kaunas)       Date:  2006       Impact factor: 2.430

Review 9.  Mass spectrometry-based label-free quantitative proteomics.

Authors:  Wenhong Zhu; Jeffrey W Smith; Chun-Ming Huang
Journal:  J Biomed Biotechnol       Date:  2009-11-10

10.  Comparing transformation methods for DNA microarray data.

Authors:  Helene H Thygesen; Aeilko H Zwinderman
Journal:  BMC Bioinformatics       Date:  2004-06-17       Impact factor: 3.169

View more
  10 in total

1.  Multivariate two-part statistics for analysis of correlated mass spectrometry data from multiple biological specimens.

Authors:  Sandra L Taylor; L Renee Ruhaak; Robert H Weiss; Karen Kelly; Kyoungmi Kim
Journal:  Bioinformatics       Date:  2016-09-04       Impact factor: 6.937

Review 2.  Review, evaluation, and discussion of the challenges of missing value imputation for mass spectrometry-based label-free global proteomics.

Authors:  Bobbie-Jo M Webb-Robertson; Holli K Wiberg; Melissa M Matzke; Joseph N Brown; Jing Wang; Jason E McDermott; Richard D Smith; Karin D Rodland; Thomas O Metz; Joel G Pounds; Katrina M Waters
Journal:  J Proteome Res       Date:  2015-04-22       Impact factor: 4.466

3.  The effects of nonignorable missing data on label-free mass spectrometry proteomics experiments.

Authors:  Jonathon J O'Brien; Harsha P Gunawardena; Joao A Paulo; Xian Chen; Joseph G Ibrahim; Steven P Gygi; Bahjat F Qaqish
Journal:  Ann Appl Stat       Date:  2018-11-13       Impact factor: 2.083

4.  Accounting for undetected compounds in statistical analyses of mass spectrometry 'omic studies.

Authors:  Sandra L Taylor; Gary S Leiserowitz; Kyoungmi Kim
Journal:  Stat Appl Genet Mol Biol       Date:  2013-12

5.  Comparison of imputation and imputation-free methods for statistical analysis of mass spectrometry data with missing data.

Authors:  Sandra Taylor; Matthew Ponzini; Machelle Wilson; Kyoungmi Kim
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 13.994

6.  Statistical characterization of therapeutic protein modifications.

Authors:  Tsung-Heng Tsai; Zhiqi Hao; Qiuting Hong; Benjamin Moore; Cinzia Stella; Jeffrey H Zhang; Yan Chen; Michael Kim; Theo Koulis; Gregory A Ryslik; Erik Verschueren; Fred Jacobson; William E Haskins; Olga Vitek
Journal:  Sci Rep       Date:  2017-08-11       Impact factor: 4.379

7.  Joint Bounding of Peaks Across Samples Improves Differential Analysis in Mass Spectrometry-Based Metabolomics.

Authors:  Leslie Myint; Andre Kleensang; Liang Zhao; Thomas Hartung; Kasper D Hansen
Journal:  Anal Chem       Date:  2017-03-07       Impact factor: 6.986

8.  GMSimpute: a generalized two-step Lasso approach to impute missing values in label-free mass spectrum analysis.

Authors:  Qian Li; Kate Fisher; Wenjun Meng; Bin Fang; Eric Welsh; Eric B Haura; John M Koomen; Steven A Eschrich; Brooke L Fridley; Y Ann Chen
Journal:  Bioinformatics       Date:  2020-01-01       Impact factor: 6.937

9.  Statistical protein quantification and significance analysis in label-free LC-MS experiments with complex designs.

Authors:  Timothy Clough; Safia Thaminy; Susanne Ragg; Ruedi Aebersold; Olga Vitek
Journal:  BMC Bioinformatics       Date:  2012-11-05       Impact factor: 3.169

10.  High-resolution metabolomic profiling of Alzheimer's disease in plasma.

Authors:  Megan M Niedzwiecki; Douglas I Walker; Jennifer Christina Howell; Kelly D Watts; Dean P Jones; Gary W Miller; William T Hu
Journal:  Ann Clin Transl Neurol       Date:  2019-12-11       Impact factor: 4.511

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.