Literature DB >> 27260494

A multi-model statistical approach for proteomic spectral count quantitation.

Owen E Branson1, Michael A Freitas2.   

Abstract

UNLABELLED: The rapid development of mass spectrometry (MS) technologies has solidified shotgun proteomics as the most powerful analytical platform for large-scale proteome interrogation. The ability to map and determine differential expression profiles of the entire proteome is the ultimate goal of shotgun proteomics. Label-free quantitation has proven to be a valid approach for discovery shotgun proteomics, especially when sample is limited. Label-free spectral count quantitation is an approach analogous to RNA sequencing whereby count data is used to determine differential expression. Here we show that statistical approaches developed to evaluate differential expression in RNA sequencing experiments can be applied to detect differential protein expression in label-free discovery proteomics. This approach, termed MultiSpec, utilizes open-source statistical platforms; namely edgeR, DESeq and baySeq, to statistically select protein candidates for further investigation. Furthermore, to remove bias associated with a single statistical approach a single ranked list of differentially expressed proteins is assembled by comparing edgeR and DESeq q-values directly with the false discovery rate (FDR) calculated by baySeq. This statistical approach is then extended when applied to spectral count data derived from multiple proteomic pipelines. The individual statistical results from multiple proteomic pipelines are integrated and cross-validated by means of collapsing protein groups. BIOLOGICAL SIGNIFICANCE: Spectral count data from shotgun proteomics experiments is semi-quantitative and semi-random, yet a robust way to estimate protein concentration. Tag-count approaches are routinely used to analyze RNA sequencing data sets. This approach, termed MultiSpec, utilizes multiple tag-count based statistical tests to determine differential protein expression from spectral counts. The statistical results from these tag-count approaches are combined in order to reach a final MultiSpec q-value to re-rank protein candidates. This re-ranking procedure is completed to remove bias associated with a single approach in order to better understand the true proteomic differences driving the biology in question. The MultiSpec approach can be extended to multiple proteomic pipelines. In such an instance, MultiSpec statistical results are integrated by collapsing protein groups across proteomic pipelines to provide a single ranked list of differentially expressed proteins. This integration mechanism is seamlessly integrated with the statistical analysis and provides the means to cross-validate protein inferences from multiple proteomic pipelines.
Copyright © 2016 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  DESeq; Proteomics; Spectral Counting; baySeq; edgeR; q-Value

Mesh:

Substances:

Year:  2016        PMID: 27260494      PMCID: PMC4967010          DOI: 10.1016/j.jprot.2016.05.032

Source DB:  PubMed          Journal:  J Proteomics        ISSN: 1874-3919            Impact factor:   4.044


  66 in total

1.  Probability-based validation of protein identifications using a modified SEQUEST algorithm.

Authors:  Michael J MacCoss; Christine C Wu; John R Yates
Journal:  Anal Chem       Date:  2002-11-01       Impact factor: 6.986

2.  Refinements to label free proteome quantitation: how to deal with peptides shared by multiple proteins.

Authors:  Ying Zhang; Zhihui Wen; Michael P Washburn; Laurence Florens
Journal:  Anal Chem       Date:  2010-03-15       Impact factor: 6.986

3.  A bayesian mixture model for comparative spectral count data in shotgun proteomics.

Authors:  James G Booth; Kirsten E Eilertson; Paul Dominic B Olinares; Haiyuan Yu
Journal:  Mol Cell Proteomics       Date:  2011-05-20       Impact factor: 5.911

4.  MassMatrix: a database search program for rapid characterization of proteins and peptides from tandem mass spectrometry data.

Authors:  Hua Xu; Michael A Freitas
Journal:  Proteomics       Date:  2009-03       Impact factor: 3.984

5.  baySeq: empirical Bayesian methods for identifying differential expression in sequence count data.

Authors:  Thomas J Hardcastle; Krystyna A Kelly
Journal:  BMC Bioinformatics       Date:  2010-08-10       Impact factor: 3.169

6.  Characterization of global yeast quantitative proteome data generated from the wild-type and glucose repression saccharomyces cerevisiae strains: the comparison of two quantitative methods.

Authors:  Renata Usaite; James Wohlschlegel; John D Venable; Sung K Park; Jens Nielsen; Lisbeth Olsson; John R Yates Iii
Journal:  J Proteome Res       Date:  2008-01       Impact factor: 4.466

7.  A cross-platform toolkit for mass spectrometry and proteomics.

Authors:  Matthew C Chambers; Brendan Maclean; Robert Burke; Dario Amodei; Daniel L Ruderman; Steffen Neumann; Laurent Gatto; Bernd Fischer; Brian Pratt; Jarrett Egertson; Katherine Hoff; Darren Kessner; Natalie Tasman; Nicholas Shulman; Barbara Frewen; Tahmina A Baker; Mi-Youn Brusniak; Christopher Paulse; David Creasy; Lisa Flashner; Kian Kani; Chris Moulding; Sean L Seymour; Lydia M Nuwaysir; Brent Lefebvre; Frank Kuhlmann; Joe Roark; Paape Rainer; Suckau Detlev; Tina Hemenway; Andreas Huhmer; James Langridge; Brian Connolly; Trey Chadick; Krisztina Holly; Josh Eckels; Eric W Deutsch; Robert L Moritz; Jonathan E Katz; David B Agus; Michael MacCoss; David L Tabb; Parag Mallick
Journal:  Nat Biotechnol       Date:  2012-10       Impact factor: 54.908

8.  Proteomic analysis reveals new cardiac-specific dystrophin-associated proteins.

Authors:  Eric K Johnson; Liwen Zhang; Marvin E Adams; Alistair Phillips; Michael A Freitas; Stanley C Froehner; Kari B Green-Church; Federica Montanaro
Journal:  PLoS One       Date:  2012-08-24       Impact factor: 3.240

9.  Meta-analysis methods for combining multiple expression profiles: comparisons, statistical characterization and an application guideline.

Authors:  Lun-Ching Chang; Hui-Min Lin; Etienne Sibille; George C Tseng
Journal:  BMC Bioinformatics       Date:  2013-12-21       Impact factor: 3.169

10.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.

Authors:  Mark D Robinson; Davis J McCarthy; Gordon K Smyth
Journal:  Bioinformatics       Date:  2009-11-11       Impact factor: 6.937

View more
  12 in total

1.  Simultaneous Improvement in the Precision, Accuracy, and Robustness of Label-free Proteome Quantification by Optimizing Data Manipulation Chains.

Authors:  Jing Tang; Jianbo Fu; Yunxia Wang; Yongchao Luo; Qingxia Yang; Bo Li; Gao Tu; Jiajun Hong; Xuejiao Cui; Yuzong Chen; Lixia Yao; Weiwei Xue; Feng Zhu
Journal:  Mol Cell Proteomics       Date:  2019-05-16       Impact factor: 5.911

2.  Extended Multiplexing of Tandem Mass Tags (TMT) Labeling Reveals Age and High Fat Diet Specific Proteome Changes in Mouse Epididymal Adipose Tissue.

Authors:  Deanna L Plubell; Phillip A Wilmarth; Yuqi Zhao; Alexandra M Fenton; Jessica Minnier; Ashok P Reddy; John Klimek; Xia Yang; Larry L David; Nathalie Pamir
Journal:  Mol Cell Proteomics       Date:  2017-03-21       Impact factor: 5.911

3.  The human tubal lavage proteome reveals biological processes that may govern the pathology of hydrosalpinx.

Authors:  Elizabeth Yohannes; Avedis A Kazanjian; Morgan E Lindsay; Dennis T Fujii; Nicholas Ieronimakis; Gregory E Chow; Ronald D Beesley; Ryan J Heitmann; Richard O Burney
Journal:  Sci Rep       Date:  2019-06-20       Impact factor: 4.379

4.  Discovery and Validation of a Novel Neutrophil Activation Marker Associated with Obesity.

Authors:  Yue Pan; Jeong-Hyeon Choi; Huidong Shi; Liwen Zhang; Shaoyong Su; Xiaoling Wang
Journal:  Sci Rep       Date:  2019-03-05       Impact factor: 4.379

5.  Identification of candidate genetic variants and altered protein expression in neural stem and mature neural cells support altered microtubule function to be an essential component in bipolar disorder.

Authors:  Katarina Truvé; Toshima Z Parris; Dzeneta Vizlin-Hodzic; Susanne Salmela; Evelin Berger; Hans Ågren; Keiko Funa
Journal:  Transl Psychiatry       Date:  2020-11-09       Impact factor: 6.222

Review 6.  Bioinformatic Analysis of Temporal and Spatial Proteome Alternations During Infections.

Authors:  Matineh Rahmatbakhsh; Alla Gagarinova; Mohan Babu
Journal:  Front Genet       Date:  2021-07-02       Impact factor: 4.599

7.  The role of extracellular matrix in mouse and human corneal neovascularization.

Authors:  M Barbariga; F Vallone; E Mosca; F Bignami; C Magagnotti; P Fonteyne; F Chiappori; L Milanesi; P Rama; A Andolfo; G Ferrari
Journal:  Sci Rep       Date:  2019-10-03       Impact factor: 4.379

8.  Multiple Imputation Approaches Applied to the Missing Value Problem in Bottom-Up Proteomics.

Authors:  Miranda L Gardner; Michael A Freitas
Journal:  Int J Mol Sci       Date:  2021-09-06       Impact factor: 5.923

9.  The DREAM complex represses growth in response to DNA damage in Arabidopsis.

Authors:  Lucas Lang; Aladár Pettkó-Szandtner; Hasibe Tunçay Elbaşı; Hirotomo Takatsuka; Yuji Nomoto; Ahmad Zaki; Stefan Dorokhov; Geert De Jaeger; Dominique Eeckhout; Masaki Ito; Zoltán Magyar; László Bögre; Maren Heese; Arp Schnittger
Journal:  Life Sci Alliance       Date:  2021-09-28

10.  Sex Differences in the Ventral Tegmental Area and Nucleus Accumbens Proteome at Baseline and Following Nicotine Exposure.

Authors:  Angela M Lee; Mohammad Shahid Mansuri; Rashaun S Wilson; TuKiet T Lam; Angus C Nairn; Marina R Picciotto
Journal:  Front Mol Neurosci       Date:  2021-07-14       Impact factor: 5.639

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.