Literature DB >> 22331862

Faster mass spectrometry-based protein inference: junction trees are more efficient than sampling and marginalization by enumeration.

Oliver Serang1, William Stafford Noble.   

Abstract

The problem of identifying the proteins in a complex mixture using tandem mass spectrometry can be framed as an inference problem on a graph that connects peptides to proteins. Several existing protein identification methods make use of statistical inference methods for graphical models, including expectation maximization, Markov chain Monte Carlo, and full marginalization coupled with approximation heuristics. We show that, for this problem, the majority of the cost of inference usually comes from a few highly connected subgraphs. Furthermore, we evaluate three different statistical inference methods using a common graphical model, and we demonstrate that junction tree inference substantially improves rates of convergence compared to existing methods. The python code used for this paper is available at http://noble.gs.washington.edu/proj/fido.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22331862      PMCID: PMC3389307          DOI: 10.1109/TCBB.2012.26

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  13 in total

1.  Correctness of local probability in graphical models with loops.

Authors:  Y Weiss
Journal:  Neural Comput       Date:  2000-01       Impact factor: 2.026

2.  Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search.

Authors:  Andrew Keller; Alexey I Nesvizhskii; Eugene Kolker; Ruedi Aebersold
Journal:  Anal Chem       Date:  2002-10-15       Impact factor: 6.986

3.  A statistical model for identifying proteins by tandem mass spectrometry.

Authors:  Alexey I Nesvizhskii; Andrew Keller; Eugene Kolker; Ruedi Aebersold
Journal:  Anal Chem       Date:  2003-09-01       Impact factor: 6.986

4.  A hierarchical statistical model to assess the confidence of peptides and proteins inferred from tandem mass spectrometry.

Authors:  Changyu Shen; Zhiping Wang; Ganesh Shankar; Xiang Zhang; Lang Li
Journal:  Bioinformatics       Date:  2007-11-17       Impact factor: 6.937

Review 5.  Analysis and validation of proteomic data generated by tandem mass spectrometry.

Authors:  Alexey I Nesvizhskii; Olga Vitek; Ruedi Aebersold
Journal:  Nat Methods       Date:  2007-10       Impact factor: 28.547

6.  An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database.

Authors:  J K Eng; A L McCormack; J R Yates
Journal:  J Am Soc Mass Spectrom       Date:  1994-11       Impact factor: 3.109

7.  Rapid and accurate peptide identification from tandem mass spectra.

Authors:  Christopher Y Park; Aaron A Klammer; Lukas Käll; Michael J MacCoss; William S Noble
Journal:  J Proteome Res       Date:  2008-05-28       Impact factor: 4.466

8.  A review of statistical methods for protein identification using tandem mass spectrometry.

Authors:  Oliver Serang; William Noble
Journal:  Stat Interface       Date:  2012       Impact factor: 0.582

9.  EBP, a program for protein identification using multiple tandem mass spectrometry datasets.

Authors:  Thomas S Price; Margaret B Lucitt; Weichen Wu; David J Austin; Angel Pizarro; Anastasia K Yocum; Ian A Blair; Garret A FitzGerald; Tilo Grosser
Journal:  Mol Cell Proteomics       Date:  2006-12-12       Impact factor: 5.911

10.  Efficient marginalization to compute protein posterior probabilities from shotgun mass spectrometry data.

Authors:  Oliver Serang; Michael J MacCoss; William Stafford Noble
Journal:  J Proteome Res       Date:  2010-10-01       Impact factor: 4.466

View more
  7 in total

1.  RGS10 physically and functionally interacts with STIM2 and requires store-operated calcium entry to regulate pro-inflammatory gene expression in microglia.

Authors:  Menbere Y Wendimu; Mohammed Alqinyah; Stephen Vella; Phillip Dean; Faris Almutairi; Roseanne Davila-Rivera; Shima Rayatpisheh; James Wohlschlegel; Silvia Moreno; Shelley B Hooks
Journal:  Cell Signal       Date:  2021-03-09       Impact factor: 4.850

2.  Delayed Processing of Secretin-Induced Pancreas Fluid Influences the Quality and Integrity of Proteins and Nucleic Acids.

Authors:  Zobeida Cruz-Monserrate; Kristyn Gumpper; Sabrina Kaul; Niharika Badi; Samantha Terhorst; Kelly Dubay; Gregory B Lesinski; William Fisher; Amy McElhany; Luis F Lara; Somashekar Krishna; Thomas Mace; Natalia Higuita-Castro; Lilibeth Ortega-Pineda; Michael A Freitas; Alice Hinton; Dhiraj Yadav; Phil A Hart; Stephen J Pandol; Saima Ahmed; Benoit Fatou; Hanno Steen; Darwin L Conwell
Journal:  Pancreas       Date:  2021-01-01       Impact factor: 3.243

Review 3.  Computational approaches to protein inference in shotgun proteomics.

Authors:  Yong Fuga Li; Predrag Radivojac
Journal:  BMC Bioinformatics       Date:  2012-11-05       Impact factor: 3.169

4.  Novel Application of Junction Trees to the Interpretation of Epigenetic Differences among Lung Cancer Subtypes.

Authors:  Arturo Lopez Pineda; Vanathi Gopalakrishnan
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2015-03-23

5.  Multi-omic network signatures of disease.

Authors:  David L Gibbs; Lisa Gralinski; Ralph S Baric; Shannon K McWeeney
Journal:  Front Genet       Date:  2014-01-07       Impact factor: 4.599

6.  The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference.

Authors:  Oliver Serang
Journal:  PLoS One       Date:  2014-03-13       Impact factor: 3.240

Review 7.  Soil and leaf litter metaproteomics-a brief guideline from sampling to understanding.

Authors:  Katharina M Keiblinger; Stephan Fuchs; Sophie Zechmeister-Boltenstern; Katharina Riedel
Journal:  FEMS Microbiol Ecol       Date:  2016-08-21       Impact factor: 4.194

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.