Literature DB >> 23402659

A novel algorithm for validating peptide identification from a shotgun proteomics search engine.

Ling Jian1, Xinnan Niu, Zhonghang Xia, Parimal Samir, Chiranthani Sumanasekera, Zheng Mu, Jennifer L Jennings, Kristen L Hoek, Tara Allos, Leigh M Howard, Kathryn M Edwards, P Anthony Weil, Andrew J Link.   

Abstract

Liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS) has revolutionized the proteomics analysis of complexes, cells, and tissues. In a typical proteomic analysis, the tandem mass spectra from a LC-MS/MS experiment are assigned to a peptide by a search engine that compares the experimental MS/MS peptide data to theoretical peptide sequences in a protein database. The peptide spectra matches are then used to infer a list of identified proteins in the original sample. However, the search engines often fail to distinguish between correct and incorrect peptides assignments. In this study, we designed and implemented a novel algorithm called De-Noise to reduce the number of incorrect peptide matches and maximize the number of correct peptides at a fixed false discovery rate using a minimal number of scoring outputs from the SEQUEST search engine. The novel algorithm uses a three-step process: data cleaning, data refining through a SVM-based decision function, and a final data refining step based on proteolytic peptide patterns. Using proteomics data generated on different types of mass spectrometers, we optimized the De-Noise algorithm on the basis of the resolution and mass accuracy of the mass spectrometer employed in the LC-MS/MS experiment. Our results demonstrate De-Noise improves peptide identification compared to other methods used to process the peptide sequence matches assigned by SEQUEST. Because De-Noise uses a limited number of scoring attributes, it can be easily implemented with other search engines.

Entities:  

Mesh:

Year:  2013        PMID: 23402659      PMCID: PMC3608465          DOI: 10.1021/pr300631t

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


  31 in total

1.  Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search.

Authors:  Andrew Keller; Alexey I Nesvizhskii; Eugene Kolker; Ruedi Aebersold
Journal:  Anal Chem       Date:  2002-10-15       Impact factor: 6.986

2.  The Orbitrap: a new mass spectrometer.

Authors:  Qizhi Hu; Robert J Noll; Hongyan Li; Alexander Makarov; Mark Hardman; R Graham Cooks
Journal:  J Mass Spectrom       Date:  2005-04       Impact factor: 1.982

3.  Comparative evaluation of mass spectrometry platforms used in large-scale proteomics investigations.

Authors:  Joshua E Elias; Wilhelm Haas; Brendan K Faherty; Steven P Gygi
Journal:  Nat Methods       Date:  2005-09       Impact factor: 28.547

4.  Determination of monoisotopic masses and ion populations for large biomolecules from resolved isotopic distributions.

Authors:  M W Senko; S C Beu; F W McLaffertycor
Journal:  J Am Soc Mass Spectrom       Date:  1995-04       Impact factor: 3.109

5.  Empirical approach to false discovery rate estimation in shotgun proteomics.

Authors:  Anton A Goloborodko; Corina Mayerhofer; Alexander R Zubarev; Irina A Tarasova; Alexander V Gorshkov; Roman A Zubarev; Mikhail V Gorshkov
Journal:  Rapid Commun Mass Spectrom       Date:  2010-02       Impact factor: 2.419

6.  Direct analysis of protein complexes using mass spectrometry.

Authors:  A J Link; J Eng; D M Schieltz; E Carmack; G J Mize; D R Morris; B M Garvik; J R Yates
Journal:  Nat Biotechnol       Date:  1999-07       Impact factor: 54.908

7.  Improvements to the percolator algorithm for Peptide identification from shotgun proteomics data sets.

Authors:  Marina Spivak; Jason Weston; Léon Bottou; Lukas Käll; William Stafford Noble
Journal:  J Proteome Res       Date:  2009-07       Impact factor: 4.466

8.  Saccharomyces Genome Database: the genomics resource of budding yeast.

Authors:  J Michael Cherry; Eurie L Hong; Craig Amundsen; Rama Balakrishnan; Gail Binkley; Esther T Chan; Karen R Christie; Maria C Costanzo; Selina S Dwight; Stacia R Engel; Dianna G Fisk; Jodi E Hirschman; Benjamin C Hitz; Kalpana Karra; Cynthia J Krieger; Stuart R Miyasato; Rob S Nash; Julie Park; Marek S Skrzypek; Matt Simison; Shuai Weng; Edith D Wong
Journal:  Nucleic Acids Res       Date:  2011-11-21       Impact factor: 16.971

9.  The Universal Protein Resource (UniProt).

Authors:  Amos Bairoch; Rolf Apweiler; Cathy H Wu; Winona C Barker; Brigitte Boeckmann; Serenella Ferro; Elisabeth Gasteiger; Hongzhan Huang; Rodrigo Lopez; Michele Magrane; Maria J Martin; Darren A Natale; Claire O'Donovan; Nicole Redaschi; Lai-Su L Yeh
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

10.  Optimization of filtering criterion for SEQUEST database searching to improve proteome coverage in shotgun proteomics.

Authors:  Xinning Jiang; Xiaogang Jiang; Guanghui Han; Mingliang Ye; Hanfa Zou
Journal:  BMC Bioinformatics       Date:  2007-08-31       Impact factor: 3.169

View more
  2 in total

1.  An adaptive classification model for peptide identification.

Authors:  Xijun Liang; Zhonghang Xia; Ling Jian; Xinnan Niu; Andrew Link
Journal:  BMC Genomics       Date:  2015-11-10       Impact factor: 3.969

2.  A cost-sensitive online learning method for peptide identification.

Authors:  Xijun Liang; Zhonghang Xia; Ling Jian; Yongxiang Wang; Xinnan Niu; Andrew J Link
Journal:  BMC Genomics       Date:  2020-04-25       Impact factor: 3.969

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.