Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A novel algorithm for validating peptide identification from a shotgun proteomics search engine.

Literature DB >> 23402659

A novel algorithm for validating peptide identification from a shotgun proteomics search engine.

Ling Jian¹, Xinnan Niu, Zhonghang Xia, Parimal Samir, Chiranthani Sumanasekera, Zheng Mu, Jennifer L Jennings, Kristen L Hoek, Tara Allos, Leigh M Howard, Kathryn M Edwards, P Anthony Weil, Andrew J Link.

Abstract

Liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS) has revolutionized the proteomics analysis of complexes, cells, and tissues. In a typical proteomic analysis, the tandem mass spectra from a LC-MS/MS experiment are assigned to a peptide by a search engine that compares the experimental MS/MS peptide data to theoretical peptide sequences in a protein database. The peptide spectra matches are then used to infer a list of identified proteins in the original sample. However, the search engines often fail to distinguish between correct and incorrect peptides assignments. In this study, we designed and implemented a novel algorithm called De-Noise to reduce the number of incorrect peptide matches and maximize the number of correct peptides at a fixed false discovery rate using a minimal number of scoring outputs from the SEQUEST search engine. The novel algorithm uses a three-step process: data cleaning, data refining through a SVM-based decision function, and a final data refining step based on proteolytic peptide patterns. Using proteomics data generated on different types of mass spectrometers, we optimized the De-Noise algorithm on the basis of the resolution and mass accuracy of the mass spectrometer employed in the LC-MS/MS experiment. Our results demonstrate De-Noise improves peptide identification compared to other methods used to process the peptide sequence matches assigned by SEQUEST. Because De-Noise uses a limited number of scoring attributes, it can be easily implemented with other search engines.

Entities: Chemical Disease Gene Species

Mesh：

Year: 2013 PMID： 23402659 PMCID： PMC3608465 DOI： 10.1021/pr300631t

Source DB: PubMed Journal: J Proteome Res ISSN： 1535-3893 Impact factor: 4.466

31 in total

1. Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search.

Authors: Andrew Keller; Alexey I Nesvizhskii; Eugene Kolker; Ruedi Aebersold
Journal: Anal Chem Date: 2002-10-15 Impact factor: 6.986

2. The Orbitrap: a new mass spectrometer.

Authors: Qizhi Hu; Robert J Noll; Hongyan Li; Alexander Makarov; Mark Hardman; R Graham Cooks
Journal: J Mass Spectrom Date: 2005-04 Impact factor: 1.982

3. Comparative evaluation of mass spectrometry platforms used in large-scale proteomics investigations.

Authors: Joshua E Elias; Wilhelm Haas; Brendan K Faherty; Steven P Gygi
Journal: Nat Methods Date: 2005-09 Impact factor: 28.547

4. Determination of monoisotopic masses and ion populations for large biomolecules from resolved isotopic distributions.

Authors: M W Senko; S C Beu; F W McLaffertycor
Journal: J Am Soc Mass Spectrom Date: 1995-04 Impact factor: 3.109

5. Empirical approach to false discovery rate estimation in shotgun proteomics.

Authors: Anton A Goloborodko; Corina Mayerhofer; Alexander R Zubarev; Irina A Tarasova; Alexander V Gorshkov; Roman A Zubarev; Mikhail V Gorshkov
Journal: Rapid Commun Mass Spectrom Date: 2010-02 Impact factor: 2.419

6. Direct analysis of protein complexes using mass spectrometry.

Authors: A J Link; J Eng; D M Schieltz; E Carmack; G J Mize; D R Morris; B M Garvik; J R Yates
Journal: Nat Biotechnol Date: 1999-07 Impact factor: 54.908

7. Improvements to the percolator algorithm for Peptide identification from shotgun proteomics data sets.

Authors: Marina Spivak; Jason Weston; Léon Bottou; Lukas Käll; William Stafford Noble
Journal: J Proteome Res Date: 2009-07 Impact factor: 4.466

8. Saccharomyces Genome Database: the genomics resource of budding yeast.

Authors: J Michael Cherry; Eurie L Hong; Craig Amundsen; Rama Balakrishnan; Gail Binkley; Esther T Chan; Karen R Christie; Maria C Costanzo; Selina S Dwight; Stacia R Engel; Dianna G Fisk; Jodi E Hirschman; Benjamin C Hitz; Kalpana Karra; Cynthia J Krieger; Stuart R Miyasato; Rob S Nash; Julie Park; Marek S Skrzypek; Matt Simison; Shuai Weng; Edith D Wong
Journal: Nucleic Acids Res Date: 2011-11-21 Impact factor: 16.971

9. The Universal Protein Resource (UniProt).

Authors: Amos Bairoch; Rolf Apweiler; Cathy H Wu; Winona C Barker; Brigitte Boeckmann; Serenella Ferro; Elisabeth Gasteiger; Hongzhan Huang; Rodrigo Lopez; Michele Magrane; Maria J Martin; Darren A Natale; Claire O'Donovan; Nicole Redaschi; Lai-Su L Yeh
Journal: Nucleic Acids Res Date: 2005-01-01 Impact factor: 16.971

10. Optimization of filtering criterion for SEQUEST database searching to improve proteome coverage in shotgun proteomics.

Authors: Xinning Jiang; Xiaogang Jiang; Guanghui Han; Mingliang Ye; Hanfa Zou
Journal: BMC Bioinformatics Date: 2007-08-31 Impact factor: 3.169

2 in total

1. An adaptive classification model for peptide identification.

Authors: Xijun Liang; Zhonghang Xia; Ling Jian; Xinnan Niu; Andrew Link
Journal: BMC Genomics Date: 2015-11-10 Impact factor: 3.969

2. A cost-sensitive online learning method for peptide identification.

Authors: Xijun Liang; Zhonghang Xia; Ling Jian; Yongxiang Wang; Xinnan Niu; Andrew J Link
Journal: BMC Genomics Date: 2020-04-25 Impact factor: 3.969

2 in total