Literature DB >> 23412978

A two-step database search method improves sensitivity in peptide sequence matches for metaproteomics and proteogenomics studies.

Pratik Jagtap1, Jill Goslinga, Joel A Kooren, Thomas McGowan, Matthew S Wroblewski, Sean L Seymour, Timothy J Griffin.   

Abstract

Large databases (>10(6) sequences) used in metaproteomic and proteogenomic studies present challenges in matching peptide sequences to MS/MS data using database-search programs. Most notably, strict filtering to avoid false-positive matches leads to more false negatives, thus constraining the number of peptide matches. To address this challenge, we developed a two-step method wherein matches derived from a primary search against a large database were used to create a smaller subset database. The second search was performed against a target-decoy version of this subset database merged with a host database. High confidence peptide sequence matches were then used to infer protein identities. Applying our two-step method for both metaproteomic and proteogenomic analysis resulted in twice the number of high confidence peptide sequence matches in each case, as compared to the conventional one-step method. The two-step method captured almost all of the same peptides matched by the one-step method, with a majority of the additional matches being false negatives from the one-step method. Furthermore, the two-step method improved results regardless of the database search program used. Our results show that our two-step method maximizes the peptide matching sensitivity for applications requiring large databases, especially valuable for proteogenomics and metaproteomics studies.
© 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23412978      PMCID: PMC3633484          DOI: 10.1002/pmic.201200352

Source DB:  PubMed          Journal:  Proteomics        ISSN: 1615-9853            Impact factor:   3.984


  15 in total

1.  The complete peptide dictionary--a meta-proteomics resource.

Authors:  Manor Askenazi; Jarrod A Marto; Michal Linial
Journal:  Proteomics       Date:  2010-12       Impact factor: 3.984

2.  The Paragon Algorithm, a next generation search engine that uses sequence temperature values and feature probabilities to identify peptides from tandem mass spectra.

Authors:  Ignat V Shilov; Sean L Seymour; Alpesh A Patel; Alex Loboda; Wilfred H Tang; Sean P Keating; Christie L Hunter; Lydia M Nuwaysir; Daniel A Schaeffer
Journal:  Mol Cell Proteomics       Date:  2007-05-27       Impact factor: 5.911

3.  Nonlinear fitting method for determining local false discovery rates from decoy database searches.

Authors:  Wilfred H Tang; Ignat V Shilov; Sean L Seymour
Journal:  J Proteome Res       Date:  2008-08-14       Impact factor: 4.466

Review 4.  Proteogenomics.

Authors:  Santosh Renuse; Raghothama Chaerkady; Akhilesh Pandey
Journal:  Proteomics       Date:  2011-01-18       Impact factor: 3.984

5.  Workflow for analysis of high mass accuracy salivary data set using MaxQuant and ProteinPilot search algorithm.

Authors:  Pratik Jagtap; Sricharan Bandhakavi; LeeAnn Higgins; Thomas McGowan; Rongxiao Sa; Matthew D Stone; John Chilton; Edgar A Arriaga; Sean L Seymour; Timothy J Griffin
Journal:  Proteomics       Date:  2012-06       Impact factor: 3.984

6.  Deep metaproteomic analysis of human salivary supernatant.

Authors:  Pratik Jagtap; Thomas McGowan; Sricharan Bandhakavi; Zheng Jin Tu; Sean Seymour; Timothy J Griffin; Joel D Rudney
Journal:  Proteomics       Date:  2012-04       Impact factor: 3.984

Review 7.  Exploring mixed microbial community functioning: recent advances in metaproteomics.

Authors:  Alma Siggins; Eoin Gunnigle; Florence Abram
Journal:  FEMS Microbiol Ecol       Date:  2012-01-16       Impact factor: 4.194

8.  An iterative workflow for mining the human intestinal metaproteome.

Authors:  Koos Rooijers; Carolin Kolmeder; Catherine Juste; Joël Doré; Mark de Been; Sjef Boeren; Pilar Galan; Christian Beauvallet; Willem M de Vos; Peter J Schaap
Journal:  BMC Genomics       Date:  2011-01-05       Impact factor: 3.969

9.  Strategies for metagenomic-guided whole-community proteomics of complex microbial environments.

Authors:  Brandi L Cantarel; Alison R Erickson; Nathan C VerBerkmoes; Brian K Erickson; Patricia A Carey; Chongle Pan; Manesh Shah; Emmanuel F Mongodin; Janet K Jansson; Claire M Fraser-Liggett; Robert L Hettich
Journal:  PLoS One       Date:  2011-11-23       Impact factor: 3.240

10.  Evaluating the potential of a novel oral lesion exudate collection method coupled with mass spectrometry-based proteomics for oral cancer biomarker discovery.

Authors:  Joel A Kooren; Nelson L Rhodus; Chuanning Tang; Pratik D Jagtap; Bryan J Horrigan; Timothy J Griffin
Journal:  Clin Proteomics       Date:  2011-09-13       Impact factor: 3.988

View more
  58 in total

1.  Large-scale mass spectrometric detection of variant peptides resulting from nonsynonymous nucleotide differences.

Authors:  Gloria M Sheynkman; Michael R Shortreed; Brian L Frey; Mark Scalf; Lloyd M Smith
Journal:  J Proteome Res       Date:  2013-11-11       Impact factor: 4.466

2.  A Meta-proteogenomic Approach to Peptide Identification Incorporating Assembly Uncertainty and Genomic Variation.

Authors:  Sujun Li; Haixu Tang; Yuzhen Ye
Journal:  Mol Cell Proteomics       Date:  2019-05-29       Impact factor: 5.911

3.  ProteoStorm: An Ultrafast Metaproteomics Database Search Framework.

Authors:  Doruk Beyter; Miin S Lin; Yanbao Yu; Rembert Pieper; Vineet Bafna
Journal:  Cell Syst       Date:  2018-09-26       Impact factor: 10.304

4.  Challenges and Opportunities for Biological Mass Spectrometry Core Facilities in the Developing World.

Authors:  Liam Bell; Bridget Calder; Reinhard Hiller; Ashwil Klein; Nelson C Soares; Stoyan H Stoychev; Barend C Vorster; David L Tabb
Journal:  J Biomol Tech       Date:  2018-03-23

5.  Distribution of cuticular proteins in different structures of adult Anopheles gambiae.

Authors:  Yihong Zhou; Majors J Badgett; John Hunter Bowen; Laura Vannini; Ron Orlando; Judith H Willis
Journal:  Insect Biochem Mol Biol       Date:  2016-05-12       Impact factor: 4.714

6.  Onco-proteogenomics: cancer proteomics joins forces with genomics.

Authors:  Javier A Alfaro; Ankit Sinha; Thomas Kislinger; Paul C Boutros
Journal:  Nat Methods       Date:  2014-11       Impact factor: 28.547

Review 7.  Progress and Challenges in Ocean Metaproteomics and Proposed Best Practices for Data Sharing.

Authors:  Mak A Saito; Erin M Bertrand; Megan E Duffy; David A Gaylord; Noelle A Held; William Judson Hervey; Robert L Hettich; Pratik D Jagtap; Michael G Janech; Danie B Kinkade; Dagmar H Leary; Matthew R McIlvin; Eli K Moore; Robert M Morris; Benjamin A Neely; Brook L Nunn; Jaclyn K Saunders; Adam I Shepherd; Nicholas I Symmonds; David A Walsh
Journal:  J Proteome Res       Date:  2019-03-12       Impact factor: 4.466

8.  GPU-DAEMON: GPU algorithm design, data management & optimization template for array based big omics data.

Authors:  Muaaz Gul Awan; Taban Eslami; Fahad Saeed
Journal:  Comput Biol Med       Date:  2018-08-16       Impact factor: 4.589

Review 9.  Proteogenomics from a bioinformatics angle: A growing field.

Authors:  Gerben Menschaert; David Fenyö
Journal:  Mass Spectrom Rev       Date:  2015-12-15       Impact factor: 10.946

Review 10.  Proteogenomics: Integrating Next-Generation Sequencing and Mass Spectrometry to Characterize Human Proteomic Variation.

Authors:  Gloria M Sheynkman; Michael R Shortreed; Anthony J Cesnik; Lloyd M Smith
Journal:  Annu Rev Anal Chem (Palo Alto Calif)       Date:  2016-03-30       Impact factor: 10.745

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.