Literature DB >> 26238326

Using SEQUEST with theoretically complete sequence databases.

Rovshan G Sadygov1,2.   

Abstract

SEQUEST has long been used to identify peptides/proteins from their tandem mass spectra and protein sequence databases. The algorithm has proven to be hugely successful for its sensitivity and specificity in identifying peptides/proteins, the sequences of which are present in the protein sequence databases. In this work, we report on work that attempts a new use for the algorithm by applying it to search a complete list of theoretically possible peptides, a de novo-like sequencing. We used freely available mass spectral data and determined a number of unique peptides as identified by SEQUEST. Using masses of these peptides and the mass accuracy of 0.001 Da, we have created a database of all theoretically possible peptide sequences corresponding to the precursor masses. We used our recently developed algorithm for determining all amino acid compositions corresponding to a mass interval, and used a lexicographic ordering to generate theoretical sequences from the compositions. The newly generated theoretical database was many-fold more complex than the original protein sequence database. We used SEQUEST to search and identify the best matches to the spectra from all theoretically possible peptide sequences. We found that SEQUEST cross-correlation score ranked the correct peptide match among the top sequence matches. The results testify to the high specificity of SEQUEST when combined with the high mass accuracy for intact peptides. Graphical Abstract ᅟ.

Entities:  

Keywords:  All theoretically possible peptides; De novo Peptide sequencing; Mass distribution of peptides; SEQUEST

Mesh:

Year:  2015        PMID: 26238326      PMCID: PMC4607654          DOI: 10.1007/s13361-015-1228-5

Source DB:  PubMed          Journal:  J Am Soc Mass Spectrom        ISSN: 1044-0305            Impact factor:   3.109


  40 in total

1.  Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search.

Authors:  Andrew Keller; Alexey I Nesvizhskii; Eugene Kolker; Ruedi Aebersold
Journal:  Anal Chem       Date:  2002-10-15       Impact factor: 6.986

2.  GutenTag: high-throughput sequence tagging via an empirically derived fragmentation model.

Authors:  David L Tabb; Anita Saraf; John R Yates
Journal:  Anal Chem       Date:  2003-12-01       Impact factor: 6.986

3.  Searching sequence databases via de novo peptide sequencing by tandem mass spectrometry.

Authors:  Richard S Johnson; J Alex Taylor
Journal:  Mol Biotechnol       Date:  2002-11       Impact factor: 2.695

4.  Open mass spectrometry search algorithm.

Authors:  Lewis Y Geer; Sanford P Markey; Jeffrey A Kowalak; Lukas Wagner; Ming Xu; Dawn M Maynard; Xiaoyu Yang; Wenyao Shi; Stephen H Bryant
Journal:  J Proteome Res       Date:  2004 Sep-Oct       Impact factor: 4.466

Review 5.  Protein analysis by shotgun/bottom-up proteomics.

Authors:  Yaoyang Zhang; Bryan R Fonslow; Bing Shan; Moon-Chang Baek; John R Yates
Journal:  Chem Rev       Date:  2013-02-26       Impact factor: 60.622

6.  Error-tolerant identification of peptides in sequence databases by peptide sequence tags.

Authors:  M Mann; M Wilm
Journal:  Anal Chem       Date:  1994-12-15       Impact factor: 6.986

7.  Fourier-transform mass spectrometry for automated fragmentation and identification of 5-20 kDa proteins in mixtures.

Authors:  Jeffrey R Johnson; Fanyu Meng; Andrew J Forbes; Benjamin J Cargile; Neil L Kelleher
Journal:  Electrophoresis       Date:  2002-09       Impact factor: 3.535

8.  Examining troughs in the mass distribution of all theoretically possible tryptic peptides.

Authors:  Alexey V Nefedov; Indranil Mitra; Allan R Brasier; Rovshan G Sadygov
Journal:  J Proteome Res       Date:  2011-08-09       Impact factor: 4.466

9.  Byonic: advanced peptide and protein identification software.

Authors:  Marshall Bern; Yong J Kil; Christopher Becker
Journal:  Curr Protoc Bioinformatics       Date:  2012-12

10.  Use of singular value decomposition analysis to differentiate phosphorylated precursors in strong cation exchange fractions.

Authors:  Rovshan G Sadygov
Journal:  Electrophoresis       Date:  2014-07-24       Impact factor: 3.535

View more
  3 in total

1.  Poisson Model To Generate Isotope Distribution for Biomolecules.

Authors:  Rovshan G Sadygov
Journal:  J Proteome Res       Date:  2017-12-19       Impact factor: 4.466

2.  Protein turnover models for LC-MS data of heavy water metabolic labeling.

Authors:  Rovshan G Sadygov
Journal:  Brief Bioinform       Date:  2022-03-10       Impact factor: 11.622

Review 3.  A Critical Review of Bottom-Up Proteomics: The Good, the Bad, and the Future of this Field.

Authors:  Emmalyn J Dupree; Madhuri Jayathirtha; Hannah Yorkey; Marius Mihasan; Brindusa Alina Petre; Costel C Darie
Journal:  Proteomes       Date:  2020-07-06
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.