Literature DB >> 12096132

Getting more from less: algorithms for rapid protein identification with multiple short peptide sequences.

Aaron J Mackey1, Timothy A J Haystead, William R Pearson.   

Abstract

We describe two novel sequence similarity search algorithms, FASTS and FASTF, that use multiple short peptide sequences to identify homologous sequences in protein or DNA databases. FASTS searches with peptide sequences of unknown order, as obtained by mass spectrometry-based sequencing, evaluating all possible arrangements of the peptides. FASTF searches with mixed peptide sequences, as generated by Edman sequencing of unseparated mixtures of peptides. FASTF deconvolutes the mixture, using a greedy heuristic that allows rapid identification of high scoring alignments while reducing the total number of explored alternatives. Both algorithms use the heuristic FASTA comparison strategy to accelerate the search but use alignment probability, rather than similarity score, as the criterion for alignment optimality. Statistical estimates are calculated using an empirical correction to a theoretical probability. These calculated estimates were accurate within a factor of 10 for FASTS and 1000 for FASTF on our test dataset. FASTS requires only 15-20 total residues in three or four peptides to robustly identify homologues sharing 50% or greater protein sequence identity. FASTF requires about 25% more sequence data than FASTS for equivalent sensitivity, but additional sequence data are usually available from mixed Edman experiments. Thus, both algorithms can identify homologues that diverged 100 to 500 million years ago, allowing proteomic identification from organisms whose genomes have not been sequenced.

Mesh:

Substances:

Year:  2002        PMID: 12096132     DOI: 10.1074/mcp.m100004-mcp200

Source DB:  PubMed          Journal:  Mol Cell Proteomics        ISSN: 1535-9476            Impact factor:   5.911


  50 in total

Review 1.  Molecular biologist's guide to proteomics.

Authors:  Paul R Graves; Timothy A J Haystead
Journal:  Microbiol Mol Biol Rev       Date:  2002-03       Impact factor: 11.056

2.  De novo sequencing and homology searching.

Authors:  Bin Ma; Richard Johnson
Journal:  Mol Cell Proteomics       Date:  2011-11-16       Impact factor: 5.911

3.  Defining parameters for homology-tolerant database searching.

Authors:  J P Kayser; J L Vallet; R L Cerny
Journal:  J Biomol Tech       Date:  2004-12

4.  A case study of de novo sequence analysis of N-sulfonated peptides by MALDI TOF/TOF mass spectrometry.

Authors:  Bart Samyn; Griet Debyser; Kjell Sergeant; Bart Devreese; Jozef Van Beeumen
Journal:  J Am Soc Mass Spectrom       Date:  2004-12       Impact factor: 3.109

5.  T-cell recognition of Paracoccidioides brasiliensis gp43-derived peptides in patients with paracoccidioidomycosis and healthy individuals.

Authors:  Leo Kei Iwai; Márcia Yoshida; Aya Sadahiro; Washington Robert da Silva; Maria Lucia Marin; Anna Carla Goldberg; Maria Aparecida Juliano; Luiz Juliano; Maria Aparecida Shikanai-Yasuda; Jorge Kalil; Edecio Cunha-Neto; Luiz R Travassos
Journal:  Clin Vaccine Immunol       Date:  2007-02-28

6.  Separating the wheat from the chaff: unbiased filtering of background tandem mass spectra improves protein identification.

Authors:  Magno Junqueira; Victor Spirin; Tiago Santana Balbuena; Patrice Waridel; Vineeth Surendranath; Grigoriy Kryukov; Ivan Adzhubei; Henrik Thomas; Shamil Sunyaev; Andrej Shevchenko
Journal:  J Proteome Res       Date:  2008-06-18       Impact factor: 4.466

7.  Crystal structure of the autochaperone region from the Shigella flexneri autotransporter IcsA.

Authors:  Karin Kühnel; Dagmar Diezmann
Journal:  J Bacteriol       Date:  2011-02-18       Impact factor: 3.490

8.  Proteins involved in biotic and abiotic stress responses as the most significant biomarkers in the ripening of Pinot Noir skins.

Authors:  Alfredo Simone Negri; Elisa Robotti; Bhakti Prinsi; Luca Espen; Emilio Marengo
Journal:  Funct Integr Genomics       Date:  2011-01-14       Impact factor: 3.410

9.  Protein phosphorylation in amyloplasts regulates starch branching enzyme activity and protein-protein interactions.

Authors:  Ian J Tetlow; Robin Wait; Zhenxiao Lu; Rut Akkasaeng; Caroline G Bowsher; Sergio Esposito; Behjat Kosar-Hashemi; Matthew K Morell; Michael J Emes
Journal:  Plant Cell       Date:  2004-02-18       Impact factor: 11.277

10.  Evaluation of protein pattern changes in roots and leaves of Zea mays plants in response to nitrate availability by two-dimensional gel electrophoresis analysis.

Authors:  Bhakti Prinsi; Alfredo S Negri; Paolo Pesaresi; Maurizio Cocucci; Luca Espen
Journal:  BMC Plant Biol       Date:  2009-08-23       Impact factor: 4.215

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.