Shotgun protein sequencing with meta-contig assembly.

Adrian Guthals, Karl R Clauser, Nuno Bandeira.

Abstract

Full-length de novo sequencing from tandem mass (MS/MS) spectra of unknown proteins such as antibodies or proteins from organisms with unsequenced genomes remains a challenging open problem. Conventional algorithms designed to individually sequence each MS/MS spectrum are limited by incomplete peptide fragmentation or low signal-to-noise ratios and tend to result in short de novo sequences at low sequencing accuracy. Our shotgun protein sequencing (SPS) approach was developed to ameliorate these limitations by first finding groups of unidentified spectra from the same peptides (contigs) and then deriving a consensus de novo sequence for each assembled set of spectra (contig sequences). Although SPS enables much more accurate reconstruction of de novo sequences longer than can be recovered from individual MS/MS spectra, it still requires error-tolerant matching to homologous proteins to group smaller contig sequences into full-length protein sequences, thus limiting its effectiveness on sequences from poorly annotated proteins. Using low- and high-resolution CID and high-resolution HCD MS/MS spectra, we address this limitation with a Meta-SPS algorithm designed to overlap and further assemble SPS contigs into Meta-SPS de novo contig sequences extending as long as 100 amino acids at over 97% accuracy without requiring any knowledge of homologous protein sequences. We demonstrate Meta-SPS using distinct MS/MS data sets obtained with separate enzymatic digestions and discuss how the remaining de novo sequencing limitations relate to MS/MS acquisition settings.
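
The idea of overlapping contigs and assembling them further can be illustrated with a toy sketch. The code below is not the Meta-SPS algorithm: it assumes contigs are already exact amino-acid strings and merges them greedily by suffix/prefix overlap, whereas the abstract describes assembly from MS/MS spectra, where residues are inferred from fragment masses and matching must tolerate errors. The function names, the `min_overlap` parameter, and the example peptides are hypothetical.

```python
from typing import List, Optional, Tuple


def suffix_prefix_overlap(left: str, right: str, min_overlap: int) -> int:
    """Return the length of the longest suffix of `left` that is a prefix of `right`.

    Returns 0 when no overlap of at least `min_overlap` residues exists.
    """
    longest = min(len(left), len(right))
    for size in range(longest, min_overlap - 1, -1):
        if left.endswith(right[:size]):
            return size
    return 0


def merge_contigs(contigs: List[str], min_overlap: int = 3) -> List[str]:
    """Greedily merge the best-overlapping pair until no pair overlaps.

    Toy assumption: contigs are error-free strings, so exact matching suffices.
    """
    # Drop exact duplicates (keeping order) and contigs contained in another.
    pool = list(dict.fromkeys(contigs))
    pool = [c for c in pool if not any(c != o and c in o for o in pool)]

    while True:
        best: Optional[Tuple[int, int, int]] = None  # (overlap, left idx, right idx)
        for i, left in enumerate(pool):
            for j, right in enumerate(pool):
                if i == j:
                    continue
                size = suffix_prefix_overlap(left, right, min_overlap)
                if size and (best is None or size > best[0]):
                    best = (size, i, j)
        if best is None:
            return pool  # nothing left to merge
        size, i, j = best
        merged = pool[i] + pool[j][size:]
        pool = [c for k, c in enumerate(pool) if k not in (i, j)] + [merged]


if __name__ == "__main__":
    # Hypothetical contig sequences, e.g. from two different enzymatic digestions.
    print(merge_contigs(["PEPTIDEK", "IDEKLVNE", "VNEQWERT"]))
```

Separate enzymatic digestions matter in this picture because they cut the protein at different sites, so contigs from one digest can span the boundaries between contigs from another and supply the overlaps needed for merging.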

Year:  2012        PMID: 22798278      PMCID: PMC3494147          DOI: 10.1074/mcp.M111.015768

Source DB:  PubMed          Journal:  Mol Cell Proteomics        ISSN: 1535-9476            Impact factor:   5.911


