Literature DB >> 17189379

Improving gene annotation using peptide mass spectrometry.

Stephen Tanner1, Zhouxin Shen, Julio Ng, Liliana Florea, Roderic Guigó, Steven P Briggs, Vineet Bafna.   

Abstract

Annotation of protein-coding genes is a key goal of genome sequencing projects. In spite of tremendous recent advances in computational gene finding, comprehensive annotation remains a challenge. Peptide mass spectrometry is a powerful tool for researching the dynamic proteome and suggests an attractive approach to discover and validate protein-coding genes. We present algorithms to construct and efficiently search spectra against a genomic database, with no prior knowledge of encoded proteins. By searching a corpus of 18.5 million tandem mass spectra (MS/MS) from human proteomic samples, we validate 39,000 exons and 11,000 introns at the level of translation. We present translation-level evidence for novel or extended exons in 16 genes, confirm translation of 224 hypothetical proteins, and discover or confirm over 40 alternative splicing events. Polymorphisms are efficiently encoded in our database, allowing us to observe variant alleles for 308 coding SNPs. Finally, we demonstrate the use of mass spectrometry to improve automated gene prediction, adding 800 correct exons to our predictions using a simple rescoring strategy. Our results demonstrate that proteomic profiling should play a role in any genome sequencing project.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 17189379      PMCID: PMC1781355          DOI: 10.1101/gr.5646507

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  41 in total

1.  A genomic view of alternative splicing.

Authors:  Barmak Modrek; Christopher Lee
Journal:  Nat Genet       Date:  2002-01       Impact factor: 38.330

2.  Error tolerant searching of uninterpreted tandem mass spectrometry data.

Authors:  David M Creasy; John S Cottrell
Journal:  Proteomics       Date:  2002-10       Impact factor: 3.984

3.  Initial sequencing and comparative analysis of the mouse genome.

Authors:  Robert H Waterston; Kerstin Lindblad-Toh; Ewan Birney; Jane Rogers; Josep F Abril; Pankaj Agarwal; Richa Agarwala; Rachel Ainscough; Marina Alexandersson; Peter An; Stylianos E Antonarakis; John Attwood; Robert Baertsch; Jonathon Bailey; Karen Barlow; Stephan Beck; Eric Berry; Bruce Birren; Toby Bloom; Peer Bork; Marc Botcherby; Nicolas Bray; Michael R Brent; Daniel G Brown; Stephen D Brown; Carol Bult; John Burton; Jonathan Butler; Robert D Campbell; Piero Carninci; Simon Cawley; Francesca Chiaromonte; Asif T Chinwalla; Deanna M Church; Michele Clamp; Christopher Clee; Francis S Collins; Lisa L Cook; Richard R Copley; Alan Coulson; Olivier Couronne; James Cuff; Val Curwen; Tim Cutts; Mark Daly; Robert David; Joy Davies; Kimberly D Delehaunty; Justin Deri; Emmanouil T Dermitzakis; Colin Dewey; Nicholas J Dickens; Mark Diekhans; Sheila Dodge; Inna Dubchak; Diane M Dunn; Sean R Eddy; Laura Elnitski; Richard D Emes; Pallavi Eswara; Eduardo Eyras; Adam Felsenfeld; Ginger A Fewell; Paul Flicek; Karen Foley; Wayne N Frankel; Lucinda A Fulton; Robert S Fulton; Terrence S Furey; Diane Gage; Richard A Gibbs; Gustavo Glusman; Sante Gnerre; Nick Goldman; Leo Goodstadt; Darren Grafham; Tina A Graves; Eric D Green; Simon Gregory; Roderic Guigó; Mark Guyer; Ross C Hardison; David Haussler; Yoshihide Hayashizaki; LaDeana W Hillier; Angela Hinrichs; Wratko Hlavina; Timothy Holzer; Fan Hsu; Axin Hua; Tim Hubbard; Adrienne Hunt; Ian Jackson; David B Jaffe; L Steven Johnson; Matthew Jones; Thomas A Jones; Ann Joy; Michael Kamal; Elinor K Karlsson; Donna Karolchik; Arkadiusz Kasprzyk; Jun Kawai; Evan Keibler; Cristyn Kells; W James Kent; Andrew Kirby; Diana L Kolbe; Ian Korf; Raju S Kucherlapati; Edward J Kulbokas; David Kulp; Tom Landers; J P Leger; Steven Leonard; Ivica Letunic; Rosie Levine; Jia Li; Ming Li; Christine Lloyd; Susan Lucas; Bin Ma; Donna R Maglott; Elaine R Mardis; Lucy Matthews; Evan Mauceli; John H Mayer; Megan McCarthy; W Richard McCombie; Stuart McLaren; Kirsten McLay; John D McPherson; Jim Meldrim; Beverley Meredith; Jill P Mesirov; Webb Miller; Tracie L Miner; Emmanuel Mongin; Kate T Montgomery; Michael Morgan; Richard Mott; James C Mullikin; Donna M Muzny; William E Nash; Joanne O Nelson; Michael N Nhan; Robert Nicol; Zemin Ning; Chad Nusbaum; Michael J O'Connor; Yasushi Okazaki; Karen Oliver; Emma Overton-Larty; Lior Pachter; Genís Parra; Kymberlie H Pepin; Jane Peterson; Pavel Pevzner; Robert Plumb; Craig S Pohl; Alex Poliakov; Tracy C Ponce; Chris P Ponting; Simon Potter; Michael Quail; Alexandre Reymond; Bruce A Roe; Krishna M Roskin; Edward M Rubin; Alistair G Rust; Ralph Santos; Victor Sapojnikov; Brian Schultz; Jörg Schultz; Matthias S Schwartz; Scott Schwartz; Carol Scott; Steven Seaman; Steve Searle; Ted Sharpe; Andrew Sheridan; Ratna Shownkeen; Sarah Sims; Jonathan B Singer; Guy Slater; Arian Smit; Douglas R Smith; Brian Spencer; Arne Stabenau; Nicole Stange-Thomann; Charles Sugnet; Mikita Suyama; Glenn Tesler; Johanna Thompson; David Torrents; Evanne Trevaskis; John Tromp; Catherine Ucla; Abel Ureta-Vidal; Jade P Vinson; Andrew C Von Niederhausern; Claire M Wade; Melanie Wall; Ryan J Weber; Robert B Weiss; Michael C Wendl; Anthony P West; Kris Wetterstrand; Raymond Wheeler; Simon Whelan; Jamey Wierzbowski; David Willey; Sophie Williams; Richard K Wilson; Eitan Winter; Kim C Worley; Dudley Wyman; Shan Yang; Shiaw-Pyng Yang; Evgeny M Zdobnov; Michael C Zody; Eric S Lander
Journal:  Nature       Date:  2002-12-05       Impact factor: 49.962

4.  Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search.

Authors:  Andrew Keller; Alexey I Nesvizhskii; Eugene Kolker; Ruedi Aebersold
Journal:  Anal Chem       Date:  2002-10-15       Impact factor: 6.986

5.  Splicing graphs and EST assembly problem.

Authors:  Steffen Heber; Max Alekseyev; Sing-Hoi Sze; Haixu Tang; Pavel A Pevzner
Journal:  Bioinformatics       Date:  2002       Impact factor: 6.937

Review 6.  Mass spectrometry-based proteomics.

Authors:  Ruedi Aebersold; Matthias Mann
Journal:  Nature       Date:  2003-03-13       Impact factor: 49.962

Review 7.  Interpreting the protein language using proteomics.

Authors:  Ole N Jensen
Journal:  Nat Rev Mol Cell Biol       Date:  2006-06       Impact factor: 94.444

8.  Interrogating the human genome using uninterpreted mass spectrometry data.

Authors:  J S Choudhary; W P Blackstock; D M Creasy; J S Cottrell
Journal:  Proteomics       Date:  2001-05       Impact factor: 3.984

9.  Genome sequence and comparative analysis of the model rodent malaria parasite Plasmodium yoelii yoelii.

Authors:  Jane M Carlton; Samuel V Angiuoli; Bernard B Suh; Taco W Kooij; Mihaela Pertea; Joana C Silva; Maria D Ermolaeva; Jonathan E Allen; Jeremy D Selengut; Hean L Koo; Jeremy D Peterson; Mihai Pop; Daniel S Kosack; Martin F Shumway; Shelby L Bidwell; Shamira J Shallom; Susan E van Aken; Steven B Riedmuller; Tamara V Feldblyum; Jennifer K Cho; John Quackenbush; Martha Sedegah; Azadeh Shoaibi; Leda M Cummings; Laurence Florens; John R Yates; J Dale Raine; Robert E Sinden; Michael A Harris; Deirdre A Cunningham; Peter R Preiser; Lawrence W Bergman; Akhil B Vaidya; Leo H van Lin; Chris J Janse; Andrew P Waters; Hamilton O Smith; Owen R White; Steven L Salzberg; J Craig Venter; Claire M Fraser; Stephen L Hoffman; Malcolm J Gardner; Daniel J Carucci
Journal:  Nature       Date:  2002-10-03       Impact factor: 49.962

10.  Novel gene and gene model detection using a whole genome open reading frame analysis in proteomics.

Authors:  Damian Fermin; Baxter B Allen; Thomas W Blackwell; Rajasree Menon; Marcin Adamski; Yin Xu; Peter Ulintz; Gilbert S Omenn; David J States
Journal:  Genome Biol       Date:  2006-04-28       Impact factor: 13.583

View more
  86 in total

1.  Computational analysis of unassigned high-quality MS/MS spectra in proteomic data sets.

Authors:  Kang Ning; Damian Fermin; Alexey I Nesvizhskii
Journal:  Proteomics       Date:  2010-07       Impact factor: 3.984

2.  Augmented annotation of the Schizosaccharomyces pombe genome reveals additional genes required for growth and viability.

Authors:  Danny A Bitton; Valerie Wood; Paul J Scutt; Agnes Grallert; Tim Yates; Duncan L Smith; Iain M Hagan; Crispin J Miller
Journal:  Genetics       Date:  2011-01-26       Impact factor: 4.562

3.  Fast multi-blind modification search through tandem mass spectrometry.

Authors:  Seungjin Na; Nuno Bandeira; Eunok Paek
Journal:  Mol Cell Proteomics       Date:  2011-12-20       Impact factor: 5.911

4.  Template proteogenomics: sequencing whole proteins using an imperfect database.

Authors:  Natalie E Castellana; Victoria Pham; David Arnott; Jennie R Lill; Vineet Bafna
Journal:  Mol Cell Proteomics       Date:  2010-02-17       Impact factor: 5.911

Review 5.  Generating and navigating proteome maps using mass spectrometry.

Authors:  Christian H Ahrens; Erich Brunner; Ermir Qeli; Konrad Basler; Ruedi Aebersold
Journal:  Nat Rev Mol Cell Biol       Date:  2010-10-14       Impact factor: 94.444

6.  A proteogenomic survey of the Medicago truncatula genome.

Authors:  Jeremy D Volkening; Derek J Bailey; Christopher M Rose; Paul A Grimsrud; Maegen Howes-Podoll; Muthusubramanian Venkateshwaran; Michael S Westphall; Jean-Michel Ané; Joshua J Coon; Michael R Sussman
Journal:  Mol Cell Proteomics       Date:  2012-07-05       Impact factor: 5.911

7.  De novo peptide sequencing and identification with precision mass spectrometry.

Authors:  Ari M Frank; Mikhail M Savitski; Michael L Nielsen; Roman A Zubarev; Pavel A Pevzner
Journal:  J Proteome Res       Date:  2007-01       Impact factor: 4.466

8.  A ranking-based scoring function for peptide-spectrum matches.

Authors:  Ari M Frank
Journal:  J Proteome Res       Date:  2009-05       Impact factor: 4.466

9.  Ortho-proteogenomics: multiple proteomes investigation through orthology and a new MS-based protocol.

Authors:  Sébastien Gallien; Emmanuel Perrodou; Christine Carapito; Caroline Deshayes; Jean-Marc Reyrat; Alain Van Dorsselaer; Olivier Poch; Christine Schaeffer; Odile Lecompte
Journal:  Genome Res       Date:  2008-10-27       Impact factor: 9.043

10.  The PeptideAtlas Project.

Authors:  Eric W Deutsch
Journal:  Methods Mol Biol       Date:  2010
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.