Tiered Human Integrated Sequence Search Databases for Shotgun Proteomics.

Eric W Deutsch, Zhi Sun, David S Campbell, Pierre-Alain Binz, Terry Farrah, David Shteynberg, Luis Mendoza, Gilbert S Omenn, Robert L Moritz.

Abstract

The results of shotgun proteomics mass spectrometry data analysis can be greatly affected by the choice of reference protein sequence database against which the spectra are matched. For many species there are multiple sources from which somewhat different sequence sets can be obtained. This can lead to confusion about which database is best in which circumstances, a problem that is especially acute in human sample analysis. All sequence databases are genome-based, compiling sequences for the predicted genes and their protein translation products. Our goal is to create a set of primary sequence databases that comprise the union of sequences from many of the different available sources and to make the result easily available to the community. We have compiled a set of four sequence databases of varying sizes, from a small database consisting of only the ∼20,000 primary isoforms plus contaminants to a very large database that includes almost all nonredundant protein sequences from several sources. This set of tiered, increasingly complete human protein sequence databases, suitable for mass spectrometry proteomics sequence database searching, is called the Tiered Human Integrated Search Proteome set. To evaluate the utility of these databases, we analyzed two different data sets, one from the HeLa cell line and the other from normal human liver tissue, with each of the four tiers of database complexity. Approximately 0.8%, 1.1%, and 1.5% additional peptides can be identified with Tiers 2, 3, and 4, respectively, as compared with the Tier 1 database, at substantially increasing computational cost. This increase in computational cost may be worth bearing if identifying sequence variants, or discovering sequences that are not present in the reviewed knowledge base entries, is an important goal of the study. We find that it is useful to search a data set against a simpler database and then check the uniqueness of the discovered peptides against a more complex database. We have set up an automated system that downloads all the source databases on the first of each month, generates a new set of search databases, and makes them available for download at http://www.peptideatlas.org/thisp/.
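The "search small, then check uniqueness against large" strategy described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the function names (`parse_fasta`, `map_peptides`, `uniqueness_report`), the toy accessions, and the toy sequences are all hypothetical, and treating isoleucine and leucine as equivalent is an assumption made here because the two residues have the same mass.

```python
from typing import Dict, Iterable, List, Tuple


def parse_fasta(text: str) -> Dict[str, str]:
    """Parse FASTA-formatted text into {accession: sequence}."""
    records: Dict[str, List[str]] = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first whitespace-delimited token as the accession.
            current = line[1:].split()[0]
            records[current] = []
        elif current is not None:
            records[current].append(line)
    return {acc: "".join(parts) for acc, parts in records.items()}


def _normalize(seq: str) -> str:
    # Assumption: I and L are treated as indistinguishable (same mass).
    return seq.upper().replace("I", "L")


def map_peptides(peptides: Iterable[str],
                 proteins: Dict[str, str]) -> Dict[str, List[str]]:
    """Return {peptide: sorted accessions whose sequence contains it}."""
    normalized = {acc: _normalize(seq) for acc, seq in proteins.items()}
    hits: Dict[str, List[str]] = {}
    for pep in peptides:
        key = _normalize(pep)
        hits[pep] = sorted(acc for acc, seq in normalized.items() if key in seq)
    return hits


def uniqueness_report(peptides: Iterable[str],
                      small_db: Dict[str, str],
                      large_db: Dict[str, str]
                      ) -> Dict[str, Tuple[List[str], List[str], bool]]:
    """For each peptide, list its matches in both databases and flag
    whether it still maps to exactly one entry in the larger database."""
    peptides = list(peptides)
    small_hits = map_peptides(peptides, small_db)
    large_hits = map_peptides(peptides, large_db)
    return {
        pep: (small_hits[pep], large_hits[pep], len(large_hits[pep]) == 1)
        for pep in peptides
    }


# Toy data (hypothetical accessions and sequences, for illustration only).
SMALL_FASTA = ">P1 primary isoform\nMKTAYIAKQR\n>P2 primary isoform\nGGSLEEPRQWERTK\n"
LARGE_FASTA = SMALL_FASTA + ">X1 extra entry from another source\nAAATAYLAKCC\n"

small_db = parse_fasta(SMALL_FASTA)
large_db = parse_fasta(LARGE_FASTA)
report = uniqueness_report(["TAYIAK", "QWERTK"], small_db, large_db)
```

In this toy case, `TAYIAK` maps to a single entry in the small database but to two entries in the large one, so it would no longer count as uniquely mapping; `QWERTK` stays unique in both. Plain substring matching is adequate for a sketch of this size; a real tier with hundreds of thousands of entries would likely need an indexed lookup.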

Keywords:  human; search databases; shotgun mass spectrometry

Year:  2016        PMID: 27577934      PMCID: PMC5096980          DOI: 10.1021/acs.jproteome.6b00445

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


