Literature DB >> 23627461

MScDB: a mass spectrometry-centric protein sequence database for proteomics.

Harald Marx1, Simone Lemeer, Susan Klaeger, Thomas Rattei, Bernhard Kuster.   

Abstract

Protein sequence databases are indispensable tools for life science research including mass spectrometry (MS)-based proteomics. In current database construction processes, sequence similarity clustering is used to reduce redundancies in the source data. Albeit powerful, it ignores the peptide-centric nature of proteomic data and the fact that MS is able to distinguish similar sequences. Therefore, we introduce an approach that structures the protein sequence space at the peptide level using theoretical and empirical information from large-scale proteomic data to generate a mass spectrometry-centric protein sequence database (MScDB). The core modules of MScDB are an in-silico proteolytic digest and a peptide-centric clustering algorithm that groups protein sequences that are indistinguishable by mass spectrometry. Analysis of various MScDB uses cases against five complex human proteomes, resulting in 69 peptide identifications not present in UniProtKB as well as 79 putative single amino acid polymorphisms. MScDB retains ~99% of the identifications in comparison to common databases despite a 3-48% increase in the theoretical peptide search space (but comparable protein sequence space). In addition, MScDB enables cross-species applications such as human/mouse graft models, and our results suggest that the uncertainty in protein assignments to one species can be smaller than 20%.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23627461     DOI: 10.1021/pr400215r

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


  5 in total

1.  A proteomic atlas of the legume Medicago truncatula and its nitrogen-fixing endosymbiont Sinorhizobium meliloti.

Authors:  Harald Marx; Catherine E Minogue; Dhileepkumar Jayaraman; Alicia L Richards; Nicholas W Kwiecien; Alireza F Siahpirani; Shanmugam Rajasekar; Junko Maeda; Kevin Garcia; Angel R Del Valle-Echevarria; Jeremy D Volkening; Michael S Westphall; Sushmita Roy; Michael R Sussman; Jean-Michel Ané; Joshua J Coon
Journal:  Nat Biotechnol       Date:  2016-10-17       Impact factor: 54.908

2.  Tiered Human Integrated Sequence Search Databases for Shotgun Proteomics.

Authors:  Eric W Deutsch; Zhi Sun; David S Campbell; Pierre-Alain Binz; Terry Farrah; David Shteynberg; Luis Mendoza; Gilbert S Omenn; Robert L Moritz
Journal:  J Proteome Res       Date:  2016-09-12       Impact factor: 4.466

3.  An integrative strategy to identify the entire protein coding potential of prokaryotic genomes by proteogenomics.

Authors:  Ulrich Omasits; Adithi R Varadarajan; Michael Schmid; Sandra Goetze; Damianos Melidis; Marc Bourqui; Olga Nikolayeva; Maxime Québatte; Andrea Patrignani; Christoph Dehio; Juerg E Frey; Mark D Robinson; Bernd Wollscheid; Christian H Ahrens
Journal:  Genome Res       Date:  2017-11-15       Impact factor: 9.043

4.  Challenges in Clinical Metaproteomics Highlighted by the Analysis of Acute Leukemia Patients with Gut Colonization by Multidrug-Resistant Enterobacteriaceae.

Authors:  Julia Rechenberger; Patroklos Samaras; Anna Jarzab; Juergen Behr; Martin Frejno; Ana Djukovic; Jaime Sanz; Eva M González-Barberá; Miguel Salavert; Jose Luis López-Hontangas; Karina B Xavier; Laurent Debrauwer; Jean-Marc Rolain; Miguel Sanz; Marc Garcia-Garcera; Mathias Wilhelm; Carles Ubeda; Bernhard Kuster
Journal:  Proteomes       Date:  2019-01-08

Review 5.  A Bioinformatics Guide to Plant Microbiome Analysis.

Authors:  Rares Lucaciu; Claus Pelikan; Samuel M Gerner; Christos Zioutis; Stephan Köstlbacher; Harald Marx; Craig W Herbold; Hannes Schmidt; Thomas Rattei
Journal:  Front Plant Sci       Date:  2019-10-23       Impact factor: 5.753

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.