Literature DB >> 36269744

Characterization of peptide-protein relationships in protein ambiguity groups via bipartite graphs.

Karin Schork1,2,3, Michael Turewicz1,2, Julian Uszkoreit1,2, Jörg Rahnenführer3, Martin Eisenacher1,2.   

Abstract

In bottom-up proteomics, proteins are enzymatically digested into peptides before measurement with mass spectrometry. The relationship between proteins and their corresponding peptides can be represented by bipartite graphs. We conduct a comprehensive analysis of bipartite graphs using quantified peptides from measured data sets as well as theoretical peptides from an in silico digestion of the corresponding complete taxonomic protein sequence databases. The aim of this study is to characterize and structure the different types of graphs that occur and to compare them between data sets. We observed a large influence of the accepted minimum peptide length during in silico digestion. When changing from theoretical peptides to measured ones, the graph structures are subject to two opposite effects. On the one hand, the graphs based on measured peptides are on average smaller and less complex compared to graphs using theoretical peptides. On the other hand, the proportion of protein nodes without unique peptides, which are a complicated case for protein inference and quantification, is considerably larger for measured data. Additionally, the proportion of graphs containing at least one protein node without unique peptides rises when going from database to quantitative level. The fraction of shared peptides and proteins without unique peptides as well as the complexity and size of the graphs highly depends on the data set and organism. Large differences between the structures of bipartite peptide-protein graphs have been observed between database and quantitative level as well as between analyzed species. In the analyzed measured data sets, the proportion of protein nodes without unique peptides ranged from 6.4% to 55.0%. This highlights the need for novel methods that can quantify proteins without unique peptides. The knowledge about the structure of the bipartite peptide-protein graphs gained in this study will be useful for the development of such algorithms.

Entities:  

Year:  2022        PMID: 36269744      PMCID: PMC9586388          DOI: 10.1371/journal.pone.0276401

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.752


  29 in total

1.  Probability-based protein identification by searching sequence databases using mass spectrometry data.

Authors:  D N Perkins; D J Pappin; D M Creasy; J S Cottrell
Journal:  Electrophoresis       Date:  1999-12       Impact factor: 3.535

Review 2.  Protein inference: a review.

Authors:  Ting Huang; Jingjing Wang; Weichuan Yu; Zengyou He
Journal:  Brief Bioinform       Date:  2012-02-28       Impact factor: 11.622

3.  Protein and gene model inference based on statistical modeling in k-partite graphs.

Authors:  Sarah Gerster; Ermir Qeli; Christian H Ahrens; Peter Bühlmann
Journal:  Proc Natl Acad Sci U S A       Date:  2010-06-18       Impact factor: 11.205

4.  MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification.

Authors:  Jürgen Cox; Matthias Mann
Journal:  Nat Biotechnol       Date:  2008-11-30       Impact factor: 54.908

5.  PIA: An Intuitive Protein Inference Engine with a Web-Based User Interface.

Authors:  Julian Uszkoreit; Alexandra Maerkens; Yasset Perez-Riverol; Helmut E Meyer; Katrin Marcus; Christian Stephan; Oliver Kohlbacher; Martin Eisenacher
Journal:  J Proteome Res       Date:  2015-06-10       Impact factor: 4.466

Review 6.  Mass Spectrometry Applied to Bottom-Up Proteomics: Entering the High-Throughput Era for Hypothesis Testing.

Authors:  Ludovic C Gillet; Alexander Leitner; Ruedi Aebersold
Journal:  Annu Rev Anal Chem (Palo Alto Calif)       Date:  2016-03-30       Impact factor: 10.745

Review 7.  Thousand and one ways to quantify and compare protein abundances in label-free bottom-up proteomics.

Authors:  Mélisande Blein-Nicolas; Michel Zivy
Journal:  Biochim Biophys Acta       Date:  2016-03-03

8.  limma powers differential expression analyses for RNA-sequencing and microarray studies.

Authors:  Matthew E Ritchie; Belinda Phipson; Di Wu; Yifang Hu; Charity W Law; Wei Shi; Gordon K Smyth
Journal:  Nucleic Acids Res       Date:  2015-01-20       Impact factor: 16.971

9.  EPIFANY: A Method for Efficient High-Confidence Protein Inference.

Authors:  Julianus Pfeuffer; Timo Sachsenberg; Tjeerd M H Dijkstra; Oliver Serang; Knut Reinert; Oliver Kohlbacher
Journal:  J Proteome Res       Date:  2020-02-13       Impact factor: 4.466

Review 10.  Bipartite graphs in systems biology and medicine: a survey of methods and applications.

Authors:  Georgios A Pavlopoulos; Panagiota I Kontou; Athanasia Pavlopoulou; Costas Bouyioukos; Evripides Markou; Pantelis G Bagos
Journal:  Gigascience       Date:  2018-04-01       Impact factor: 6.524

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.