Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A linear programming model for protein inference problem in shotgun proteomics.

Literature DB >> 22954624

A linear programming model for protein inference problem in shotgun proteomics.

Abstract

MOTIVATION: Assembling peptides identified from tandem mass spectra into a list of proteins, referred to as protein inference, is an important issue in shotgun proteomics. The objective of protein inference is to find a subset of proteins that are truly present in the sample. Although many methods have been proposed for protein inference, several issues such as peptide degeneracy still remain unsolved.
RESULTS: In this article, we present a linear programming model for protein inference. In this model, we use a transformation of the joint probability that each peptide/protein pair is present in the sample as the variable. Then, both the peptide probability and protein probability can be expressed as a formula in terms of the linear combination of these variables. Based on this simple fact, the protein inference problem is formulated as an optimization problem: minimize the number of proteins with non-zero probabilities under the constraint that the difference between the calculated peptide probability and the peptide probability generated from peptide identification algorithms should be less than some threshold. This model addresses the peptide degeneracy issue by forcing some joint probability variables involving degenerate peptides to be zero in a rigorous manner. The corresponding inference algorithm is named as ProteinLP. We test the performance of ProteinLP on six datasets. Experimental results show that our method is competitive with the state-of-the-art protein inference algorithms. AVAILABILITY: The source code of our algorithm is available at: https://sourceforge.net/projects/prolp/. CONTACT: zyhe@dlut.edu.cn. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Online.

Mesh：

Substances：
Peptides
Proteins

Year: 2012 PMID： 22954624 DOI： 10.1093/bioinformatics/bts540

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

Keyword Cloud
Cited

4 in total

1. Concerning the accuracy of Fido and parameter choice.

Authors: Oliver Serang
Journal: Bioinformatics Date: 2012-11-28 Impact factor: 6.937

2. An analysis of proteogenomics and how and when transcriptome-informed reduction of protein databases can enhance eukaryotic proteomics.

Authors: Laura Fancello; Thomas Burger
Journal: Genome Biol Date: 2022-06-20 Impact factor: 17.906

3. PGCA: An algorithm to link protein groups created from MS/MS data.

Authors: David Kepplinger; Mandeep Takhar; Mayu Sasaki; Zsuzsanna Hollander; Derek Smith; Bruce McManus; W Robert McMaster; Raymond T Ng; Gabriela V Cohen Freue
Journal: PLoS One Date: 2017-05-31 Impact factor: 3.240

4. DeepPep: Deep proteome inference from peptide profiles.

Authors: Minseung Kim; Ameen Eetemadi; Ilias Tagkopoulos
Journal: PLoS Comput Biol Date: 2017-09-05 Impact factor: 4.475

4 in total